Preface 


This book is intended as a replacement for Galactic Astronomy by Mihalas 
& Binney, and a companion to Galactic Dynamics by Binney & Tremaine, 
which we henceforth refer to as BT. 

Nearly 19 years of rapid scientific progress have elapsed since Mihalas & 
Binney went to press. In these years, there have been fundamental changes 
in the way that we approach the study of galaxies. Spectacular advances in 
telescope and detector technology, and in the scale of space observatories, 
have enormously enriched the quality of the data that are available, partic- 
ularly for external galaxies. Although some details of galactic structure are 
still best studied using our uniquely close-up view of the Milky Way, the 
overall picture can be seen most clearly from the perspective afforded by 
observations of other galaxies. The emphasis in studies of galactic structure 
has therefore shifted toward exploiting the wealth of extragalactic observa- 
tions that is now available, and the layout of this book has been altered from 
that of Mihalas & Binney to reflect this change. 

It remains important to understand the historical development of the 
subject, and essential to have a good grasp of fundamental astronomical con- 
cepts, so the first three chapters still cover the history of galactic astronomy, 
astronomical measurements and the phenomenology of stars. We then turn 
directly to the morphology of external galaxies: Chapter 4 gives a broad 
perspective on the different types of galaxies that populate the Universe. 
Chapter 5 reviews the theory of stellar structure and nucleosynthesis insofar 
as it bears on the ages and evolutionary histories of galaxies. The simplest 
applications of this theory are to star clusters, which form the subject of 
Chapter 6. Chapter 7 covers the sometimes tortuous lines of reasoning that 
are used to deduce distances in the Universe, and hence estimate such basic 
parameters as the sizes of galaxies and the distances between them. Chap- 
ters 8 and 9 are concerned with the structure and dynamics of the interstellar 
medium, in external galaxies and in the Milky Way, respectively. The inter- 
stellar medium has provided a wealth of information on the dynamics and 
the chemical evolution of galaxies. It also determines the rate at which stars 
form, which, in turn, profoundly influences the optical appearance of galax- 
ies. Our study of the Milky Way gets under way in earnest with Chapter 9 
and continues into Chapter 10, which complements the interstellar studies of 
Chapter 9 with a coherent picture of the distribution of the Galaxy’s stars. 
The study of stars near the Sun is particularly rewarding because we can 
analyze their motions and chemical compositions in great detail. A wealth 
of information about the dynamics and history of the Galaxy can be gleaned 
from such analyses. Chapter 11 completes the book by covering our knowl- 
edge of the motions of stars within external galaxies. 


In a subject area as broad as galactic astronomy, we have inevitably had 
to compromise on the material that we have included in the book. Moreover, 
it is impossible to be familiar with every relevant publication, and we have 
surely based some discussions on older or less satisfactory papers than we 
might have done. We apologize to the colleagues we have unfairly neglected, 
and trust that they will forgive us for not chasing the unachievable goal of a 
truly comprehensive work. 

Throughout, we have tried to present basic observational data, and let 
readers decide for themselves how firmly they believe the conclusions that 
have been drawn from them. Since one cannot avoid drawing on a good 
deal of theory for the interpretation of observational data, we review any 
theoretical details that are likely to be unfamiliar to a student who has a 
solid background in undergraduate physics. We have, however, sought to 
ensure that theory and observation are kept clearly separated, to enable 
students to distinguish raw facts from their model-dependent interpretation. 

In a few places, we have borrowed text directly from BT; the intention 
is to drop borrowed material from any revised edition of that work. At 
many points, we refer the reader to BT for the discussion of more theoretical 
topics such as the derivation of a dynamical result, or the explanation for a 
dynamical process. As far as possible, the notation in this book is the same 
as that in BT. 

For more than thirty years now, most undergraduate physics courses 
have employed SI units. Feeling that it is time that the astronomical litera- 
ture reflected this development, we decided — with some trepidation — to use 
SI units. Our hope is that the present generation of graduate students will 
finally start to publish in SI units, and, by doing so, make life less perplexing 
for generations to come. 

JJB wishes to thank the Director and staff of the Mount Stromlo and 
Siding Spring Observatories for their warm hospitality during an important 
phase in the writing of this book. We thank D. Hartmann, H. Morrison 
and M. Strauss for carefully commenting on some draft chapters and R. van 
der Marel for a superb list of corrigenda. Finally, we thank the numerous 
colleagues who have shared with us their thoughts, ideas, data, and figures. 
Many of our 330 figure captions mention a specific case of such debt. Al- 
though we shoulder full responsibility for any shortcomings and inaccuracies 
in the book, any credit must be widely shared across the entire astronom- 
ical community. We hope that this text represents a fitting tribute to the 
progress that the community has made in the study of galaxies, and that it 
will help in taking stock of the many important tasks that still lie ahead. 


1999 September 29 James Binney 
Michael Merrifield 
Galaxies: an overview 


1.1 Introduction 


The Sun is located towards the outskirts of the Milky Way, a gravitation- 
ally-bound collection of stars, or galaxy, similar to countless other such 
systems.¹ These systems, in turn, are arranged into bound clusters and still 
larger structures, but it is the galaxies themselves that are usually consid- 
ered to be the fundamental building blocks of the Universe. Part of the 
reason that galaxies occupy this pride of place is historic: until well into the 
twentieth century, it was by no means clear that any objects existed beyond 
the confines of the Milky Way, and so no larger structures could be stud- 
ied. The huge contrast in brightness between galaxies and their surroundings 
also picks them out as basic constituents of the Universe, but it should be 
borne in mind that this distinction also represents a human bias: if our eyes 
were tuned to the X-ray part of the spectrum rather than optical light, then 
clusters of galaxies would stand out as the most impressive individual struc- 
tures. Perhaps the best explanation for the enduring appeal of galaxies to 
astronomers is even more strongly anthropocentric: with their rich variety of 
shapes and intricate spiral patterns, they provide the most visually stunning 
phenomena in the night sky. 

¹ The Milky Way is often referred to simply as “the Galaxy.” We therefore use the 
adjective “Galactic” to mean “belonging to the Milky Way,” whereas “galactic” refers to 
galaxies in general. 

Our goal in this book is to present the fullest description possible of these 
beautiful objects. To this end, we will discuss the current understanding of 
the individual elements, such as stars and gas, that make up galaxies, and 
the way in which these elements are arranged to form complete systems. 
Generally speaking, small-scale phenomena are best observed within our own 
galaxy, the Milky Way, where they are sufficiently nearby to be seen clearly. 
Large-scale galactic structures, on the other hand, are often best observed 
in external galaxies where we have a clear perspective on the whole system; 
when we try to study the large-scale properties of the Milky Way, we have 
considerable difficulty in seeing the wood for the trees. Analyses of the 
properties of the Milky Way and external systems are thus complementary 
in our quest for a complete description of the properties of galaxies, and so 
in this text we draw together these disparate strands and attempt to weave 
a coherent picture from them. 

As will become clear through the course of the book, our picture of 
galaxies and their structure is still far from complete. By its very nature, 
astronomy is an observational science: we cannot tune the physical parame- 
ters of a galaxy to see how they alter its appearance; nor can we change our 
vantage point to get a better idea of its three-dimensional shape. Instead, we 
must synthesize all the fragmentary data that are observationally accessible 
in order to make the best sense that we can of our limited information. One 
key tool in this synthesis is the cosmological principle, which expresses 
our belief that the laws of physics are the same throughout the Universe. 
By applying the laws of physics as we know them locally to objects on the 
scales of galaxies, we seek to make sense of the observed properties of these 
systems. 

The remainder of this chapter is devoted to a brief description of the 
historical development of galactic astronomy. It is not intended to provide 
a comprehensive history of the subject, but rather it seeks to show how 
the various ideas that make up our understanding of galactic astronomy 
developed, and to place the material in subsequent chapters into some sort 
of context. 


1.2 A brief history of galactic astronomy 


On a moonless summer night away from city lights, a swathe of light can be 
seen stretching across the sky from horizon to horizon. This dramatic sight 
has intrigued people since ancient times, and has been the subject of many 
myths and legends. Its appearance as a stream of diffuse white light led the 
ancient Greeks to describe it as a river of milk flowing from the breast of 
Hera, wife of Zeus — the very word “galaxy” comes from the Greek word for 
milk. The Romans, too, saw this path across the sky as a Via Lactea, or 
Milky Way. It was only in 1610 when Galileo first turned his telescope on 
the Milky Way that it was discovered that this band is not made up from a 
luminous “celestial fluid,” but rather consists of huge numbers of faint stars 
which could not be resolved with the naked eye. Thus, the Milky Way was 
discovered to be primarily a stellar system. 

The next major development in the scientific study of the Milky Way 
came in the mid-eighteenth century, when Immanuel Kant published his trea- 
tise, General Natural History and Theory of the Heavens. In it, he demon- 
strated how the planar structure of the Solar System arose naturally from 
the attractive force of gravity from the Sun which bound the system, and the 
ordered rotation of the planets which prevented its collapse. Drawing on the 
earlier work of Thomas Wright, Kant went on to point out that the apparent 
structure of the Milky Way could arise if this stellar system were similar in 
arrangement to the Solar System, but on a huge scale. He reasoned that the 
force of gravity acts between stars just as it does between Sun and planets, 
and so the stellar system should take on a disk-like structure if it possesses a 
systematic rotational motion to balance the inward gravitational pull. The 
central plane of this distribution is usually referred to as the Galactic plane 
or the plane for short. From our location within it, a disk-shaped stellar 
distribution would be seen as a band of stars stretching across the sky in a 
great circle, just as the Milky Way appears to us. On account of the huge 
scale of the structure, the period of its rotation would be so long that the 
motions of stars on the sky would be immeasurably small. Drawing an anal- 
ogy with comets in the solar system, Kant also pointed out that the small 
number of stars found a long way from the band of the Milky Way cannot 
share the ordered motion of the major component of the system, but must 
lie on more randomly distributed orbits. Finally, Kant suggested that the 
Milky Way might not be the only such stellar system, and that some of the 
nebulae — faint, fuzzy, approximately elliptical patches of light seen in the 
sky — might be complete island universes, similar in structure to the Milky 
Way but viewed from large distances and at a variety of angles to the line 
of sight. Given the small amount of observational evidence which lay behind 
it, the remarkable prescience of this whole line of argument is a tribute to 
Kant’s powers of reasoning. 

Towards the end of the eighteenth century, increasingly powerful tele- 
scopes led to more systematic studies of nebulae. The comet hunter Charles 
Messier compiled a catalog of 109 of the brightest nebulae in the northern 
sky, primarily so that he could distinguish between these permanent diffuse 
patches of light and the transient comets that he was seeking. This list con- 
tains the most dramatic of the nebulae that are visible from the northern 
hemisphere, and prominent celestial objects are still frequently described by 
their number in Messier’s catalog. The Great Nebula in Andromeda, for 
example, is referred to as Messier 31, usually abbreviated to M31. 

Figure 1.1 Lord Rosse’s sketch of the Whirlpool Nebula, M51, 
c. 1850. [Reproduced from Berry (1898)] 

In a far more extensive survey, William Herschel, his sister Caroline and 
his son John made a study of the entire sky as visible from both the northern 
and southern hemispheres. In the course of these observations the Herschels 
compiled a catalog of nearly 5000 nebulae. Their telescopes were also able 
to resolve the light from some of the closest nebulae — now known to be star 
clusters associated with the Milky Way — into individual stars. This discov- 
ery convinced William that many of the still-unresolved nebulae were Kant’s 
island universes which, given sufficiently good observations, would be shown 
to be made up from individual stars. However, he was also struck by the 
appearance of planetary nebulae, some of which consist of a continuous 
ring of glowing material surrounding what appears to be a single normal 
star. These systems, Herschel argued, were fundamentally different from the 
island universes, and thus the distinction between the truly gaseous nebulae 
and unresolved stellar systems was recognized. However, it was only in the 
late nineteenth century that pioneering studies of the spectra of nebulae by 
William Huggins allowed the distinction between stellar and gaseous systems 
to be quantified unambiguously. The Herschels’ original list of both gaseous 
and stellar nebulae was steadily added to over the course of the nineteenth 
century, until Dreyer (1888) produced a compilation of 7840 objects in his 
New General Catalogue. Subsequently, he supplemented this list with a further 
5086 objects forming the Index Catalogue (Dreyer 1895, Dreyer 1908). 
To this day, most reasonably bright non-stellar objects are identified by their 
numbers in these catalogs, abbreviated as the NGC or the IC, respectively. 

Figure 1.2 Map of the Milky Way derived from William Herschel’s “star gauging.” The 
bright star near the middle marks the location of the Sun. [Reproduced from Herschel 
(1785)] 

The nineteenth century saw continued improvements in observations resulting 
from advances in telescope technology. In 1845, William Parsons, 
Third Earl of Rosse, finished construction of a telescope with the then-enormous 
diameter of 72 inches (a size not surpassed until the completion 
of the 100-inch Mount Wilson telescope in 1917). With this unprecedented 
light collecting area, Lord Rosse was able to observe the faint nebulae in 
much greater detail than had previously been possible. His examination of 
the nebulae cataloged by the Herschels revealed that many of the objects fell 
into two distinct categories: some appeared as completely featureless, very 
regular elliptical distributions of light; while others were less symmetric, dis- 
playing a distinctive spiral structure (see Figure 1.1). The very appearance 
of the whirlpool-like shapes in these latter spiral nebulae added weight to 
Kant’s suggestion that these systems rotate about an axis perpendicular to 
their planes. Furthermore, Lord Rosse was able to use his powerful telescope 
to resolve individual point sources within the spiral nebulae. Although these 
objects were probably giant gaseous emitting regions rather than individual 
stars, the fact that at least some of the emission from nebulae could be re- 
solved into individual objects supported Kant’s conjecture that the nebulae 
were actually island universes made up of many distinct sources. 

At the end of the nineteenth century, the application of photography to 
astronomy revolutionized the subject. By exposing photographic plates for 
long periods, it became possible to observe much fainter objects than were 
accessible to visual observations. Moreover, photographs could record images 
of hundreds of thousands of objects on a single plate, and the brightnesses 
of the individual images could be measured much more accurately from a 
photograph than was possible through the eyepiece of a telescope. Thus, a 
new age of quantitative astronomy was born. 


1.2.1 Photometric models of the Milky Way 


In the early years of the twentieth century, the detailed structure of the 
Milky Way provided an obvious target for study using photographic tech- 
niques. Previously, only crude studies of the shape of the Milky Way had 
been possible: Herschel (1785) had attempted to determine the shape of the 
system using a technique he termed star gauging in which he laboriously 
counted the number of stars that he could observe to successive limits of ap- 
parent brightness in 683 different regions of the sky. He then assumed that 
all the stars have approximately the same intrinsic brightness; that they are 
arranged approximately uniformly through the body of the Milky Way; and 
that he could see the stars all the way to the edge of the system. On the 
basis of these assumptions, he was able to map out the distribution of stars 
in the Milky Way: he concluded that the Sun lies close to the middle of the 
stellar distribution, and that the distribution is flattened such that it extends 
approximately five times further in the plane of the Milky Way than in the 
direction perpendicular to the plane (see Figure 1.2). Since Herschel had no 
measure of the intrinsic luminosities of the stars that he observed, he was 
unable to put an absolute scale on the size of this system. 
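
The geometry behind star gauging can be illustrated with a minimal sketch. It assumes, as Herschel did, uniform stellar density and visibility to the edge of the system, and additionally that every field covers the same solid angle; the function name and the star counts are hypothetical and are not Herschel's data.

```python
# Sketch of the geometry behind "star gauging", under Herschel's assumptions
# (uniform density, stars visible all the way to the edge of the system).
# All fields are assumed to cover the same solid angle.

def relative_extent(star_counts):
    """Return the edge distance in each field relative to the first field.

    For uniform density n, a field of solid angle omega that reaches out to
    distance d contains N = n * omega * d**3 / 3 stars, so d scales as
    N**(1/3).
    """
    reference = star_counts[0]
    return [(count / reference) ** (1.0 / 3.0) for count in star_counts]


# Hypothetical counts: one field in the plane, one perpendicular to it
counts = [12500, 100]
extents = relative_extent(counts)
```

Because the inferred extent scales only as the cube root of the count, a flattening of five to one corresponds to a ratio of 125 to 1 in the number of stars counted in equal-sized fields.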

In order to refine such studies by making use of the large amounts of 
information that could be obtained from photographic plates, Jacobus Kap- 
teyn decided to study in great detail 200 selected areas distributed across 
the sky. He coordinated a large collaboration of astronomers from all over 
the world to obtain the necessary photographs, and to analyze them by mea- 
suring the number of stars of different brightnesses and their small shifts 
in apparent position (proper motions) from year to year. He also used 
photographic plates to record the spectra of the stars in order to determine 
their types and their line-of-sight velocities (from the Doppler shifts in char- 
acteristic lines in the spectra). From an analysis of the proper-motion data, 
Kapteyn was able to estimate average distances for stars at various apparent 
brightness levels, and, from an analysis of the star-count data, he inferred 
the complete three-dimensional distribution of stars in space. 

The final picture which emerged from this immense undertaking (Kap- 
teyn & van Rhijn 1920, Kapteyn 1922) is usually referred to as the Kapteyn 
Universe. In agreement with Herschel’s work, Kapteyn found that we are 
located close to the center of an approximately oblate spheroidal distribution 
of stars which extends about five times as far in the plane as perpendicular 
to it. He also demonstrated that the density of stars drops uniformly with 
distance from the center of the Milky Way. Moreover, Kapteyn was able 
to use the proper motion data to provide the first estimate of the absolute 
scale for the size of the Milky Way: he concluded that the density of stars 
dropped to half its maximum value at a radius of 800 parsecs² in the plane 
of the galaxy (and thus, from the measured flattening, the density dropped 
to half its maximum value at a distance 800/5 ≈ 150 pc from the center 
perpendicular to the plane). In the plane, the density dropped to 10% of its 
central value at a radius of 2800 pc and 1% of its central value at a radius of 
8500 pc. 

² A parsec, usually abbreviated to pc, is 3.26 light years or 3.1 × 10¹⁶ m. It is roughly 
the distance from the Sun to its nearest neighbor star. Astronomers usually use kiloparsecs 
(kpc) and megaparsecs (Mpc) to measure larger distances. 

Kapteyn’s analysis also indicated that the Sun was located slightly out 
of the plane of the Milky Way at a distance of just 650 pc from the center. 
This proximity to the center provides an uncomfortably heliocentric feature 
in the Kapteyn Universe. Less than 10 percent of all the stars in this model 
lie within 700 pc of the center of the Milky Way; since presumably we could 
have evolved on a planet orbiting any of our galaxy’s stars, it is statistically 
rather unlikely that we should find ourselves so close to the center. Kapteyn 
himself was well aware that there was an alternative explanation for the data: 
if there was an absorbing interstellar medium between the stars, then the 
light from distant stars would be dimmed by the absorbing medium. If this 
dimming were incorrectly interpreted as a distance effect, then the stars 
would be erroneously placed at excessive distances, leading to a spurious 
systematic falloff in stellar density in all directions. If this effect were strong, 
then we would appear to lie close to the center of the distribution whatever 
the true arrangement of stars. 
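
The following sketch illustrates how unrecognized absorption produces a spurious falloff in density. It relies on the standard distance modulus, m - M = 5 log10(d / 10 pc) + A, which is not introduced in this chapter, and on an assumed uniform absorption of 1 magnitude per kiloparsec; that figure and the function names are hypothetical, chosen only for illustration.

```python
import math


def inferred_distance(true_distance_pc, extinction_mag):
    """Distance assigned to a star when dimming is blamed on distance alone.

    Starts from m - M = 5*log10(d / 10 pc) + A and then (wrongly) sets A = 0
    when solving for d.
    """
    return true_distance_pc * 10 ** (extinction_mag / 5.0)


def apparent_density_ratio(true_distance_pc, k_mag_per_kpc):
    """Inferred / true stellar density for uniform absorption k (mag per kpc)."""
    d = true_distance_pc
    k = k_mag_per_kpc / 1000.0            # absorption in mag per pc
    stretch = 10 ** (k * d / 5.0)         # d_inferred / d
    # Derivative of d_inferred = d * 10**(k*d/5) with respect to d
    radial_stretch = stretch * (1.0 + d * k * math.log(10) / 5.0)
    # The same stars are spread over a shell whose volume scales as
    # d_inferred**2 * d(d_inferred) instead of d**2 * dd
    return 1.0 / (stretch ** 2 * radial_stretch)


# Hypothetical absorption of 1 mag per kpc in every direction
for d in (0, 500, 1000, 2000, 4000):
    print(d, round(inferred_distance(d, 1.0 * d / 1000), 1),
          round(apparent_density_ratio(d, 1.0), 4))
```

In this toy model the ratio equals one at the observer and declines steadily in every direction, so even a perfectly uniform distribution of stars would appear to be centered on the Sun.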

There was plenty of evidence that some regions are, indeed, obscured: 
a dark rift along the central plane of the Milky Way is plainly visible to 
the naked eye, and numerous other dark patches where there are no stars 
show up clearly on photographs. If these regions were simply local voids 
in the distribution of stars, we should at least be able to see fainter, more 
distant stars beyond them. Stellar voids could account for the total absence 
of stars in some directions, but only if we were located at the end of a system 
of long, straight tunnels through the Milky Way, which are entirely empty 
of stars. It seemed far more likely that the absence of stars results from a 
nearby cloud of obscuring material which blocks the light from all the more 
distant objects. Kapteyn therefore expended considerable effort in trying to 
determine whether these dark clouds were isolated phenomena, or whether 
a more general absorbing medium pervades the Milky Way. 

In order to assess the significance of absorption, Kapteyn sought to 
understand the physical processes that might be responsible for the obscu- 
ration. At the time, it was already well known that gas atoms can deflect 
light rays by the Rayleigh scattering process. If interstellar space were filled 
with gas, then light traveling to us from a distant star would have a high 
probability of being scattered in a random direction out of our line of sight, 
greatly diminishing the star’s brightness. Thus, Rayleigh scattering provided 
a sensible mechanism which might produce apparent absorption in the Milky 
Way. To test this hypothesis, Kapteyn noted that the Rayleigh scattering 
process is much more efficient for blue light than for red light. We would 
therefore expect the light from stars to be more efficiently dimmed in the 
blue part of the spectrum than in the red, and so more distant stars should 
appear systematically redder — an effect known as reddening. Kapteyn 
(1909) looked for this effect by comparing the apparent brightnesses of stars 
as recorded by photographic plates to those estimated visually. Since photo- 
graphic plates are more sensitive to blue light than the human eye, reddening 
of distant stars should have produced systematically greater brightnesses for 
the visual estimates than for the photographic data. Kapteyn found only 
small amounts of reddening in his data, and concluded that obscuration was 
unimportant. 
In fact, we now know that the dominant source of obscuration is absorption 
by interstellar dust rather than Rayleigh scattering. The wavelength 
dependence of dust absorption is much smaller than that of Rayleigh scat- 
tering, and so absorption by dust reddens stars much less strongly than does 
Rayleigh scattering. With this more uniform absorption across the spectrum, 
the small amount of reddening detected by Kapteyn corresponds to a much 
larger total amount of obscuration. 
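
A rough sketch shows how strongly the inferred obscuration depends on the assumed wavelength law. Rayleigh scattering varies as the inverse fourth power of wavelength; the inverse first power used here for dust is only a common approximation, and the representative wavelengths of 440 nm (photographic) and 550 nm (visual) are likewise assumptions made for illustration.

```python
def total_extinction_per_unit_reddening(power_index, blue_nm=440.0, visual_nm=550.0):
    """Visual extinction A_vis implied by one magnitude of reddening.

    Assumes extinction scales as wavelength**(-power_index), so that
    A_blue / A_vis = (visual_nm / blue_nm) ** power_index, with the
    reddening defined as E = A_blue - A_vis.
    """
    ratio = (visual_nm / blue_nm) ** power_index
    return 1.0 / (ratio - 1.0)


rayleigh = total_extinction_per_unit_reddening(4)   # Rayleigh: wavelength**-4
dust = total_extinction_per_unit_reddening(1)       # assumed dust law: wavelength**-1
underestimate = dust / rayleigh                     # factor by which A_vis is underestimated
```

Under these assumptions, a given measured reddening corresponds to several times more total absorption for dust than for Rayleigh scattering, which is why a small measured reddening did not in fact imply that obscuration was small.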

The true degree to which dust obscures our view of the Milky Way was 
only recognized when Trumpler (1930) studied a sample of open clusters — 
loose concentrations of typically a few hundred stars found close to the plane 
of the Milky Way. He estimated the distances to these clusters by measuring 
their angular extents and assuming that they all are intrinsically the same 
size. He was then able to show that stars in remote clusters are systemati- 
cally far fainter than would be predicted from their estimated distances. He 
thus demonstrated the existence of a strongly-absorbing interstellar medium 
which is responsible for this excess dimming. In fact, the amount of ab- 
sorption implied by Trumpler’s analysis is sufficient to invalidate Kapteyn’s 
analysis of star counts entirely, and so we now know that his heliocentric 
picture of the Milky Way is erroneous. 
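
Trumpler's comparison of two distance estimates can be sketched as follows. The cluster diameter, angular size and magnitudes are hypothetical values, not Trumpler's measurements, and the photometric distance again relies on the standard distance modulus.

```python
import math


def angular_size_distance_pc(diameter_pc, angular_diameter_arcmin):
    """Distance from an assumed intrinsic diameter and a measured angular size."""
    theta_rad = math.radians(angular_diameter_arcmin / 60.0)
    return diameter_pc / theta_rad        # small-angle approximation


def photometric_distance_pc(apparent_mag, absolute_mag):
    """Distance from the distance modulus, ignoring absorption."""
    return 10 ** ((apparent_mag - absolute_mag + 5.0) / 5.0)


def implied_absorption_mag(d_angular_pc, d_photometric_pc):
    """Extra dimming (in magnitudes) needed to reconcile the two estimates."""
    return 5.0 * math.log10(d_photometric_pc / d_angular_pc)


# Hypothetical cluster: 5 pc across, 8.6 arcmin on the sky, containing stars
# of absolute magnitude 0 that are observed at apparent magnitude 13
d_ang = angular_size_distance_pc(5.0, 8.6)
d_phot = photometric_distance_pc(13.0, 0.0)
absorption = implied_absorption_mag(d_ang, d_phot)
```

When the stars look fainter than the angular-size distance predicts, the photometric distance exceeds the angular one and the implied absorption is positive; a systematic growth of this excess with distance is the signature of a general absorbing medium.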

Even at the time of Kapteyn’s analysis, evidence was mounting which 
suggested that his model of the Universe was incorrect. In a classic series 
of papers (Shapley 1918b, Shapley 1918a, Shapley 1919c, Shapley 1919b, 
Shapley 1919a), Harlow Shapley presented a radically different picture of 
the Milky Way. Shapley had undertaken a detailed study of globular clus- 
ters. These approximately spherical systems, originally classified as nebulae, 
were readily resolved with telescopes into aggregates of between 10⁴ and 10⁶ 
stars. Unlike the stars of the Milky Way, globular clusters are not restricted 
to a narrow band in the sky, but are distributed throughout the sky. How- 
ever, Shapley demonstrated that this distribution is not uniform: although 
there are roughly equal numbers of clusters on either side of the plane, they 
are not distributed uniformly in longitude along the plane; instead, they 
show a marked concentration toward the great star clouds in Sagittarius, 
which also define the brightest section of the Milky Way. Shapley argued 
that the massive globular clusters must be a major structural element of the 
Milky Way, and one would expect such a major element to be distributed 
symmetrically around the center of the system. The large asymmetry in the 
distribution of globular clusters implied that we are not near the center of the 
Milky Way, in contradiction to Kapteyn’s analysis. Shapley went on to esti- 
mate the distances to the globular clusters using the apparent brightnesses 
of variable stars (with known intrinsic luminosities) and the apparent size 
and brightness of each cluster as a whole (assuming they are all intrinsically 
of comparable sizes and luminosities). On the basis of these measurements, 
he concluded that the Sun was some 15 kpc from the center of the globular 
cluster distribution, and hence, presumably, from the center of the Milky 
Way. Thus Shapley’s picture of the Milky Way differed radically from the 
Kapteyn Universe. Shapley further estimated that the whole globular cluster 
system was close to 100 kpc across, almost ten times larger than the Kapteyn 
model. 

Figure 1.3 Schematic diagram showing Kapteyn’s and Shapley’s differing views of the 
size and structure of the Milky Way. Kapteyn placed the Sun (☉) close to the center of 
the stellar system (represented as a grayscale), whereas Shapley used the distribution of 
globular clusters to conclude that we lay far from the true center of the Milky Way, but 
possibly in a local stellar density enhancement. [Panels: “Kapteyn Universe” and 
“Shapley’s Model”; scale bar: 10 kpc] 

As Figure 1.3 illustrates, there was clearly a major inconsistency between 
Shapley’s view of the Milky Way and Kapteyn’s Universe. In retrospect, we 
know that these models can only be reconciled if we allow for the effects of 
interstellar absorption. The absorbing dust in the Milky Way is strongly con- 
centrated to the plane, and so the apparent stellar brightnesses are strongly 
dimmed leading to the illusion that we lie close to the center of a relatively 
small stellar distribution. With the approximately spherical distribution of 
globular clusters, on the other hand, only the small fraction of the systems 
that lie close to the plane of the Milky Way will appear dimmed by dust 
absorption, so the overall derived distribution will not be significantly dis- 
torted. In this context, it is interesting to note that Shapley himself pointed 
out that no globular clusters are visible within about a kiloparsec of the plane 
of the Milky Way. Since he was unaware of the large amount of dimming 
which afflicts objects close to the plane, Shapley argued that the absence of 
detected globular clusters in this part of the sky arose because strong gravi- 
tational forces close to the plane of the Milky Way would have disrupted any 
clusters originally in this region. 

In order to try to reconcile his picture of the Milky Way with the more 
heliocentric Kapteyn Universe, Shapley also suggested that the stellar analysis 
had picked out a local concentration of stars which was, indeed, centered 
close to the Sun, but that the global center of the distribution was at the 
same distance as the center of the globular cluster distribution, some 15 kpc 
away. There is certainly a small element of truth in this argument, since the 
Sun does lie close to the center of a local loose cluster of stars referred to as 
Gould’s Belt. However, the ultimate reconciliation between the disparate 
views of the Milky Way had to await the recognition that the apparent stellar 
distribution is dominated by the effects of absorption. In fact, it was only 
much more recently, with the development of computer programs capable 
of numerically evaluating the effects of extinction on stellar number counts 
[such as that written by Bahcall & Soneira (1980)], that quantitative sense 
could be made of such data. 


1.2.2 The nature of the spiral nebulae 


Shapley’s radical model of a large Milky Way must have played a key role in 
the development of his ideas regarding the nature of spiral nebulae. To him, 
the suggestion that the nebulae were independent island universes similar 
to the Milky Way was quite implausible. The extent of his new Milky Way 
model expanded the scale of the known Universe so far that it was hard 
to believe that yet more remote objects could exist. Further, the spiral 
nebulae have quite small angular sizes; if they were comparable in size to 
Shapley’s enlarged Milky Way, then they would have to lie at what were 
then inconceivably large distances. 

Shapley’s ideas were by no means universally accepted, and many still 
adhered to Kapteyn’s smaller model of the Milky Way and concluded that 
spiral nebulae were other similar systems. This split in the astronomical 
community led George Ellery Hale (director of the Mount Wilson Observatory)
to suggest that the issues might be debated at the National Academy
of Sciences in Washington as part of a lecture series in memory of his father,
William Ellery Hale. Shapley himself was the obvious choice of advocate for 
the new ideas, and the role of his opponent fell to Heber Curtis, who had 
studied spiral nebulae in great detail and was convinced of their extragalactic 
nature. Their public confrontation took place at the National Academy in 
April 1920, and has since become known as the Great Debate. 

In fact, this title is something of a misnomer since the meeting did not 
take the form of a debate, but rather consisted of two short lectures in which 
each party presented his case [Hoskins (1976) gives a detailed account of the 
meeting]. Shapley’s contribution concentrated primarily on his model of the 
Milky Way, and seems to have made only passing mention of its implications
for the nature of spiral nebulae. Curtis, on the other hand, used much of his 
allotted time to discuss the evidence for and against the island universe
hypothesis, unsurprisingly concluding that the hypothesis appears valid. Since
each talk lasted only about half an hour and concentrated on different
aspects of the issue, there was no real winner in this “debate,” although the
consensus seems to have been that Curtis was the better speaker and gave a 
clearer exposition of his ideas. 

More important than the debate itself were the expanded accounts of 
their addresses, which Shapley and Curtis published the following year (Shap- 
ley 1921; Curtis 1921). These papers went into greater technical detail than 
had been possible in their face-to-face encounter, and provided the opportu- 
nity for each to rebut the other’s arguments. Curtis’ paper cast doubt on the 
very difficult measurements that Shapley had used to calculate the distances 
to nearby stars. In particular, he was unconvinced by the large values of the 
distances that Shapley attributed to the Cepheid variable stars which were 
used to calibrate the distances to similar stars in globular clusters. Curtis 
also presented an analysis in which he incorrectly assumed that the brightest 
stars seen in globular clusters were comparable to the most luminous nearby 
stars; in fact, the brightest globular cluster members are giant stars with 
much higher intrinsic luminosities, and so Curtis radically underestimated 
their distances. 

On the issue of the nature of spiral nebulae, Curtis advanced a wide 
range of arguments in support of his thesis that they were external to the 
Milky Way. In particular, he pointed out that spiral nebulae have angular 
sizes ranging from more than two degrees for the Great Andromeda Nebula, 
M31, down to a few arcseconds for the smallest nebulae which had been 
photographed at that time. If these objects are of comparable intrinsic size, 
then their distances must differ by more than a factor of a thousand. Even if 
the Andromeda Nebula were within the Milky Way, other apparently smaller 
nebulae would have to lie at huge distances well beyond the bounds of even 
Shapley’s extended Milky Way. 

He also noted that quite a number of novae (bright stellar sources
which appear briefly before fading back to obscurity) had been detected
in the direction of M31. The large number of novae in this small region of 
sky implied that they must be physically associated with M31. Moreover, 
the novae in the direction of M31 were very much fainter than those seen 
from elsewhere in the Milky Way, implying that they were located much 
further away. Invoking the inverse-square law for the variation in apparent 
brightness with distance, Curtis calculated that M31 was at a distance of 
around 100 kpc. At this distance, the angular size of M31 implied that its
linear size was around 3 kpc, comparable in size to Kapteyn’s model of the
Milky Way. With hindsight, we know that Curtis made two mistakes in his 
analysis which conveniently canceled out: by confusing Galactic novae with 
much brighter supernovae in M31, he underestimated the distance to the 
nebula by about a factor of five; however, the Kapteyn model of the Milky 
Way is also too small by a similar factor. He thus correctly (if fortuitously) 
concluded that M31 and the Milky Way are comparable systems of similar 
linear extent.

Figure 1.4 Optical image of the edge-on galaxy, NGC 891, originally discovered by
Caroline Herschel. Note the prominent dust lane in the plane of the galaxy, and the presence
of both extensive disk and central spheroidal stellar components. [DSS image from the
Palomar/National Geographic Society Sky Survey, reproduced by permission]
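
The arithmetic behind Curtis’ size estimate for M31 can be reproduced with the small-angle relation, in which the linear size is the distance multiplied by the angular size in radians. The following is only a minimal sketch: the function name is hypothetical, and the angular diameter of exactly 2 degrees is an assumption made here for illustration rather than a value taken from Curtis’ paper.

```python
import math

def linear_size_kpc(distance_kpc: float, angular_size_deg: float) -> float:
    """Small-angle approximation: size = distance * angle (angle in radians)."""
    return distance_kpc * math.radians(angular_size_deg)

# Curtis' distance of about 100 kpc, with an assumed angular diameter of 2 degrees.
curtis_size = linear_size_kpc(100.0, 2.0)
print(f"Linear size of M31 at 100 kpc: {curtis_size:.1f} kpc")

# Placing the nebula five times further away scales the inferred size by the same factor.
revised_size = linear_size_kpc(500.0, 2.0)
print(f"Linear size of M31 at 500 kpc: {revised_size:.1f} kpc")
```

With these inputs the first figure comes out between 3 and 4 kpc, in line with the value quoted for Curtis’ analysis, and the second shows how a fivefold error in distance carries straight through to the inferred size.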


The dispersion of the light from spiral nebulae into spectra provided two 
further pieces of evidence in favor of the idea that they were distinct stellar 
systems. First, the characteristic absorption lines in the spectra of many 
nebulae are the same as those seen in normal stars, and the total spectra 
resemble closely the spectrum that would be obtained from the integrated 
starlight of the Milky Way, which suggests that they are similar systems. 
Second, the lines in the spectra of both stars and galaxies are shifted rela- 
tive to their rest wavelengths. These shifts can be understood if they arise 
from the Doppler shift due to the line-of-sight motions of the emitting body. 
However, the inferred velocities for the nebulae are many times larger than 
those of the stars in the Milky Way, suggesting that the nebulae and stars 
do not form a single dynamical entity. Moreover, although the nebulae have 
large line-of-sight velocities, their positions on the sky have not changed de- 
tectably with time. Unless the nebulae all happen to be moving exactly 
away from us, which seems most unlikely, the lack of any apparent motions 
transverse to the line of sight implies that they must be at very large dis- 
tances. However, no explanation was offered as to why the vast majority 
of spiral nebulae have Doppler shifts which imply that they are receding 
from the Milky Way. This systematic effect could easily be interpreted as 
suggesting that the Milky Way is repelling the nebulae, implying that they 
must be physically associated with each other. The true significance of this 
phenomenon, the expansion of the Universe initiated by the Big Bang, would 
only be uncovered almost a decade later.

Figure 1.5 Map showing the distribution of New General Catalogue
(NGC) and Index Catalogue (IC) objects which have been
identified as spiral or elliptical nebulae. In this Aitoff projection,
the plane of the Milky Way runs horizontally through the center
of the map. Note the dearth of objects in the “zone of avoidance”
within fifteen degrees of the plane.


One last particularly elegant piece of reasoning presented by Curtis came 
from the observation that edge-on spiral nebulae often contain dark bands 
through their centers (see Figure 1.4). Curtis interpreted this band as a 
ring of absorbing material surrounding the systems. He further pointed out 
that the zone of avoidance, a region close to the plane of the Milky Way
where no spiral nebulae are observed (see Figure 1.5), could be explained
naturally if the Milky Way were encircled by a similar absorbing ring, and 
the spiral nebulae all lay beyond this ring. He thus neatly brought together 
the ideas that the Milky Way would look similar to a spiral nebula if viewed 
from the outside, and that the nebulae must lie beyond the fringes of the 
Milky Way. The only slight flaw in this reasoning was the assumption that 
the absorbing material lies in a single ring at large radii; as we now know, 
absorption arises from dust which is distributed throughout the plane of the 
Milky Way. Curtis was forced to assume that the absorbing material lay 
only at large distances since he adhered to Kapteyn’s analysis of the stellar 
distribution in the Milky Way which relied on there being no absorption of
stellar light within the system. 

For his part, Shapley produced several strong points in favor of the hy- 
pothesis that spiral nebulae are intrinsically different from the Milky Way. 
Foremost amongst these arguments was his belief that the Milky Way was a 
very large system, which would imply that the relatively small spiral nebulae 
would have to be at tremendous distances to be comparable systems. How- 
ever, he also pointed to a number of analyses which imply that the Milky 
Way has other properties that distinguish it from the spiral nebulae. The 
Milky Way has a lower surface brightness (i.e. luminosity per unit area) than
any of the spiral nebulae, and many spiral nebulae were observed to be
significantly bluer than stars in the Milky Way. These distinctions implied that
the nebulae and the Milky Way could not be intrinsically similar systems. 
With hindsight, we can put the differences down to the effects of the absorp- 
tion of which both Shapley and Curtis were unaware: since we see the Milky 
Way edge-on, stars on the far side of the Galaxy are viewed through much 
absorbing material, and so the surface brightness of the system is greatly 
reduced when compared to observations of more face-on systems; similarly, 
light from the bright blue stars which populate the spiral arms in the Milky 
Way is strongly attenuated, and so we preferentially see the more-plentiful 
nearby faint red stars, which makes the system appear redder than external 
galaxies. 

Perhaps the most telling argument reported by Shapley was the result 
of a set of measurements by Adriaan van Maanen, which compared images 
of several spiral nebulae taken over a number of years. The comparisons 
implied that the spiral structure in these systems rotates at a perceptible 
rate, with a rotational period of only around 10⁵ years. If the nebulae had
radii in excess of around 5 kpc, then their outer parts would have to be
rotating at velocities in excess of the speed of light in order to maintain this
apparent rotation rate! The absurdity of this conclusion implied that the
spiral nebulae must be very much smaller than 5 kpc, and so could not be
comparable in size to the Milky Way. The reasoning here was impeccable, 
and Curtis conceded that if the observations were confirmed then the whole 
island universe hypothesis would have to be abandoned. Van Maanen was a 
well-respected observer, and so his results carried a great deal of weight at 
the time. It was only several years later, when Lundmark re-measured van 
Maanen’s photographic plates, that it was conclusively demonstrated that 
the detection of rotational motion was spurious. We will probably never 
know where van Maanen went wrong in this set of measurements. 
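
The force of the argument drawn from van Maanen’s measurements is easy to check numerically. The sketch below, which is an illustration rather than a reconstruction of any published calculation, takes the rotation period to be 10^5 years and asks how fast a point at a radius of 5 kpc would have to move to complete one circuit in that time. The function name and the rounded unit conversions are assumptions made here.

```python
import math

KPC_IN_KM = 3.0857e16           # kilometers in one kiloparsec
YEAR_IN_S = 3.156e7             # seconds in one year (approximate)
SPEED_OF_LIGHT_KM_S = 2.998e5   # speed of light in km/s

def rim_speed_km_s(radius_kpc: float, period_yr: float) -> float:
    """Speed of a point at the given radius that completes one circuit per period."""
    circumference_km = 2.0 * math.pi * radius_kpc * KPC_IN_KM
    return circumference_km / (period_yr * YEAR_IN_S)

# Rotation period of 1e5 years and radius of 5 kpc, as in the argument above.
v = rim_speed_km_s(radius_kpc=5.0, period_yr=1.0e5)
print(f"Rim speed: {v:.3g} km/s = {v / SPEED_OF_LIGHT_KM_S:.2f} c")
```

The resulting speed is very close to that of light, which is why a radius of about 5 kpc marks the threshold in the argument.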

It is clear that both Shapley and Curtis had grasped important aspects 
of the truth as we now perceive it. In her review of their encounter, Trimble 
(1995) makes the insightful point that each party made best use of data 
that they had collected themselves: Shapley’s analysis of his globular cluster 
observations produced the fundamental change in ideas about the location 
of the Sun and the size of the Milky Way, and Curtis’ understanding of the 
true nature of the spiral nebulae came from his long study of these systems. 
Most of the confusion arose when other people’s analyses, such as Kapteyn’s
model of the Milky Way and van Maanen’s rotation measurements, were 
added to the debate. 

Within five years, Edwin Hubble had resolved the controversy as to 
the nature of spiral nebulae once and for all. Using the superior optics of 
the recently-completed 100-inch telescope at Mount Wilson, he was able to 
resolve the outer parts of two nearby spiral nebulae into swarms of very faint 
objects indistinguishable from stars. If these objects were assumed to be 
comparable to the brightest stars in the Milky Way, then the nebulae must
be at large distances. The clinching observation came in late 1923 when
Hubble established that the brightnesses of a few of these myriad stellar 
images in the Andromeda Nebula, M31, varied in the characteristic periodic 
manner of Cepheid variable stars. It had already been established that 
the intrinsic luminosities of these particular stars can be determined directly 
from their periods of variability, and so by measuring the periods of the 
Cepheids in M31, Hubble was able to estimate their luminosities. Measuring 
their apparent brightnesses then yielded a direct estimate for their distances. 
Using this method, Hubble (1922) obtained a value for the distance to M31 
of some 300 kpc. Although more recent calibrations of the luminosities of
Cepheids imply that this estimate is more than a factor of two too small, 
it nonetheless firmly demonstrated that the Andromeda Nebula is not an 
element within the Milky Way, but is a comparable stellar system in its own 
right. Thus, it was finally established that the Milky Way is but one galaxy 
amongst its peers. The spiral nebulae are thus actually spiral galaxies, 
while the more featureless elliptical nebulae are mostly elliptical galaxies. 
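
The final step of the Cepheid method, turning an intrinsic luminosity and an apparent brightness into a distance, can be sketched using the standard distance-modulus relation m - M = 5 log10(d / 10 pc). The magnitudes below are made-up illustrative numbers, not Hubble’s measurements, the function name is hypothetical, and absorption along the line of sight is ignored.

```python
def distance_pc(apparent_mag: float, absolute_mag: float) -> float:
    """Distance in parsecs from the distance modulus, neglecting extinction."""
    return 10.0 ** ((apparent_mag - absolute_mag + 5.0) / 5.0)

# Hypothetical Cepheid: the absolute magnitude would come from its period
# (assumed value), the apparent magnitude from photometry (assumed value).
d = distance_pc(apparent_mag=18.5, absolute_mag=-4.0)
print(f"Distance: {d / 1e3:.0f} kpc")

# If a recalibration makes Cepheids 1.5 magnitudes more luminous than assumed,
# the inferred distance grows by a factor of 10**(1.5 / 5), roughly two.
d_recalibrated = distance_pc(apparent_mag=18.5, absolute_mag=-5.5)
print(f"Recalibrated distance is {d_recalibrated / d:.2f} times larger")
```

The second calculation shows why a revision of the Cepheid luminosity scale translates directly into a revision of every distance derived from it.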


1.2.3 Kinematic models of the Milky Way 


At about the same time that Hubble was demonstrating that spiral nebu- 
lae are indeed separate galaxies, Bertil Lindblad was producing new insights 
into the properties of the Milky Way using an entirely different approach. 
He calculated the total mass of Kapteyn’s model by adding up the contribu- 
tions from all the stars. He then used the Doppler shifts in the lines seen in 
spectra of globular clusters to show that these objects move with velocities 
as high as 250 km s⁻¹. Such speeds are significantly higher than the escape
velocity from Kapteyn’s Universe; the relatively small total mass of this sys- 
tem results in a gravitational field that is too weak to retain the globular 
clusters as gravitationally bound members of the system. The fact that a 
large number of globular clusters are associated with the Milky Way implies 
either that the true gravitational forces are stronger than those predicted 
by the Kapteyn model, thus permanently binding the clusters to the Milky 
Way, or that globular clusters continuously form at a sufficient rate to replace 
their escaping kin. Since globular clusters each contain up to a million stars, 
there is not enough mass in the Kapteyn Universe to provide the raw mate- 
rials required to replenish the continually-escaping globular clusters. These 
dynamical arguments provided perhaps the last nail in the coffin of the Kap- 
teyn Universe, and suggested that Shapley’s model for the Milky Way with 
its larger mass (and correspondingly greater escape velocity) was closer to 
the truth. 
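
The comparison at the heart of this argument is between the observed cluster speeds and the escape velocity, which for a spherical mass M seen from a radius r outside it is sqrt(2GM/r). The sketch below is a rough illustration only: the two masses and the radius are hypothetical round numbers chosen to show both outcomes, and are not the values of the Kapteyn or Shapley models.

```python
import math

# Gravitational constant in units of kpc (km/s)^2 per solar mass.
G_KPC_KMS2_PER_MSUN = 4.30091e-6

def escape_velocity_km_s(mass_msun: float, radius_kpc: float) -> float:
    """Escape speed from radius r outside a spherical mass M: sqrt(2 G M / r)."""
    return math.sqrt(2.0 * G_KPC_KMS2_PER_MSUN * mass_msun / radius_kpc)

CLUSTER_SPEED_KM_S = 250.0  # highest globular-cluster speeds quoted in the text

# Hypothetical masses and radius, chosen only to illustrate the comparison.
for mass in (1.0e10, 1.0e11):
    v_esc = escape_velocity_km_s(mass, radius_kpc=5.0)
    bound = CLUSTER_SPEED_KM_S < v_esc
    print(f"M = {mass:.0e} Msun: v_esc = {v_esc:.0f} km/s, bound = {bound}")
```

Because the escape velocity scales only as the square root of the mass, a tenfold increase in mass is needed to raise it by a factor of about three.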

Lindblad (1927) went on to develop a more detailed kinematic model 
of our galaxy, which sought to explain its apparent structure in terms of 
the motions of its constituents under their mutual gravitational attraction. 
He proposed that the Milky Way might be divided into a number of
subsystems, each of which was symmetric about the central axis of the whole
system (which, in agreement with Shapley, he placed a considerable distance
from the Sun). Each component was further assumed to rotate about this 
symmetry axis with some characteristic speed. Echoing the ideas of Kant 
two centuries earlier, Lindblad noted that the degree to which each subsys- 
tem is flattened would depend on whether its motions were dominated by 
rotational or random motions: the most slowly rotating subsystems would 
be made up of objects on largely random orbits and hence would display 
little in the way of flattening. 

Since Shapley’s analysis had shown that the distribution of globular 
clusters in the Milky Way is approximately spherical, Lindblad proposed 
that these objects constitute a subsystem with almost no rotational motion. 
Furthermore, since almost all the stars in the solar neighborhood possess 
very little velocity relative to the Sun, the random component of these stars’ 
motions must be small, and so Lindblad concluded that they must follow 
well-ordered circular orbits and thus produce the highly-flattened disk of the 
Milky Way. By measuring the velocity of the Sun and its neighboring stars 
relative to the mean velocity of the non-rotating globular cluster population, 
Lindblad was able to demonstrate that the nearby disk stars rotate around 
the Milky Way with a velocity of between 200 and 300 km s⁻¹.

These ideas were developed into a complete theory of Galactic stellar 
kinematics by Jan Oort (Oort 1927, Oort 1928). Amongst the phenomena 
explained by this theory were the so-called high-velocity stars. These 
stars make up an asymmetric tail in the distribution of stellar velocities, 
traveling at velocities such that they lag behind the rotational motion of the 
Sun. Thus, although these stars have high velocities relative to the Sun, 
they are somewhat misnamed since they are actually moving at very low 
speeds, with their high relative velocities arising almost entirely from the 
motion of the Sun around the Milky Way. Oort pointed out that such a 
population would arise if there were a stellar component of our galaxy with 
little net rotation. The members of this population would, while in the solar 
neighborhood, have insufficient tangential velocities to maintain them on 
circular orbits against the gravitational pull of the Milky Way, and thus their 
orbits should carry them radially inward toward the center of the system. 
Since this population of stars has little net rotation, we should expect them 
to form an approximately spherical concentration toward the center of the 
Milky Way. Oort’s analysis of the remaining stars in the solar neighborhood 
showed that their kinematics were exactly what would be expected if they 
were following approximately circular orbits in a differentially rotating disk 
(i.e., one in which stars nearer the center have a faster angular rotation 
rate than those at larger radii) with the Sun located far from the disk’s 
center. Images of edge-on spiral galaxies (see Figure 1.4) reveal the presence 
of both relatively unflattened spheroidal distributions of stars at their centers 
and highly-flattened disks that extend to larger radii. Thus, Oort’s stellar 
kinematic analysis confirmed the idea that the Milky Way is indistinguishable 
in structure from these external galaxies.

A great advance in dynamical studies of the Milky Way and other galaxies
came from the discovery that the gas in these systems emits at radio
wavelengths. Karl Jansky established in 1932 that the Milky Way emits a 
broad spectrum of radio waves, but the major breakthrough came from the 
discovery that it also emits strongly in a spectral line at 21 cm. The existence
of this emission line, arising from a hyperfine transition in atomic hydrogen, 
was predicted by H.C. van de Hulst in 1944, but it was not until 1951 that 
this prediction was observationally confirmed by Ewen and Purcell at Harvard,
Christiansen in Australia, and Muller and Oort in the Netherlands.
This radio emission provided the ideal tool for studying the large-scale kine- 
matics of the Milky Way: the distinctive line enabled astronomers to measure 
line-of-sight motions of atomic hydrogen via Doppler shifts in the line’s de- 
tected wavelength. Further, radiation at such long wavelengths is entirely 
unaffected by dust, and so the absorption which limits our ability to study 
the stellar structure of the Milky Way at optical wavelengths ceases to be a 
problem. Observations of 21 cm emission revealed that atomic hydrogen is
a major component of the Milky Way, permeating its disk out to distances 
beyond twice the solar radius; it therefore provides us with a tracer of the 
properties of the disk of the Milky Way over a wide range of radii. 
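
The conversion from a measured wavelength to a line-of-sight velocity can be sketched with the non-relativistic Doppler formula, v = c (λ_obs - λ_rest) / λ_rest, which is adequate for the speeds of a few hundred kilometers per second found in the Milky Way. The function name and the observed wavelength used in the example are assumptions made here for illustration.

```python
SPEED_OF_LIGHT_KM_S = 299792.458
REST_WAVELENGTH_CM = 21.1061  # rest wavelength of the atomic-hydrogen line

def line_of_sight_velocity_km_s(observed_wavelength_cm: float) -> float:
    """Non-relativistic Doppler velocity; positive values mean recession."""
    shift = observed_wavelength_cm - REST_WAVELENGTH_CM
    return SPEED_OF_LIGHT_KM_S * shift / REST_WAVELENGTH_CM

# Hypothetical observed wavelength, slightly longer than the rest value.
v = line_of_sight_velocity_km_s(21.1202)
print(f"Line-of-sight velocity: {v:.0f} km/s")
```

A shift of only a few parts in ten thousand in wavelength therefore corresponds to a velocity typical of Galactic rotation, which is why a narrow, well-defined line is so valuable for this work.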

By combining 21 cm observations obtained from both the northern and
southern hemispheres, Oort, Kerr & Westerhout (1958) were able to produce 
the first complete map of the distribution of atomic hydrogen throughout 
much of the Galaxy (see Figure 1.6). These and subsequent observations 
showed that the gas is strongly concentrated toward the plane, that it is 
distributed fairly uniformly in azimuth, and that it travels on approximately 
circular orbits about the Galactic center (as would be expected for such a 
highly flattened component). 

More detailed examination of the gas distribution revealed that such 
a simple axisymmetric model is an oversimplification. Even on the largest 
scales, the atomic hydrogen in the Milky Way is not arranged uniformly in 
azimuth: more gas lies on one side of the system than on the other. The 
gas disk is not flat, either, but warps away from the plane of the Milky Way 
at large radii. As Figure 1.6 shows, there are also long, narrow regions in 
which the density of gas is significantly enhanced, and Oort, Kerr & West- 
erhout (1958) were quick to identify these features with the spiral structure 
found in other galaxies (see Figure 1.1). Since the arms are associated with 
density enhancements in the disk, it is not surprising that these asymmetric 
perturbations in the gravitational potential produce non-circular motions in 
the orbiting gas. The map presented in Figure 1.6 was constructed by as- 
suming that the gas follows circular orbits, and the presence of non-circular 
motions distorts its appearance. It is this phenomenon which is responsible 
for the spurious manner in which the arm features appear to converge on the 
Sun. Thus, although mapping out the Galaxy through its 21 cm emission
represented a major advance in our study of the structure of the Milky Way,
the complexity of interpreting non-circular motions meant that the picture
was still ambiguous.

Figure 1.6 The distribution of atomic hydrogen in the plane of
the Milky Way as inferred from early 21 cm observations. The Sun
is located at the point marked “⊙.” The empty cone toward the
Galactic Center and anti-center cannot be mapped out using the
kinematic technique employed here. [Reproduced from Oort, Kerr
& Westerhout (1958), by permission]

The strongest signs of non-circular gas orbits in the Milky Way come 
from close to the Galactic center, where 21 cm emission has been observed
to be Doppler shifted from its expected circular velocity by hundreds of kilo- 
meters per second. These anomalous velocities were originally interpreted 
as implying that the gas is being flung from the center of our galaxy, but it 
was subsequently realized that they could equally be interpreted as resulting 
from non-circular orbits in a strongly non-axisymmetric gravitational poten- 
tial (Peters 1975). Since at least one third of external spiral galaxies are 
observed to contain a bar-like asymmetric structure at their centers (see Fig- 
ure 1.7), we might reasonably conclude that the non-circular motions in the 
Milky Way are induced by a similar galactic bar. 

The early radio observations used single dish antennae, which gave a
spatial resolution of only a degree or so, and this limitation made it difficult
to study even the large-scale distribution of hydrogen in external galaxies.
Far greater resolution was achieved with the construction of interferometers,
which consist of a set of moderate-sized radio dishes arrayed over
several kilometers. The angular resolution of such an instrument is equivalent
to that of a single dish the size of the whole array. Data obtained from
these interferometers, such as the Westerbork Synthesis Radio Telescope in
the Netherlands and the Very Large Array in New Mexico, revealed that
other spiral galaxies also have hydrogen disks in which the gas follows
approximately circular orbits, but with the same complications due to warping
and non-circular streaming as the Milky Way. These disks could be traced
to large radii, well beyond the observed optical edges of the galaxies.

Figure 1.7 Optical image of the galaxy, NGC 3992. Note the
prominent bar in this system in addition to the spheroidal and
disk stellar components. [DSS image from the Palomar/National
Geographic Society Sky Survey, reproduced by permission]
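
The gain in resolution offered by an interferometer can be estimated from the diffraction limit, θ ≈ λ/D, where λ is the observing wavelength and D is the diameter of the dish or the extent of the array. The sketch below is a rough illustration: the 25 m dish and the 3 km array are assumed round numbers rather than the specifications of any particular telescope, and factors of order unity in the diffraction formula are ignored.

```python
import math

def resolution_deg(wavelength_m: float, aperture_m: float) -> float:
    """Diffraction-limited angular resolution, theta ~ lambda / D, in degrees."""
    return math.degrees(wavelength_m / aperture_m)

WAVELENGTH_M = 0.21  # the atomic-hydrogen line

# Assumed apertures: a 25 m single dish and an array spread over 3 km.
single_dish = resolution_deg(WAVELENGTH_M, 25.0)
array = resolution_deg(WAVELENGTH_M, 3000.0)
print(f"25 m dish: {single_dish:.2f} degrees")
print(f"3 km array: {array * 3600:.0f} arcseconds")
```

Under these assumptions the single dish resolves only structures of order half a degree, whereas the array reaches a resolution more than a hundred times finer.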

Since it is the gravitational pull of a galaxy that provides the force which 
retains the gas on its approximately circular orbits, it is straightforward to 
translate the observed orbital velocities in a gas disk into an estimate of 
the distribution of mass in the galaxy. We can compare the mass derived 
in this way to the sum of the masses of the individual components such 
as the stars in the disk and spheroid, and the hydrogen gas itself. The 
surprising result of such comparisons in the 1970s was that the total mass 
inferred dynamically exceeds the sum of all the known components, and 
that the discrepancy becomes greater as we look to larger radii in a galaxy. 
From these observations it was inferred that galaxies are embedded in huge 
dark halos, which contain considerably more mass than the visible galaxies 
themselves. This unexpected discovery was originally dubbed the “missing 
mass problem”, but this title is somewhat confusing, since we know from the 
kinematics that there is an excess of mass rather than a deficit. The real 
problem is that we do not know what form the excess mass takes, since it 
evidently is not made up of directly-visible material. Our inability to identify
the nature of the material which dominates the mass of many systems is
therefore now usually referred to as the “dark matter problem.” Its resolution 
remains one of the major goals of galactic astronomy. 
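
The translation from orbital velocity to mass can be sketched for the simplest case of a circular orbit in a spherically symmetric mass distribution, for which the mass enclosed within radius r is M(<r) = v²r/G. The numbers below are assumptions chosen for illustration: an orbital speed that stays constant at 220 km s⁻¹ out to large radii, and a hypothetical function name. A real analysis would also have to allow for the flattening of the disk.

```python
# Gravitational constant in units of kpc (km/s)^2 per solar mass.
G_KPC_KMS2_PER_MSUN = 4.30091e-6

def enclosed_mass_msun(speed_km_s: float, radius_kpc: float) -> float:
    """Mass inside radius r for a circular orbit in a spherical potential: v^2 r / G."""
    return speed_km_s ** 2 * radius_kpc / G_KPC_KMS2_PER_MSUN

# Assumed constant orbital speed, sampled at three illustrative radii.
for radius in (8.0, 16.0, 32.0):
    mass = enclosed_mass_msun(220.0, radius)
    print(f"r = {radius:4.0f} kpc: M(<r) = {mass:.2e} Msun")
```

If the orbital speed does not decline with radius, the enclosed mass must grow in proportion to r, even where the starlight has faded away. This is the sense in which the discrepancy between dynamical and visible mass increases toward larger radii.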


1.2.4 Stellar populations 


The recognition by Lindblad and Oort that spiral galaxies could be bro- 
ken down kinematically into separate spheroidal and disk components ties 
in closely with the key notion of distinctive stellar populations. Taking 
advantage of the wartime blackout in Los Angeles, Baade (1944) used the 
100-inch Mount Wilson telescope to resolve individual stars in the inner 
regions of several nearby spiral galaxies, where the spheroidal component 
dominates. He also obtained resolved stellar images in a couple of nearby 
elliptical galaxies. By analyzing the colors and brightnesses of the stellar 
images, Baade realized that the brightest stars in both elliptical galaxies and 
the spheroidal components of spiral galaxies are red giants; these stars are 
quite different in appearance from the blue supergiants which dominate 
the spiral arms in the disks of galaxies. These observations suggested to 
Baade the existence of two characteristic stellar populations: population I, 
which contains luminous blue stars, accompanied by dust and gas; and pop- 
ulation II, which is dominated by luminous red stars in an essentially gas- 
and dust-free environment. Open clusters and stellar disks comprise pop- 
ulation I material, while globular clusters, galactic spheroids and elliptical 
galaxies are made up of population II stars. 

Detailed studies of the colors and brightnesses of stars in the different 
types of star cluster confirmed the clear distinction between the two stellar 
populations. In the early part of the twentieth century, Ejnar Hertzsprung 
and Henry Norris Russell independently discovered that, in a plot of color 
versus luminosity, stars are not randomly scattered, but rather are concen- 
trated within tightly defined bands. Such a plot was termed an HR di- 
agram after its inventors, although in its modern form it is more usually 
described as a color-magnitude, or CM diagram. When CM diagrams 
were constructed for stellar clusters, the bands which are populated by the 
stars of open clusters were found to differ from those which are populated by 
globular-cluster stars. From this discovery it followed that the stellar popu- 
lations of these two types of cluster contain very different types of stars. 

The location of a star in a CM diagram depends on the rate at which it 
is generating energy by nuclear fusion in its core, and on the structure of the 
star itself. The two decades up to about 1965 saw advances in understanding 
of the energy generation processes and the development of computers capable 
of calculating detailed models of stellar structure. It thus became possible to 
predict where a star should appear on the CM diagram, and how its location 
changes with time as the star’s structure evolves. From these calculations, it 
has become clear that the difference between open and globular clusters arises 
from their ages: younger stars occupy the regions of CM diagrams populated
by open-cluster stars, whereas globular-cluster stars fall only in regions of the
CM diagram that are occupied by old stars. Thus, the distinction between
populations I and II is also one of age, with population I stars having formed
more recently than those of population II. 

Confirmation of this distinction has come from detailed spectral analysis 
of stars from the two populations. These studies allow one to determine the 
chemical compositions of stars from the strengths of characteristic spectral 
absorption features. Population II stars are found to be very deficient in all 
elements heavier than helium (or metals as they are somewhat confusingly 
termed by astronomers), whereas population I stars have rather higher metal 
abundances, comparable to the solar fraction of heavy elements (which is 
not very surprising since the Sun is itself a population I star). Some of the 
heavy elements found in stars are only produced in the supernova explosions 
which end many stars’ lives. Hence, the presence of these elements in a star 
indicates that the star must contain material synthesized by supernovae in a 
previous stellar generation. The difference between the metal abundances in 
the two stellar populations thus once again points to a difference in ages. A 
star that formed late in the history of a galaxy will be made from material 
that has had time to be processed through several earlier generations of stars, 
and will thus generally have high metal abundances; the star will therefore be 
classified as a population I object. A star that formed much earlier will have 
been produced from material whose chemical abundances lie much closer to 
the primordial composition of the Universe with only very small amounts of 
metals, and so the star will be a population II object. 

It therefore seems highly probable that the globular clusters and spheroid 
of the Milky Way, being made up of population II stars, are the parts of the 
Milky Way that formed first, with the population I stars in the disk forming 
later. This evolutionary sequence is confirmed by the fact that stars are ob- 
served to form in the disk even today, whereas no star-forming regions can be 
seen away from the Galactic plane. It is notable that even the most metal- 
poor stars that have been observed in the Galaxy contain trace amounts 
of heavy elements, which they could not have synthesized themselves. The 
idea has therefore been mooted that there might have been a still-earlier 
generation of population III stars which produced these elements, but the 
existence of such objects lacks any direct observational confirmation. 


1.2.5 More recent developments 


As should be apparent from the above discussion, much of the history of 
astronomy has been dictated by the technology available to astronomers. We 
have, for example, been able to form images of progressively fainter objects 
as larger and larger telescopes have been built. Similarly, the quantitative 
analysis of astronomical images only became practicable with the invention 
of photographic detectors which could be used to obtain a permanent record 
of the data. 

Technological advances continue to play a key role in the development of 
the subject. For example, the rapid developments of computer technology in 
the latter part of the twentieth century have allowed astronomers to handle 
large quantities of data and to automate much of the analysis of these data. 
With these techniques, it has been possible to undertake enormous surveys 
which automatically monitor the light from literally millions of stars on a 
nightly basis. The purpose of these experiments is to look for the magnifica- 
tion of light from a distant star that will occur if a massive object forms a 
temporary gravitational lens as it passes through our line of sight to the star. 
If the dark halo of the Milky Way is made up of compact massive objects, 
then their presence could be detected by this phenomenon. The probability 
of a single star being magnified in this way is very small, and so it is only by 
using computers to monitor automatically very large numbers of stars that 
we stand a good chance of detecting the effects and thus determining the 
content of the Milky Way’s dark halo. 

Observational techniques have also continued to benefit from advances 
in technology. The introduction of very efficient charge coupled devices 
(CCDs) to replace photographic plates as light detectors has revolutionized 
optical astronomy. These detectors are similar to those found in camcorders, 
except that they are cooled with liquid nitrogen to reduce the level of thermal 
noise. Since CCDs are some ten times more efficient than photographs at 
detecting light, they have enabled astronomers to study very faint phenom- 
ena such as the outermost parts of galaxies for the first time. Developments 
in detector technology have also expanded the region of the spectrum over 
which observations can be made. Of particular importance has been the de- 
velopment in the 1980s of infrared detector arrays analogous to the optical 
CCDs. These efficient digital devices have made it possible to study galaxies 
in the near infrared, where the absorbing effects of dust are small, providing 
a clearer picture of the underlying stellar light distribution. 

In addition to the improvements in light-gathering potential provided 
by the new detector technologies, advances have also been made through 
the construction of ever bigger telescopes. A number of telescopes with 
diameters of around eight meters are currently at various stages of planning 
or construction. However, an even larger telescope is already in operation: 
the Keck Telescope on Mauna Kea in Hawaii consists of 36 hexagonal mirror 
segments each 1.8 meters across, mounted together in a honeycomb pattern. 
The exact alignment of all the mirror segments is maintained by computer, 
allowing the system to act as a single telescope with a light-collecting power 
that is equivalent to that of a single mirror 10 meters in diameter. A duplicate 
of this amazing piece of technology is being built adjacent to the original, so 
that these two huge telescopes will be able to operate in concert. 

Significant advances have also been made in the quality of the images 
recorded. Motions in the earth’s atmosphere distort the light from astro- 
nomical objects, causing the familiar twinkling of stars. These distortions 
blur out any structure in objects which have angular scales of less than about 
an arcsecond. This degradation can be ameliorated by furnishing telescopes 
with moving mirrors that compensate for the motions of the atmosphere in 
order to reduce its distorting influence. Such active optics can decrease 
the amount of blurring by more than a factor of two, but still greater gains 
can be made by getting above the earth’s atmosphere entirely. The Hubble 
Space Telescope (HST) is a satellite telescope with a 2.4m mirror, which 
was designed to produce images that can resolve structure down to a scale 
of 0.1 arcseconds. Initially, a flaw in the shape of the mirror prevented this 
goal from being achieved, but the introduction of new optics to correct its 
myopia has enabled the HST to exploit fully its location above the earth’s 
atmosphere. In addition to studying small-scale structures such as the very 
cores of galaxies in unprecedented detail, the improved image quality and 
lower background light level in space have allowed the HST to image very 
faint objects; such studies have, for example, demonstrated that the dark 
matter halo of the Milky Way cannot be made up from intrinsically faint 
low-mass stars. 

Observations from space have proved important in several other areas 
of the study of galactic structure. The Hipparcos Satellite, for example, 
contained a small (29cm diameter) optical telescope dedicated to measuring 
accurate stellar positions (a field of study known as astrometry). Over its 
four year lifetime, this satellite measured the positions of more than 100 000 
stars to an accuracy of ~ 0.002 arcseconds. In addition to the absolute 
positions of the target stars on the sky, the satellite also measured the small 
periodic shift in apparent location of the stars that arises from the changing 
position of the earth as it orbits around the Sun. The magnitude of this 
effect, known as parallax, depends on the distance to the star, and so the 
Hipparcos observations provide a measure of the distances to all the target 
stars out to distances of several hundred parsecs (at which point the parallax 
becomes too small to measure). Thus, Hipparcos measured the full three- 
dimensional spatial coordinates for a large sample of relatively nearby stars. 
The satellite also recorded the shifts in the location of the stars due to 
their own orbital motions; combining these observations with the line-of- 
sight velocities inferred from Doppler shifts in spectra obtained from the 
ground, we can reconstruct the full three-dimensional velocities of the target 
stars. Thus, Hipparcos has provided us with a wealth of information on the 
spatial and kinematic properties of the Milky Way which has yet to be fully 
exploited. 

The development of astronomical satellites has also opened up parts of 
the electromagnetic spectrum that are unobservable from the ground due to 
their absorption by the earth’s atmosphere. The whole field of X-ray astron- 
omy, for example, only became possible with the advent of space flight. Solar 
observations were made from rockets as early as the 1940s, and these exper- 
iments were followed by rocket observations of the moon and other bright 
X-ray sources in the 1960s. However, X-ray astronomy only really came of 
age with the development of satellite technology, and particularly the launch 
of the Einstein Observatory in 1978. This mission and its successors have 
produced images of some of the most energetic phenomena in the Universe, 
including the ultra-luminous active galactic nuclei, believed to be pow- 
ered by super-massive black holes, which lurk at the centers of some galaxies. 
X-ray observations have also revealed an entire new component of galaxies: 
a halo of gas which is so hot that it emits at these short wavelengths. The 
discovery of this component in elliptical galaxies was particularly surpris- 
ing since, as we have discussed above, it had long been supposed that these 
systems were essentially gas-free. 

Figure 1.8 Infrared image of the Milky Way obtained by the COBE satellite. [Reproduced 
courtesy of NASA Goddard Space Flight Center and the COBE Science Working Group] 

One last example of space-based observations of importance to the study 
of galaxies comes from the infrared part of the spectrum. At the longer 
infrared wavelengths, the earth’s atmosphere is opaque and so little could 
be learned from the ground. The Infrared Astronomical Satellite (IRAS), 
launched in 1983, provided the first comprehensive survey at these wave- 
lengths. Cool dust in galaxies re-radiates the light that it absorbs at these 
long infrared wavelengths, and so this survey gave us the first comprehen- 
sive map of the distribution of dust in the Milky Way. These observations 
revealed that the far infrared emission from dust is strongly concentrated 
toward the plane of the Galaxy, which ties in with the large amount of optical 
extinction in this region. However, it also turned up the unexpected 
discovery of diffuse patches of emission (termed cirrus by analogy with the 
terrestrial thin clouds) far from the plane of the Milky Way. Dust is clearly 
more pervasive in the Galaxy than had been expected. 

A more recent study of the Milky Way at infrared wavelengths has been 
made by the Cosmic Background Explorer (COBE) satellite. Although primarily 
intended to test cosmological theories, the telescopes on this satellite 
also recorded the strong flux from both stars (at near infrared wavelengths) 
and dust (at far infrared wavelengths) in the Milky Way. At the near in- 
frared wavelengths, little light is absorbed by dust, and so the all-sky image 
produced by COBE at these wavelengths provides us with a uniquely clear 
picture of the large-scale stellar distribution in the Milky Way. The COBE 
image of the plane of the Milky Way is presented in Figure 1.8. Comparing 
this image to the picture of NGC 891 in Figure 1.4, it is strikingly apparent 
that — once the confusing effects of dust obscuration have been removed — the 
Milky Way is indistinguishable from the multitude of other galaxies which 
make up the cosmos. 


2 


Astronomical Measurements 


Our basic observations of astronomical objects yield information about their 
positions, their motions on the sky, and the distribution of the energy emitted 
in their spectra as a function of frequency and time. Essentially everything 
else we wish to know about astronomical bodies must follow from deductions 
made from analyses of these data, using the laws of physics established in 
terrestrial laboratories. It is actually rather remarkable that we know as 
much about the stars and galaxies as we do, given the severe limitation of 
having at our disposal only the photons that they casually emit. 

Our knowledge of astrophysical objects has been painstakingly acquired 
by observing them with delicate and complex equipment, usually followed by 
an elaborate set of reductions to eliminate extraneous effects — see, for exam- 
ple, Budding (1993), Sterken & Manfroid (1992), or Appenzeller, Habing & 
Léna (1989). To communicate the results of such measurements and reduc- 
tions, astronomers have developed a framework of concepts that are peculiar 
to astronomy. In this chapter we develop this framework and introduce its 
associated vocabulary. 

Figure 2.1 The celestial sphere, showing the celestial poles, celestial 
equator, meridian, zenith point, and horizon. 


2.1 Positions, motions and coordinate systems 


Positional astronomy, or astrometry, is the oldest branch of astronomy. 
Catalogs of the locations of astronomical objects on the sky have come down 
to us from antiquity. It is probably fair to say that astrometry was the cen- 
tral problem of observational astronomy through the early twentieth century. 
Later in the twentieth century, astrophysical observations attracted more in- 
terest and generated more excitement than astrometry, to the extent that 
students sometimes have the (mistaken) impression that positional astron- 
omy is old fashioned and even unimportant. Such an opinion is grievously 
wrong, not only because the data obtained by astrometric measurements lie 
at the foundation of all astrophysics, but also because in recent years the 
subject has undergone a renaissance. Technological advances that got under 
way around 1980 have spectacularly improved the accuracy, homogeneity, 
and extent of the available astrometric data. In 1997 the field was revolu- 
tionized by the publication of the Hipparcos Catalogue, which is based on 
data taken by the first astrometric satellite; the catalog contains data of un- 
precedented homogeneity and accuracy for more than 100000 nearby stars. 
These new data will over time have a profound influence on almost every 
branch of astrophysics, from stellar astronomy to cosmology. 

The issues that positional astronomy deals with are anything but trivial. 
Among them is the task of establishing, in practical terms, an inertial 
reference frame. Extensive discussions of this problem can be found in Green 
(1985), Perryman et al. (1997), Seidelmann (1992) and Murray (1983). In 
our treatment of this subject we start by studying how astronomers report 
the positions and motions of objects on the sky. 


2.1.1 The equatorial system 


Any mention of the position of an object implies that a coordinate system has 
been chosen and established observationally. The fundamental coordinate 
system for observations made from the surface of the Earth is the equatorial 
system. The Earth rotates eastward on its axis once a day, and, as a 
consequence, the sky appears to rotate westward about the Earth. The 
extension of the Earth’s axis to the celestial sphere (an imaginary sphere 
of infinite radius centered on the Earth) defines the north and south celestial 
poles (NCP and SCP), and the extension of the Earth’s equatorial plane 
determines the celestial equator (see Figure 2.1). The equator is a great 
circle; that is, a circle on the celestial sphere defined by the intersection of 
a plane passing through the sphere’s center with the surface of the sphere. 

Figure 2.2 The zenith distance of a star is the angle z, and its altitude is 
the angle a. 

Figure 2.3 The relationship between the celestial equator, the ecliptic and 
the equinoxes. The angle between the ecliptic and the equator is called 
the obliquity of the ecliptic. 

For each observer on the Earth’s surface, the direction of gravity fixes the 
direction of the local vertical; that point at which the extended vertical line 
intersects the celestial sphere is the zenith. The great circle passing through 
the celestial poles and the zenith is the meridian. The horizon is the great 
circle whose pole is the zenith (and thus lies 90° from the zenith). 

The zenith distance z of a star is its angular distance from the zenith 
(Figure 2.2). The complement of z is the star’s altitude a = 90° − z. The 
altitude of the NCP equals the observer’s latitude on the surface of the Earth. 

As the Earth revolves annually in its orbit around the Sun, the Sun 
appears to move from west to east around the celestial sphere on a path 
called the ecliptic (see Figure 2.3). The ecliptic is the great circle in which 
the Earth’s orbital plane intersects the celestial sphere. The Earth’s rotation 
axis is inclined away from the normal to its orbit by an angle of about 
23°27′; hence the ecliptic is also inclined to the celestial equator by this angle, 
which is called the obliquity of the ecliptic. The ecliptic and the celestial 
equator intersect at two points (separated by 180°) called the vernal and 
autumnal equinoxes. The Sun passes through the vernal equinox (VE) on 
approximately March 21, moving from south to north of the celestial equator. 
About six months later, it passes through the autumnal equinox traveling 
from north to south. The time interval between successive returns of the Sun 
to the VE is called the tropical year (see Figure 2.3). 

Figure 2.4 The position of the star in the equatorial system is specified 
by its right ascension $\alpha$ and its declination $\delta$. 

The great circle through the celestial poles and a star’s position is that 
star’s hour circle. The star’s hour angle is the angle around the celestial 
equator between the meridian and its hour circle. The star’s right ascension 
$\alpha$ is the arc of the celestial equator from the VE to the star’s hour 
circle. Right ascensions increase from west to east so that stars with large 
right ascensions rise later than those with small ones (see Figure 2.4). The 
declination $\delta$ is the angular distance measured from the equator along a 
star’s hour circle to the star, and it is positive northward and negative southward. 
For example, the NCP is at $\delta = +90^\circ$, the SCP is at $\delta = -90^\circ$, and 
the declination of the celestial equator is $0^\circ$. $\alpha$ and $\delta$ play the roles for the 
celestial sphere that longitude and latitude fulfill for the Earth. 

Angles on the celestial sphere can always be expressed in radians or in 
degrees, minutes, and seconds of arc. Because a star’s right ascension is very 
nearly equal to the time between the meridian transit of the star and that of 
the VE,¹ right ascensions are usually expressed in terms of hours, minutes, 
and seconds of time. The conversions from time to angular measure are 
$24^{\rm h} = 360^\circ$, $1^{\rm h} = 15^\circ$, $1^{\rm m} = 15'$, and $1^{\rm s} = 15''$. The local sidereal time 
is the right ascension of the meridian expressed in units of time. Hence a 
star’s hour angle, also expressed in units of time, is the difference between 
the local sidereal time and the star’s right ascension, and is equal to the time 
since the star crossed the meridian. 

¹ The right ascension differs from this time difference only because the Earth does not 
rotate quite uniformly (see §2.1.5). 

Directions on the sky are specified by giving the angle between the given 
direction and the direction to the NCP. This position angle is convention- 
ally measured positive from north through east. Astronomical images and 
maps are usually shown with north upwards; this orientation places east on 
the left, as one may convince oneself by facing to the south and looking up at 
the sky. Since right ascension increases from west to east, it increases from 
right to left on astronomical maps. 
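
The time-to-angle conversions and the hour-angle relation described in this section can be sketched in a few lines of Python. The function names are our own, chosen purely for illustration, and the numerical values in the example are hypothetical.

```python
def hms_to_degrees(hours, minutes=0.0, seconds=0.0):
    """Convert an angle given in hours, minutes, seconds of time to degrees.

    Uses 1h = 15 deg, 1m = 15 arcmin, 1s = 15 arcsec.
    """
    return 15.0 * (hours + minutes / 60.0 + seconds / 3600.0)


def hour_angle_hours(lst_hours, ra_hours):
    """Hour angle = local sidereal time - right ascension, wrapped to [-12, 12) hours.

    A negative value means the star has not yet reached the meridian.
    """
    return (lst_hours - ra_hours + 12.0) % 24.0 - 12.0


# Hypothetical star with right ascension 6h30m, observed at local sidereal time 8h30m.
ra_deg = hms_to_degrees(6, 30)        # right ascension in degrees
ha = hour_angle_hours(8.5, 6.5)       # sidereal hours since meridian transit
print(ra_deg, ha)
```

Wrapping the hour angle into the range from −12 to +12 hours is a design choice of this sketch; some authors prefer the range 0 to 24 hours.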


2.1.2 Galactic coordinates 


Although the equatorial system constitutes the fundamental observable sys- 
tem, it is clearly geocentric, and, as such, provides an inappropriate view- 
point for problems of Galactic structure and dynamics. It is useful, therefore, 
to set up a Galactic system of coordinates that has a direct physical con- 
nection with the structure of our galaxy. The Galactic equator is chosen 
to be the great circle that most closely approximates the plane of the Milky 
Way. This plane is inclined at an angle of 62.87° to the celestial equator. 
The north pole of the Galactic system [the north Galactic pole (NGP)] 
is located at $(\alpha_{\rm GP}, \delta_{\rm GP}) = (192.85948^\circ, 27.12825^\circ) \approx (12^{\rm h}51^{\rm m}, +27^\circ 7.7')$.² 
The Galactic latitude $b$ of a star is the angle from the Galactic equator to 
the star along the great circle through the star and the Galactic poles; for 
example, the NGP is at b = +90°, and the south Galactic pole (SGP) is at 
b = —90° (see Figure 2.5). 

² The cited values $(\alpha_{\rm GP}, \delta_{\rm GP})$ are for epoch 2000 (see §2.1.5). 

Figure 2.6 The parallax $\varpi$ of a star is the angle subtended at 1 AU. (Not 
to scale) 

Galactic longitude $l$ is measured with respect to the direction to the 
Galactic center. (The compact radio source, Sgr A*, now believed to mark 
the position of the Galactic nucleus, actually lies about 5′ away from this 
point.) Thus, the direction to the Galactic center is $l = 0^\circ$, $b = 0^\circ$, which 
corresponds to $(\alpha, \delta) = (266.405^\circ, -28.936^\circ) \approx (17^{\rm h}45.6^{\rm m}, -28^\circ 56.2')$.³ In 
texts on spherical astrometry (for example, Green 1985) it is shown that the 
Galactic coordinates $(l, b)$ of a star with equatorial coordinates $(\alpha, \delta)$ may 
be obtained by solving these equations: 


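$$
\begin{aligned}
\sin b &= \sin\delta_{\rm GP}\sin\delta + \cos\delta_{\rm GP}\cos\delta\cos(\alpha - \alpha_{\rm GP}) \\
\cos b\,\sin(l_{\rm CP} - l) &= \cos\delta\,\sin(\alpha - \alpha_{\rm GP}) \\
\cos b\,\cos(l_{\rm CP} - l) &= \cos\delta_{\rm GP}\sin\delta - \sin\delta_{\rm GP}\cos\delta\cos(\alpha - \alpha_{\rm GP}).
\end{aligned}
\tag{2.1}
$$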


Here $l_{\rm CP} = 122.932^\circ$ is the Galactic longitude of the NCP. The inverse transformation 
is given by 


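$$
\begin{aligned}
\sin\delta &= \sin\delta_{\rm GP}\sin b + \cos\delta_{\rm GP}\cos b\cos(l_{\rm CP} - l) \\
\cos\delta\,\sin(\alpha - \alpha_{\rm GP}) &= \cos b\,\sin(l_{\rm CP} - l) \\
\cos\delta\,\cos(\alpha - \alpha_{\rm GP}) &= \cos\delta_{\rm GP}\sin b - \sin\delta_{\rm GP}\cos b\cos(l_{\rm CP} - l).
\end{aligned}
\tag{2.2}
$$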
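
As a minimal sketch, equations (2.1) and (2.2) can be coded directly. The function names are ours, chosen for illustration. The pole coordinates are the epoch-2000 values quoted above, and we take $l_{\rm CP} = 122.932^\circ$, the value for which the Galactic-center coordinates quoted above map to $l = 0^\circ$. Feeding the second and third equations of each set to the two-argument arctangent fixes the quadrant of the longitude or right ascension.

```python
import math

# Epoch-2000 constants in degrees.
ALPHA_GP = 192.85948   # right ascension of the north Galactic pole
DELTA_GP = 27.12825    # declination of the north Galactic pole
L_CP = 122.932         # Galactic longitude of the north celestial pole (assumed value)


def equatorial_to_galactic(alpha_deg, delta_deg):
    """Return (l, b) in degrees from (alpha, delta) in degrees, using equations (2.1)."""
    a = math.radians(alpha_deg - ALPHA_GP)
    d = math.radians(delta_deg)
    dgp = math.radians(DELTA_GP)

    sin_b = math.sin(dgp) * math.sin(d) + math.cos(dgp) * math.cos(d) * math.cos(a)
    y = math.cos(d) * math.sin(a)                                   # cos b sin(l_CP - l)
    x = math.cos(dgp) * math.sin(d) - math.sin(dgp) * math.cos(d) * math.cos(a)  # cos b cos(l_CP - l)

    b = math.degrees(math.asin(max(-1.0, min(1.0, sin_b))))  # clamp against rounding
    l = (L_CP - math.degrees(math.atan2(y, x))) % 360.0
    return l, b


def galactic_to_equatorial(l_deg, b_deg):
    """Return (alpha, delta) in degrees from (l, b) in degrees, using equations (2.2)."""
    dl = math.radians(L_CP - l_deg)
    b = math.radians(b_deg)
    dgp = math.radians(DELTA_GP)

    sin_d = math.sin(dgp) * math.sin(b) + math.cos(dgp) * math.cos(b) * math.cos(dl)
    y = math.cos(b) * math.sin(dl)                                  # cos delta sin(alpha - alpha_GP)
    x = math.cos(dgp) * math.sin(b) - math.sin(dgp) * math.cos(b) * math.cos(dl)  # cos delta cos(alpha - alpha_GP)

    delta = math.degrees(math.asin(max(-1.0, min(1.0, sin_d))))
    alpha = (ALPHA_GP + math.degrees(math.atan2(y, x))) % 360.0
    return alpha, delta


# The quoted position of the Galactic center should come out close to l = 0, b = 0
# (an l slightly below 360 degrees is equivalent to a small negative longitude).
print(equatorial_to_galactic(266.405, -28.936))
```

At the Galactic poles themselves the longitude is undefined, and the sketch simply returns whatever angle the arctangent yields there.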


³ Actually, there have been two systems of Galactic coordinates. The old system $(l^{\rm I}, b^{\rm I})$ 
measured Galactic longitude $l^{\rm I}$ from one of the points where the Galactic equator intersects 
the celestial equator. This choice, which lacked physical motivation, was superseded in 
1958 (Blaauw et al. 1959). In some older work, the $l$ and $b$ defined here are referred to as 
$l^{\rm II}$ and $b^{\rm II}$. The cited values of $(\alpha, \delta)$ are for epoch 2000. 


2.1.3 Parallax 


As the Earth revolves around the Sun, the vantage point from which we view 
the stars continually changes. Thus, their apparent directions also change 
slightly. Consider Figure 2.6, and suppose the Earth is initially at $E_1$; the 
direction to a star S is then along the line $E_1S$. Six months later, when 
the Earth is at $E_2$, diametrically opposite the Sun, the star will appear in 
the direction $E_2S$. During the course of the year, the apparent position of 
a star traces out an elliptical path called the parallactic ellipse. The lines 
$E_1S$ and $E_2S$ contain an angle at S that is defined to be $2\varpi$, where $\varpi$ is 
the parallax of the star. If $r$ is the radius of the Earth’s orbit, and $d$ is the 
distance from the Sun to the star, then, because $d \gg r$, $\varpi$ is a small angle, 
and 

$$ \frac{r}{d} = \tan\varpi \simeq \varpi\ \text{rad}. \tag{2.3} $$


If we convert to seconds of arc, 

$$ \varpi'' = 206\,265\,\varpi\ \text{rad}, \tag{2.4} $$

and define the astronomical unit (AU) such that $r = 1$ AU, then we have 

$$ d = \frac{206\,265}{\varpi''}\ \text{AU}. \tag{2.5} $$


Our fundamental unit of distance, the parsec (pc), is defined as the 
distance at which a star would have a parallax of 1”. We have 


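$$ 1\ \text{pc} = 206\,265\ \text{AU} = 3.086 \times 10^{13}\ \text{km} = 3.26\ \text{light-years}, \tag{2.6} $$


and the distance to a star with observed parallax $\varpi''$ is 


$$ d = \frac{1}{\varpi''}\ \text{pc}. \tag{2.7} $$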


For Galactic-structure work, it is convenient to use the kiloparsec (kpc) = 
$10^3$ pc as a distance unit, and, for the discussion of distances between galaxies 
and in the Universe at large, the megaparsec (Mpc) = $10^6$ pc is employed. 
As is obvious from equation (2.7), the greater the distance to an object, the 
smaller is its parallax. 
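
The conversion from parallax to distance is a one-line calculation. The sketch below uses a hypothetical parallax of 0.1″ as its example, and the function and constant names are ours; the numerical constants are those of equation (2.6).

```python
AU_PER_PARSEC = 206265.0           # from equation (2.6)
KM_PER_PARSEC = 3.086e13
LIGHT_YEARS_PER_PARSEC = 3.26


def distance_pc(parallax_arcsec):
    """Distance in parsecs for a parallax in seconds of arc (reciprocal relation)."""
    if parallax_arcsec <= 0:
        raise ValueError("parallax must be positive")
    return 1.0 / parallax_arcsec


# Hypothetical star with a parallax of 0.1 arcsec.
d = distance_pc(0.1)
print(d, "pc")
print(d * AU_PER_PARSEC, "AU")
print(d * LIGHT_YEARS_PER_PARSEC, "light-years")
```

The guard against non-positive input is there because a measured parallax can be zero or negative once observational errors are included, in which case the simple reciprocal is meaningless.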

Equation (2.7) shows that a star’s parallax is the reciprocal of its dis- 
tance. On account of this close relation, the word ‘parallax’ has for as- 
tronomers come to be nearly synonymous with ‘distance’. In fact, the dis- 
tance to a star that has been obtained by measuring the star’s brightness 
(§3.5.2) is often called a photometric parallax, while a distance that has 
been directly obtained by observing a parallactic ellipse is called a trigono- 
metric parallax. In general one is liable to cause needless confusion by 
using the term ‘parallax’ for anything other than an inverse distance. 

Because stars are far away, their parallaxes are small. For several hun- 
dred years, attempts to measure them to “prove” the heliocentric theory 
of the solar system were frustrated by the difficulty of measuring shifts in 
position of less than 1″. It was not until 1838 that F. W. Bessel finally determined 
the parallax of the star 61 Cygni to be 0.29″. It is now possible to 
measure parallaxes out to distances of the order of 200 pc. The corresponding 
parallaxes are conveniently measured in milli-arcseconds (mas). 

The classical procedure for determining a parallax is in principle simple. 
An image of star S is taken when the Earth is at $E_1$, and another is taken 
when at $E_2$. The position of star S is measured on each image with respect 
to faint background stars (which are presumably very distant), and the difference 
in these positions is then twice the relative parallax. Naturally, 
the procedure is more complicated in practice. Since the atmosphere blurs 
the images of stars into splodges one arcsec or so in radius,⁴ many images are 
required and great care has to be taken in making the measurements. More- 
over, the reference stars do not actually lie at infinite distances, and they 
must therefore have finite (if small) parallaxes themselves. They will thus 
move on parallactic ellipses similar to that of star S (but, ideally, of much 
smaller size), and a correction has to be applied to reduce the measured rel- 
ative parallax to an absolute parallax, which refers to a frame truly fixed 
in space. The reduction to absolute parallax is usually made by applying a 
statistical correction, which depends on the brightness of the reference stars 
and Galactic coordinates of the field; see Green (1985) or McNally (1974) 
for details. 

The scope for error in all these reductions and corrections is very great, 
and classical trigonometric parallaxes are notoriously beset by large system- 
atic errors which one could neither eliminate nor even reliably estimate. The 
entire field has recently been revolutionized by a special-purpose astrometric 
satellite, called Hipparcos. 

Hipparcos, which collected data from November 1989 to March 1993, was 
able to measure directly the angles between stars with unprecedented accu- 
racy because in space: (i) there is no atmospheric refraction; (ii) a telescope 
is weightless and therefore does not flex as it slews over the sky;⁵ and (iii) it 
is possible to obtain diffraction-limited images. As a result of these advan- 
tages of working in space, Hipparcos measured the angles between stars that 
are widely separated on the sky with errors that are often only ~ 0.5 mas. 
Hipparcos was able to determine accurate positions and trigonometric paral- 
laxes for over 120000 stars. However, since it was working with a relatively 
small telescope, these stars were all comparatively bright.⁶ 

⁴ This blurring is called ‘seeing’ (see §3.2.3). 

⁵ Hipparcos measured the angles between stars that are widely separated on the sky 
by using rigid optics to superimpose two different fields in a common focal plane. 

⁶ The Hipparcos catalogue (Perryman et al. 1997) is complete only for apparent visual 
magnitude (§2.3) V < 8, although it does contain some parallaxes for stars as faint as 
V = 12. 

The largest known parallax is 0.75″ for $\alpha$ Centauri, which places it at 
a distance of 1.3 pc. The second largest is 0.55″ for Barnard’s star, which 
is at a distance of 1.8 pc. The typical errors in Hipparcos parallaxes are 
1 mas, so the distance of a star at 200 pc ($\varpi = 5$ mas) is uncertain by about 
20%, which implies a 40% uncertainty in the determination of its intrinsic 
brightness (see §2.3.3). 
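
The arithmetic behind these percentages is a first-order error propagation. Since the distance is the reciprocal of the parallax, the fractional error in distance is approximately the fractional error in parallax; and because the intrinsic brightness inferred from a given apparent brightness scales as the square of the distance, its fractional error is about twice as large. A minimal sketch, with a function name of our own choosing:

```python
def fractional_errors(parallax_mas, sigma_mas):
    """First-order fractional errors in distance and in inferred intrinsic brightness.

    d = 1/parallax, so sigma_d/d is roughly sigma/parallax; the inferred
    brightness scales as d**2, so its fractional error is about twice that.
    """
    frac_distance = sigma_mas / parallax_mas
    return frac_distance, 2.0 * frac_distance


# A star at 200 pc has a parallax of 5 mas; take a typical error of 1 mas.
frac_d, frac_b = fractional_errors(parallax_mas=5.0, sigma_mas=1.0)
print(f"distance: {frac_d:.0%}, intrinsic brightness: {frac_b:.0%}")
# prints "distance: 20%, intrinsic brightness: 40%"
```

This linear estimate is only a rough guide when the error is a sizeable fraction of the parallax itself, as it is here.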

The limitation that accurate trigonometric parallaxes are available only 
for nearby stars is a severe one, for there are types of stars of great astrophys- 
ical importance that are too rare to be found within the surveyable volume. 
To calibrate the intrinsic brightness of such stars, we must develop alternative 
methods for estimating their distances. Some of these methods are geometric 
in nature and hence are in some sense as fundamental as trigonometric mea- 
sures (though in most cases they invoke additional hypotheses). However, 
for many stellar types, the calibrations can be made only by comparing the 
apparent brightnesses of these stars with those of stars whose brightnesses 
have already been calibrated geometrically and with which they are known 
to be physically associated (for example, in binaries and clusters). In fact, 
trigonometric parallaxes for nearby stars play a key role in fixing the intrinsic 
brightness of practically every other kind of astrophysical object, and thus 
ultimately in setting the distance scale for the entire Universe (Chapter 7). 


2.1.4 Proper motions 


Because of their intrinsic motions in space, the stars slowly change position 
with respect to one another on the sky. The component of a star’s motion 
across the line of sight, after the motion of the Earth around the Sun has been 
removed, produces an angular rate of change in position termed proper mo- 
tion. This angular rate of change is directly proportional to the star’s linear 
velocity perpendicular to the line of sight and inversely proportional to the star’s 
distance. Therefore, if we can measure a star’s proper motion and estimate 
its distance, we can determine its transverse velocity — two components 
of its three-dimensional space velocity relative to the Sun (the other com- 
ponent is the radial velocity; see §2.2.1). The space motions of stars can 
be analyzed (see §10.3) to give information about the distribution of stellar 
velocities about the center of the Galaxy. Hence proper motions are an es- 
sential component in the development of our knowledge of the kinematics — 
and thus, ultimately, the dynamics — of the Galaxy. 
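
The proportionality between proper motion, distance and transverse velocity can be made quantitative. A proper motion of 1″ per year at a distance of 1 pc corresponds to a transverse displacement of 1 AU per year, which is about 4.74 km/s. This constant is not quoted in the text; the sketch below computes it from assumed values of the astronomical unit and the Julian year, and the function name is ours. The example uses the values for Barnard’s star given in this chapter (a proper motion of about 10.3″ per year at a distance of 1.8 pc).

```python
KM_PER_AU = 1.495978707e8       # kilometers in one astronomical unit (assumed value)
SECONDS_PER_YEAR = 3.15576e7    # Julian year of 365.25 days (assumed value)

# 1 arcsec/yr at 1 pc is 1 AU/yr, i.e. about 4.74 km/s.
KMS_PER_AU_PER_YEAR = KM_PER_AU / SECONDS_PER_YEAR


def transverse_velocity_kms(mu_arcsec_per_yr, distance_pc):
    """Transverse velocity in km/s from proper motion (arcsec/yr) and distance (pc)."""
    return KMS_PER_AU_PER_YEAR * mu_arcsec_per_yr * distance_pc


# Barnard's star: roughly 10.3 arcsec/yr at 1.8 pc.
print(transverse_velocity_kms(10.3, 1.8), "km/s")
```

The result is only as good as the distance estimate: a 20% error in distance carries over directly into a 20% error in transverse velocity.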

Proper motion is a vector quantity and has both a magnitude $\mu$, typically 
measured in mas per year, and a direction, which may be specified by its 
position angle $\theta$. In the equatorial system, the motion is usually resolved 
into two components: $\mu\cos\theta = \mu_\delta$, which is perpendicular to the celestial 
equator, and $\mu\sin\theta = \mu_\alpha\cos\delta$, which is parallel to the celestial equator. 
Here, $\mu_\alpha$ is the annual rate of change in right ascension (often expressed in 
seconds of time instead of seconds of arc) and the factor $\cos\delta$ comes from 
the convergence of hour circles to the celestial poles. 
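
As an illustration of this decomposition, the following sketch resolves a proper motion of given magnitude and position angle into its two equatorial components, and also recovers the rate of change of right ascension itself. The function name and the example star are hypothetical.

```python
import math


def proper_motion_components(mu, theta_deg, delta_deg):
    """Resolve a proper motion mu with position angle theta (north through east).

    Returns (mu_delta, mu_alpha_cos_delta, mu_alpha), all in the units of mu.
    """
    theta = math.radians(theta_deg)
    mu_delta = mu * math.cos(theta)              # component perpendicular to the equator
    mu_alpha_cos_delta = mu * math.sin(theta)    # component parallel to the equator
    # Rate of change of the right-ascension coordinate itself.
    mu_alpha = mu_alpha_cos_delta / math.cos(math.radians(delta_deg))
    return mu_delta, mu_alpha_cos_delta, mu_alpha


# Hypothetical star: 100 mas/yr at position angle 30 degrees, declination +60 degrees.
mu_d, mu_a_cosd, mu_a = proper_motion_components(100.0, 30.0, 60.0)
print(mu_d, mu_a_cosd, mu_a)
```

If the rate of change of right ascension is wanted in seconds of time rather than seconds of arc, it is divided by 15, following the time-to-angle conversions of §2.1.1. Close to the celestial poles the factor $\cos\delta$ approaches zero and the right-ascension rate becomes ill-defined.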

Proper motions are usually tiny; the largest known is that of Barnard’s 
star, which has a motion of 10.3″ per year. Typical measured motions are 
only a few hundredths of a second of arc per year. Since the phenomenon 
of ‘seeing’ (see §3.2.3) blurs the images of stars to finite-sized splodges, it is 
clear that many years must elapse before these small motions accumulate to 
an accurately-measurable effect. Nonetheless, it is now possible to measure 
the proper motions of stars as far away as the Galactic center (Spaenhauer, 
Jones & Whitford 1992). 

The secret of getting a good proper motion is obviously to compare the 
positions of stars at widely different epochs. Since Hipparcos only observed 
the sky for about three years, proper motions obtained from Hipparcos data 
alone are not remarkably accurate despite the unprecedented accuracy of 
Hipparcos positions. Often a more accurate proper motion can be obtained 
by combining a Hipparcos position with a less accurate position from old 
plate material. This fact and the ability of computers to make short work 
of extensive observational reductions have revived interest in the Astro- 
graphic Catalogue, an ambitious sky-mapping program that was carried 
on around the World in the years 1892-1950.⁷ The U.S. Naval Observatory 
is re-reducing all the data, starting from the (x,y) coordinates of stars on 
individual plates. The final result will be an all-sky catalogue that contains 
the positions, at epoch ~ 1900 and accurate to ~ 0.5″, of seven million stars 
to magnitude V ~ 12.5. These data will play a vital role in complementing 
the Hipparcos positions with accurate proper motions, which are essential if 
we are to carry the positions of Hipparcos stars forward in time. 


2.1.5 Precession and nutation 


Both Newtonian physics and (for an effectively isolated system) Einstein’s 
theory of relativity recognize the existence of a privileged class of ‘inertial’ 
frames of reference. For a variety of reasons, it is highly desirable to record 
the positions of astronomical objects relative to such an inertial frame. Un- 
fortunately, the system of equatorial coordinates described in §2.1.1 is not 
inertial because its two defining directions, the NCP and the VE, both move 
on complicated paths over the celestial sphere, with the result that the equa- 
torial coordinate system is constantly rotating with respect to inertial space. 

In view of this motion, astronomers do not actually record positions 
relative to the NCP and VE, but relative to the directions that on a given 
date nearly coincided with the directions of the NCP and VE. To explain 
how these reference directions are defined, we have to digress briefly into the 
dynamics of the solar system. 

Since we are interested in changes that occur over many years, or hun- 
dreds and thousands of months, we can treat the Moon’s mass as if it were 
uniformly distributed over the ring around the Earth that the Moon sweeps 
out in the course of a month. This ring is inclined by about 5° to the plane 
of the ecliptic, and the Sun and planets apply a torque to it. It responds 
to these torques by precessing like a top. That is, its axis slowly moves in
a small circle around the ecliptic pole with a period of 18.6 years. Hence,
averaged over times far in excess of 18.6 years, the Moon’s mass acts like a
band around the Earth whose axis points to the ecliptic pole.

7 The AC was probably the first great international scientific collaboration.

The Earth is not spherical but flattened. The Earth’s flattening allows 
the Sun and the Moon to exert torques on the Earth, and, because the Earth 
spins like a top, its axis of rotation precesses about the ecliptic pole. The 
period of this luni-solar precession is long, about 25 800 years. 

Finally, the shape and orientation of the Earth’s orbit about the Sun 
is changing on a time scale of tens of millions of years in response to the 
forces applied by the planets. Since the VE is defined as the intersection on 
the celestial sphere of the great circles of the ecliptic and the equator, any 
change in the direction of the ecliptic is liable to change the VE. 

By the late 19th century the theory of all these motions had been worked 
out in great detail and put into definitive form by Simon Newcomb. New- 
comb obtained Fourier series for the positions of the NCP and the VE as 
functions of time. These series contain terms with periods of order 18 years 
and shorter, and terms with periods of order 26000 years and longer. The 
motion described by the short-period terms is called nutation. The po- 
sitions predicted for the NCP and VE when these short-period terms are 
dropped are called the positions of the mean NCP and mean VE. They 
are the positions of the NCP and the VE when averaged over a time long 
compared to 18.6 years and short compared to several millennia, and are the 
positions actually used by astronomers when measuring $(\alpha, \delta)$.

Of course these mean positions are still moving with respect to inertial 
space, so it is essential to specify the epoch at which $(\alpha, \delta)$ were measured.
In practice only a few epochs are employed: 1900, 1950 and the current
standard, 2000. That is, quoted values of $(\alpha, \delta)$ are positions relative to the
expected locations of the mean NCP and mean VE at the very beginning of
1900, 1950 or 2000. One speaks of precessing equatorial coordinates from
one epoch to another because this coordinate transformation largely consists
in allowing for the precession of the NCP about the pole of the ecliptic. At
present the VE moves by 50.25 arcsec per year.

Unfortunately, our knowledge of the mass distribution within the Earth 
does not allow us to calculate the luni-solar precession with sufficient ac- 
curacy. Thus, it has to be determined by observing the apparent motion 
of celestial objects to which it gives rise. The natural objects to study for 
this purpose are distant galaxies and quasars, because these objects are so 
distant that, for any plausible transverse velocities, their motions relative 
to an inertial frame will be undetectably small. Unfortunately, such objects 
appear very faint, and it has only recently become feasible to use them to 
define an inertial system — see below. Observations of them currently yield 
for the value of the luni-solar precession constant $50.3851 \pm 0.0004$ arcsec yr$^{-1}$.


2.1.6 Astrometric systems


Ground-based optical observations can determine the relative positions of 
objects that are close to one another on the sky very much more easily than 
the angles between widely separated objects: the angles between all objects 
that lie within a single field of view of a telescope can be determined simply 
by measuring the distances between their images on a photographic plate or 
CCD. However, atmospheric refraction, telescope flexure and other effects 
make it very difficult to determine the exact location of the field. Conse- 
quently, astrometrists proceed by first determining the absolute positions 
of certain fundamental stars that are distributed all over the sky, and then 
relate the positions of other objects to the nearest fundamental object; such 
positions are called relative positions. For example, the first step in the 
re-reduction of the Astrographic Catalogue plates at the U.S. Naval Obser- 
vatory was the determination of absolute positions of several stars on each 
plate, the so-called ACRS stars.⁸

An astrometric system comprises a catalog of objects with absolute 
positions and proper motions. These data are then used to convert positions 
and proper motions measured relative to the fundamental objects of the 
astrometric system to absolute values in that system. 

The classical procedure for determining the absolute position of a star 
is to note its time of passage through the meridian, while simultaneously 
measuring its zenith distance. These measurements are taken with a special- 
purpose telescope, called a meridian circle, which is mounted so that it 
can point only along the meridian. Each year the Carlsberg Automated 
Meridian Circle on La Palma measures the positions of about 12000 stars — 
see Appendix B. 

It is unfortunate that the reference stars observed by meridian-circles 
are bright, because bright stars tend to be nearby, and thus to have large 
proper motions. Hence, our astrometric reference objects are moving all over 
the place, and it is essential to determine what these motions are so that one 
can say where any given reference point will be at any given epoch. But, 
how is one to determine these motions, given that the reference directions of 
the coordinate system one uses are themselves moving in an unknown way? 
Theory tells us the direction of the luni-solar precession, even if it cannot 
tell us its magnitude. The standard escape from this dilemma is thus to 
assume that any apparent average motion of the bright stars in the direction 
of the luni-solar precession is caused by precession, and hence to determine 
the magnitude of the precession. 

The most accurate astrometric system based on meridian-circle observations
is the FK5 system⁹ (see Appendix B), in which positions and
proper motions of the reference stars have typical accuracies of 0.03 arcsec and
~0.8 mas yr$^{-1}$, respectively. Position and proper-motion catalogs that were
published before 1997 were usually tied to either the FK5 system or one of
its predecessors.

8 ACRS is the acronym of Astrographic Catalog Reference Stars.
9 FK5 stands for the fifth Fundamental Katalog.

A superior astrometric system, called the International Celestial 
Reference System (ICRS), has now been established by observing ex- 
tragalactic objects (Arias et al. 1995). These objects are so distant that 
their proper motions are almost certainly unobservably small. They are too 
faint for their absolute positions to be determined with a meridian circle, 
but many of them are radio sources, and radio interferometers can mea- 
sure accurately the angles between sources that are widely separated on the 
sky. Hence by radio interferometry one can establish, without reference to 
the Earth, accurate relative positions for a network of extragalactic objects 
that are distributed throughout the sky. Since the true proper motions of 
the ICRS objects must be negligible, they form a reference system that is 
inherently superior to any that is based on Galactic stars. 

As was mentioned in §2.1.3, Hipparcos, like a radio telescope, could 
measure accurately the angles between stars that are widely separated on 
the sky. This capability made it possible to put every star in the Hipparcos 
catalogue on the ICRS by adding to the Hipparcos observing list just a few 
objects whose positions could be determined on the ICRS by radio interfer- 
ometry. Significant errors were uncovered in the FK5 system, especially at 
southern latitudes, where errors as large as 0.16 arcsec have been detected. 

Another important catalog with positions on the ICRS is the Northern 
Proper Motion catalog (NPM), the first part of which was recently completed 
after decades of work at the Lick Observatory (Klemola, Jones & Hanson 
1987, Hanson, Klemola & Jones 1994). It contains the positions and proper 
motions of 149000 rather faint stars located throughout the portion of the 
sky that lies north of the Milky Way (Appendix B). It nicely complements 
the Hipparcos catalogue, which is complete for fairly bright stars. 


2.2 Distances determined from velocities 


We have seen that proper motions can now be measured for stars that
are as far away as the Galactic center. If one knows both the distance and
the proper motion of a star, then its transverse velocity can be calculated.
To complete one’s knowledge of the star’s velocity vector $\mathbf{v}$ it remains only
to determine the component $v_r$ of $\mathbf{v}$ along the line of sight. Traditionally $\mathbf{v}$
is called the star’s space velocity and $v_r$ is called its radial velocity. The
sign of $v_r$ is taken to be positive if the source moves away from the observer
and negative if it approaches.


2.2.1 Radial velocities


When a source of radiation moves toward or away from an observer, the observed
frequencies of photons will be different from their emitted frequencies.
Quantitatively, if $v_r$ is the radial velocity of a source that emits photons that
have frequency $\nu_0$ in the rest frame of the source, then these photons will be
detected at frequency

$$\nu = (1 - \beta)\,\gamma\,\nu_0, \quad\text{where}\quad \beta = v_r/c, \quad \gamma = (1 - \beta^2)^{-1/2}. \tag{2.8}$$

Here $c$ is the velocity of light. To the lowest order in $v/c$, $\gamma = 1$ and the
frequency shift is

$$\Delta\nu = \nu - \nu_0 = -\beta\nu_0. \tag{2.9}$$

In terms of wavelengths $\lambda = c/\nu$ we have to this order in $v/c$ that
$\Delta\lambda/\lambda_0 = -\Delta\nu/\nu_0$, so

$$\Delta\lambda = \beta\lambda_0. \tag{2.10}$$

Thus $\Delta\lambda$ for a receding source is positive, and the observed spectrum is
redshifted relative to its rest wavelength; for an approaching source, $\Delta\lambda$ is
negative and the observed spectrum is blueshifted. In general, the redshift
of an object is defined to be

$$z = \Delta\lambda/\lambda_0. \tag{2.11}$$
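
The following Python sketch evaluates equations (2.8), (2.10) and (2.11) for an illustrative case. The function names are ours, and the rest wavelength adopted for the Hα line (656.28 nm) is an approximate value used only as an example.

```python
import math

C_KMS = 299_792.458  # speed of light in km/s

def observed_frequency(nu0, v_r_kms):
    """Equation (2.8): observed frequency for radial velocity v_r (km/s)."""
    beta = v_r_kms / C_KMS
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    return (1.0 - beta) * gamma * nu0

def wavelength_shift_first_order(lambda0, v_r_kms):
    """Equation (2.10): first-order shift, in the same units as lambda0."""
    return (v_r_kms / C_KMS) * lambda0

def redshift_exact(v_r_kms):
    """Redshift z = lambda/lambda0 - 1 = nu0/nu - 1 from equation (2.8)."""
    return 1.0 / observed_frequency(1.0, v_r_kms) - 1.0

# A source receding at 30 km/s, observed in H-alpha (rest wavelength ~656.28 nm).
v_r = 30.0
delta_lambda_nm = wavelength_shift_first_order(656.28, v_r)
z_first_order = delta_lambda_nm / 656.28   # equation (2.11)
z_exact = redshift_exact(v_r)
```

For stellar velocities of tens of kilometres per second the first-order and exact redshifts differ only at the level of $\beta^2/2$, which is why the lowest-order expressions suffice in this section.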


When we examine the spectrum of a star, we can often identify the ob- 
served spectral lines with those produced by various chemical elements in 
definite states of excitation and ionization; from terrestrial experiments we 
know the rest wavelengths of these lines, and by measuring the positions of 
the lines in the star’s spectrum, we can determine their wavelength shift and 
hence the radial velocity of the object with respect to the spectrograph. After
correcting for any components of the Earth’s orbital velocity (30 km s$^{-1}$)
and rotation velocity (0.5 km s$^{-1}$) along the line of sight we obtain the
heliocentric radial velocity of the object.

One often requires values of $v_r$ that are accurate to $\pm 1$ km s$^{-1}$ $\simeq
3 \times 10^{-6}c$. In this case the wavelength must be determined to a few parts in
a million. Fortunately, algorithms have been developed that enable surprisingly
accurate radial velocities to be extracted from mediocre spectra (see
§11.1.1). Since these algorithms are computationally intensive, historically
many of the best radial velocities in the literature have been determined
with instruments that implement the algorithms in hardware (Griffin 1967,
Baranne, Mayor & Poncet 1979). With such instruments, velocities can be
rather easily measured to an accuracy of better than $\pm 1$ km s$^{-1}$. In fact, the
velocities of reasonably bright stars can be measured to the limit in accuracy,
~ $\pm 0.1$ km s$^{-1}$, that is set by turbulence in stellar photospheres (e.g., Griffin
et al. 1988).

Some stars are observed to have variable radial velocities. These may be 
caused by the orbital motion of a star around a companion or by pulsation 
of the star — see §§3.1.2 and 5.1.10. In either case, the variations in radial 
velocity yield astrophysically important information. 


2.2.2 Distances from the moving-cluster method 


Imagine watching a time-lapse movie of a cluster of stars as it recedes from
the Sun. At first it occupies an appreciable patch of the sky, but gradually it
shrinks to no more than a dot of light in the distance. Let us assume that the
cluster has a constant physical diameter, $d$. Then its angular diameter is

$$\theta = \frac{d}{D}, \tag{2.12}$$

where $D$ is the cluster’s distance. Taking natural logarithms of both sides of
this equation and differentiating with respect to time, we find
$\dot\theta/\theta = -\dot D/D$, so that

$$D = -\frac{\theta}{\dot\theta}\,v_r, \tag{2.13}$$

where $v_r = \dot D$ is the cluster’s radial velocity. Hence, by measuring both
$v_r$ and the rate $-\dot\theta/\theta$ at which the cluster is apparently shrinking, we can
determine the distance $D$. Clearly, the same logic applies to the case of a
cluster that is approaching us. If the cluster is not travelling directly away
from us, it will move across the sky as it shrinks or expands.
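
Equation (2.13) can be turned into a short numerical sketch. The function name and the unit-conversion constants below are our own choices, and the input values are hypothetical; the only physics used is $D = -(\theta/\dot\theta)\,v_r$.

```python
KM_PER_PC = 3.0857e13   # kilometres in one parsec
SEC_PER_YR = 3.15576e7  # seconds in one Julian year

def distance_from_shrinkage(theta, theta_dot_per_yr, v_r_kms):
    """Distance in pc from equation (2.13).

    theta            : angular diameter (any angular unit)
    theta_dot_per_yr : its rate of change per year (same angular unit);
                       negative for a receding, shrinking cluster
    v_r_kms          : radial velocity in km/s (positive for recession)
    """
    shrink_time_yr = -theta / theta_dot_per_yr  # time scale theta / |theta_dot|
    return v_r_kms * shrink_time_yr * SEC_PER_YR / KM_PER_PC

# Hypothetical cluster: recedes at 40 km/s while its angular diameter of
# 10 degrees shrinks by 8.9e-6 degrees per year.
D_pc = distance_from_shrinkage(10.0, -8.9e-6, 40.0)
```

Only the fractional rate $\dot\theta/\theta$ enters, so the result does not depend on the angular unit or on the true physical diameter of the cluster.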

Clearly, the proper motions of individual stars within a receding cluster 
will be directed towards the point on the sky that the whole cluster will 
occupy as it shrinks to a point. Hence, this point is known as the convergent 
point of the cluster. In the case of an approaching cluster, the convergent 
point is the point from which the cluster has expanded. In fact, many clusters 
have been discovered by searching proper-motion catalogs for stars whose 
proper-motion vectors all intersect in a single point, the convergent point of 
a moving cluster (Figure 2.7). 

The convergent point is defined by the proper motions of cluster stars 
but it contains key information about the space velocity v of the stars: it 
defines the direction v/|v| of this velocity. To see this, imagine the situation 
when the stars in a receding cluster have finally reached the convergent point 
on the sky. At this time they cease to move across the sky because they are 
moving directly away from us. 

Figure 2.7 Schematic diagram of a moving cluster. When the proper
motions of the cluster stars are extended, they appear to intersect at a
point of convergence on the sky.

Figure 2.8 The space motion of a star in a moving cluster is assumed
to be parallel to the line of sight from the Sun to the apparent convergent
point of the cluster. Then the angle $\psi$ is both the angle between
the positions of the star and of the convergent point on the sky, and
the angle between the star’s space-velocity vector and the Sun-star line
of sight.

We now show that once a cluster’s convergent point has been identified,
we can use it to determine distances to individual cluster stars. Figure 2.8
shows the plane that contains the Sun, a cluster star and the velocity vector
$\mathbf{v}$ of the cluster. As we have seen, $\mathbf{v}$ is in the direction of the convergent
point. In the notation of the figure, the star’s radial velocity is $v_r = v\cos\psi$,
and its velocity transverse to the line of sight is $v_t = v\sin\psi$. Thus

$$v_t = v_r\tan\psi, \tag{2.14}$$

which gives $v_t$ in terms of directly observable quantities. But $v_t = \mu d$, where
$\mu$ is the magnitude of the star’s proper motion and $d$ is its distance. Hence

$$d = \frac{v_t}{\mu} = \frac{v_r\tan\psi}{\mu}. \tag{2.15}$$

Equation (2.15) now yields

$$\frac{\varpi}{\mathrm{mas}} = \frac{4.74}{\tan\psi}\left(\frac{v_r}{\mathrm{km\,s^{-1}}}\right)^{-1}\frac{\mu}{\mathrm{mas\,yr^{-1}}}. \tag{2.16}$$

From these equations we can obtain the distance (or parallax) of each clus- 
ter member from its observed proper motion, radial velocity, and angular 
separation from the convergent point. 
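
A minimal Python sketch of equation (2.16) follows. The function names are ours, and the input numbers are round, Hyades-like values chosen only for illustration; they are not measurements quoted in this text.

```python
import math

K = 4.74  # km/s corresponding to a proper motion of 1 arcsec/yr at 1 pc

def moving_cluster_parallax(mu_mas_yr, v_r_kms, psi_deg):
    """Parallax in mas from equation (2.16).

    mu_mas_yr : magnitude of the star's proper motion (mas/yr)
    v_r_kms   : radial velocity of the star (km/s)
    psi_deg   : angular separation of the star from the convergent point (deg)
    """
    return K * mu_mas_yr / (v_r_kms * math.tan(math.radians(psi_deg)))

def distance_pc(parallax_mas):
    """Distance in parsecs corresponding to a parallax in mas."""
    return 1000.0 / parallax_mas

# Illustrative star: mu = 110 mas/yr, v_r = 39 km/s, psi = 33 deg.
parallax = moving_cluster_parallax(110.0, 39.0, 33.0)
distance = distance_pc(parallax)
```

The estimate degrades as $\psi$ approaches 0° or 90°, where either the transverse or the radial velocity carries almost no information about the space motion.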

In practice, the procedure is somewhat more complicated than that just 
outlined. Details are described in Hanson (1975) and the references cited 
therein. The method works best when the angular size of the cluster is 
sufficiently large that the proper motions of the individual cluster stars are 
oriented in substantially different directions and thus define a convergent 
point accurately. Furthermore, the cluster must be close enough that the
proper motions are large enough to be measured accurately, and the angle $\psi$
must be such that the space motion makes a significant contribution to both
$v_t$ and $v_r$. Finally, cluster members must be reliably identifiable so that
foreground and background field stars can be excluded.

The moving-cluster method has been applied to a number of clusters, 
including: (i) the Hyades, which contains about 200 stars at an average dis- 
tance d = 45 pc (van Altena 1974, Hanson 1975, Hanson 1980, Schwan 1991, 
Perryman et al. 1995); (ii) the Ursa Major group, which contains about 60 
stars at an average distance of about 24pc (Eggen 1958, 1960); (iii) the 
Pleiades, which contains about 600 stars at an average distance of about 
115 pc (van Leeuwen & Hansen Ruiz 1997); and (iv) the Scorpio-Centaurus
group, which contains about 100 stars at an average distance of about 170 pc 
(Bertiau 1958). Historically, the distance to the Hyades cluster was of enor- 
mous importance because the cluster includes types of stars whose distances, 
and hence whose absolute magnitudes, could not be measured by the trigono- 
metric parallax method. In fact, until very recently, the distance to the 
Hyades set the scale for essentially all Galactic and extragalactic distance 
measurements, and revisions of it have had the most far-reaching repercussions.
Schwan (1991) determined 45.3 ± 2.1 pc from the classical moving-cluster
method, while Gunn et al. (1988) obtained 47.9 ± 0.9 pc from a modified
version of the moving-cluster method that employs radial velocities.
The Hyades cluster is less important now that trigonometric parallaxes can 
be obtained at its distance and beyond. Indeed, parallax and proper-motion 
data are now so accurate that it is possible to determine which stars lie on 
the near side and which on the far side of the Hyades, so the whole concept 
of a single distance to the Hyades is outdated. Of course, from individual 
distances one can determine the distance to the cluster’s barycenter: Brown 
et al. (1997) find d = 46.3 ± 0.3 pc.


2.2.3 Secular parallaxes 


In a frame in which the mean of the velocities of the nearby stars vanishes,
the Sun, like other stars, has a non-zero space velocity, $\mathbf{v}_\odot$. Let us assume
that we know the Sun’s space velocity $\mathbf{v}_\odot$, and that it has magnitude
$|\mathbf{v}_\odot| \simeq 20$ km s$^{-1}$.

The trigonometric parallax of a star is half the angular displacement 
on the sky produced when the star is viewed from opposite sides of the 
Earth’s orbit. As we have seen, this effect cannot be reliably measured for 
stars more distant than d ~ 200 pc. Obviously, we could extend the range
of our distance determinations if we could use a longer baseline than the
2 AU diameter of the Earth’s orbit. The Sun’s non-zero space velocity makes
this extension possible: the Sun’s motion at around 20 km s$^{-1}$ relative to
the average of nearby stars implies that the solar system moves about 4 AU
per year, so observations separated by, say, twenty years give us an effective
baseline forty times larger than that provided by the Earth’s motion around
the Sun. The method of secular parallaxes exploits this longer baseline.

Figure 2.9 The motion relative to the Sun of stars on a sphere of
radius $x$ gives rise to both radial velocities and proper motions. In
some directions the proper motion vanishes, while in others the radial
velocity vanishes.
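
The baseline argument for secular parallaxes (a solar motion of about 20 km s$^{-1}$ observed over twenty years) is simple arithmetic, sketched here in Python. The constants are standard values for the astronomical unit and the Julian year; the function name is ours.

```python
KM_PER_AU = 1.495978707e8  # kilometres in one astronomical unit
SEC_PER_YR = 3.15576e7     # seconds in one Julian year

def au_per_year(v_kms):
    """Distance travelled in one year, in AU, at a speed of v_kms km/s."""
    return v_kms * SEC_PER_YR / KM_PER_AU

drift_per_year = au_per_year(20.0)         # a little over 4 AU per year
baseline_20_yr = 20.0 * drift_per_year     # baseline after twenty years, in AU
gain_over_orbit = baseline_20_yr / 2.0     # relative to the 2 AU orbital diameter
```

The gain comes out slightly above forty, consistent with the round numbers quoted in the text.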


We focus on stars that lie within a spherical shell, radius $x$, centered
on the Sun; we can pick out such stars by selecting stars that all have
roughly the same luminosity (see §3.6.4) and similar apparent brightnesses.
For the moment we assume that these stars have a common space velocity $\mathbf{v}$
relative to the Sun. At most points on the sky $\mathbf{v}$ contributes to both radial
velocities and proper motions, but in the direction of $\mathbf{v}$ proper motions will
vanish and radial velocities will be equal to the magnitude of $\mathbf{v}$ (Figure 2.9).
Hence, by measuring radial velocities over the sky, we can determine $\mathbf{v}$. The
radius of the sphere can now be determined from the magnitude $\mu_\perp$ of the
proper motions in directions at which the radial velocities vanish (Figure 2.9).
Specifically, we have

$$x = v/\mu_\perp. \tag{2.17}$$


Equation (2.17) is in practice not very useful for two reasons. First, 
the only proper motions it uses are those of stars that lie in directions ex- 
actly perpendicular to v. Second, it assumes that all stars have a common 
space velocity, whereas in reality the velocities of individual stars will scatter 
around some mean motion. By formulating our idea more mathematically, 
we can resolve both these problems. 

Let $\mathbf{v}_i$ be the velocity of the $i$th star in the frame in which the mean
velocity of the stars of our group is zero: $\sum_i \mathbf{v}_i = 0$. The heliocentric velocity
of the $i$th star is

$$\mathbf{u}_i = \mathbf{v}_i - \mathbf{v}_\odot, \tag{2.18}$$

where $\mathbf{v}_\odot$ is the solar motion with respect to the stars under investigation.
In principle, $\mathbf{v}_\odot$ should be determined by measuring the radial velocities
of these stars. Since measuring large numbers of radial velocities consumes
a great deal of telescope time, a value for $\mathbf{v}_\odot$ is frequently adopted from some
other sample of stars. This procedure is acceptable provided that the mean
velocities of the two samples are similar.

Figure 2.10 Left: on the sky, the $i$th star is at $\hat{\mathbf{x}}_i$, while the Sun is moving in the direction
$\hat{\mathbf{v}}_\odot$. The angle between these directions is $\psi_i = \arccos(\hat{\mathbf{x}}_i\cdot\hat{\mathbf{v}}_\odot)$. The proper motion vector
$\boldsymbol{\mu}_i$ is decomposed into components parallel and perpendicular to the great circle joining
$\hat{\mathbf{x}}_i$ to $\hat{\mathbf{v}}_\odot$. Right: the plane containing the Sun, the star and $\mathbf{v}_\odot$.


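The proper motion $\boldsymbol{\mu}_i$ of the $i$th star is proportional to the component
of $\mathbf{u}_i$ that is perpendicular to the position vector $\mathbf{x}_i$ of the star. With the
convention that for any vector $\mathbf{b}$, $\hat{\mathbf{b}} = \mathbf{b}/b$ is the unit vector in the direction
of $\mathbf{b}$, we have

$$\boldsymbol{\mu}_i = \frac{\mathbf{u}_i - (\hat{\mathbf{x}}_i\cdot\mathbf{u}_i)\,\hat{\mathbf{x}}_i}{x_i} = \frac{\hat{\mathbf{x}}_i\times(\mathbf{u}_i\times\hat{\mathbf{x}}_i)}{x_i}. \tag{2.19}$$

$\boldsymbol{\mu}_i$ is a vector on the sky based on the location $\hat{\mathbf{x}}_i$ of the $i$th star (see
Figure 2.10). We now resolve it into its component $\mu_{\parallel i}$ parallel to the line
(great circle) that joins $\hat{\mathbf{x}}_i$ to $\hat{\mathbf{v}}_\odot$, and its perpendicular component, $\mu_{\perp i}$.¹⁰
Consideration of the geometry of the plane containing the Sun, the star and
$\mathbf{v}_\odot$ (see Figure 2.10) shows that

$$\begin{aligned}
\mu_{\parallel i} &= \frac{[\hat{\mathbf{x}}_i\times(\hat{\mathbf{x}}_i\times\hat{\mathbf{v}}_\odot)]\cdot\boldsymbol{\mu}_i}{\sin\psi_i} \\
&= \frac{[\hat{\mathbf{x}}_i\times(\hat{\mathbf{x}}_i\times\hat{\mathbf{v}}_\odot)]\cdot(\mathbf{v}_i - \mathbf{v}_\odot)}{x_i\sin\psi_i} \\
&= \frac{(\hat{\mathbf{x}}_i\times\hat{\mathbf{v}}_\odot)\cdot\bigl((\mathbf{v}_i - \mathbf{v}_\odot)\times\hat{\mathbf{x}}_i\bigr)}{x_i\sin\psi_i}.
\end{aligned} \tag{2.20}$$

Here we have used the vector identity $(\mathbf{a}\times\mathbf{b})\cdot\mathbf{c} = (\mathbf{b}\times\mathbf{c})\cdot\mathbf{a}$ to get from
the second to the third line. Solving for $x_i$ and bearing in mind that $\psi_i$ is
the angle between $\hat{\mathbf{x}}_i$ and $\hat{\mathbf{v}}_\odot$, we find

$$\begin{aligned}
x_i &= \frac{v_\odot\,|\hat{\mathbf{x}}_i\times\hat{\mathbf{v}}_\odot|^2}{\mu_{\parallel i}\sin\psi_i} - \frac{(\hat{\mathbf{x}}_i\times\hat{\mathbf{v}}_\odot)\cdot(\hat{\mathbf{x}}_i\times\mathbf{v}_i)}{\mu_{\parallel i}\sin\psi_i} \\
&= \frac{v_\odot\sin^2\psi_i}{\mu_{\parallel i}\sin\psi_i} - \frac{(\hat{\mathbf{x}}_i\times\hat{\mathbf{v}}_\odot)\cdot(\hat{\mathbf{x}}_i\times\mathbf{v}_i)}{\mu_{\parallel i}\sin\psi_i}.
\end{aligned} \tag{2.21}$$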


10 In the literature, the symbols $\parallel$ and $\perp$ are denoted $\upsilon$ (upsilon) and $\tau$ (tau). These
names are not mnemonic, and only people with good eyesight can distinguish $\upsilon$ from $v$.

The idea now is to average over all the $N$ stars in our group by summing
over $i$ and dividing by $N$. The second term in the second line of equation
(2.21) then vanishes, since at any point on the sky, $\mathbf{v}_i$ averages to zero by
our choice of reference frame. However, we have only a finite number of
stars, and it pays to take care how we do this averaging. First we multiply
through by $\sin\psi_i$ so that small denominators do not bother us. Then, since
we are seeking a parallax, rather than a distance, we multiply through by
$\varpi_i = 1/x_i$. Finally, we average each term and discard the term proportional
to $\mathbf{v}_i$, to find

$$\langle\varpi\rangle = \frac{\langle\mu_{\parallel i}\sin\psi_i\rangle}{v_\odot\,\langle\sin^2\psi_i\rangle}. \tag{2.22}$$


A parallax obtained from equation (2.22) is called a secular parallax. 
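
The estimator in equation (2.22) can be tested on synthetic data. The sketch below is an illustration under stated assumptions, not a reduction procedure from the literature: all stars are placed at one distance, their random velocities are drawn from an isotropic Gaussian, the parallel component of proper motion is counted positive away from the solar apex, and the factor 4.74 converts between km s$^{-1}$, pc and arcsec yr$^{-1}$. All names and parameter values are hypothetical.

```python
import math
import random

K = 4.74  # km/s corresponding to 1 arcsec/yr at 1 pc

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def random_unit_vector(rng):
    """Direction drawn uniformly over the sphere."""
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    s = math.sqrt(1.0 - z * z)
    return (s * math.cos(phi), s * math.sin(phi), z)

def simulate_secular_parallax(n_stars=20000, dist_pc=500.0,
                              v_sun=(0.0, 0.0, 20.0), sigma=15.0, seed=1):
    """Simulate a stellar group and return the parallax (arcsec) from eq. (2.22)."""
    rng = random.Random(seed)
    v_sun_mag = math.sqrt(dot(v_sun, v_sun))
    v_sun_hat = tuple(c / v_sun_mag for c in v_sun)
    num = 0.0  # running sum of mu_par * sin(psi)
    den = 0.0  # running sum of sin^2(psi)
    for _ in range(n_stars):
        x_hat = random_unit_vector(rng)
        v = tuple(rng.gauss(0.0, sigma) for _ in range(3))
        u = tuple(vi - vs for vi, vs in zip(v, v_sun))  # equation (2.18)
        cos_psi = dot(x_hat, v_sun_hat)
        sin_psi = math.sqrt(max(0.0, 1.0 - cos_psi * cos_psi))
        if sin_psi < 1e-9:
            continue  # star lies at the apex or antapex; no direction defined
        # Unit vector on the sky at x_hat, along the great circle through the
        # apex and pointing away from it.
        e_par = tuple((xh * cos_psi - vh) / sin_psi
                      for xh, vh in zip(x_hat, v_sun_hat))
        mu_par = dot(u, e_par) / (K * dist_pc)  # arcsec/yr
        num += mu_par * sin_psi
        den += sin_psi * sin_psi
    return K * num / (v_sun_mag * den)

estimate_arcsec = simulate_secular_parallax()
true_arcsec = 1.0 / 500.0
```

With these settings the random velocities average out and the estimate should lie close to the true parallax of 2 mas; the scatter grows when the velocity dispersion is large compared with the solar motion.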


2.2.4 Statistical parallaxes 


We now introduce an additional assumption: we assume that the velocities $\mathbf{v}_i$
are isotropically distributed. Specifically, we assume that the average value
of $|\hat{\mathbf{e}}\cdot\mathbf{v}_i|$ is independent of the unit vector $\hat{\mathbf{e}}$. If this assumption holds, we
can derive from the perpendicular components of proper motion $\mu_{\perp i}$ defined
in the last subsection, an independent estimate of the distance to the sample
of stars.
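The observed radial velocity of the $i$th star is

$$\begin{aligned}
v_{ri} &= \hat{\mathbf{x}}_i\cdot(\mathbf{v}_i - \mathbf{v}_\odot) \\
&= \hat{\mathbf{x}}_i\cdot\mathbf{v}_i - v_\odot\cos\psi_i,
\end{aligned} \tag{2.23}$$

where the angle $\psi_i$ is defined in Figure 2.10. The component of $\mathbf{v}_i$ perpendicular
to the plane containing the Sun, the star and $\mathbf{v}_\odot$ is $x_i\mu_{\perp i}$, and, by
our hypothesis that the mean magnitude of any component of $\mathbf{v}_i$ is the same,
we have that

$$\langle|\hat{\mathbf{x}}_i\cdot\mathbf{v}_i|\rangle = \langle x_i\,|\mu_{\perp i}|\rangle. \tag{2.24}$$

Substituting this equality into equation (2.23), and assuming that all stars
are at the same distance $\bar{x} = 1/\bar{\varpi}$, we find

$$\bar{\varpi} = \frac{\langle|\mu_{\perp i}|\rangle}{\langle|v_{ri} + v_\odot\cos\psi_i|\rangle}. \tag{2.25}$$
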

A parallax obtained from equation (2.25) is called a statistical parallax. 
A more sophisticated analysis, which takes into account the ellipsoidal 
shape of the random velocity distribution (see §10.3.2) is described in §3.73 of 
Trumpler & Weaver (1953). Also, when we apply the secular and statistical
parallax methods to distant stars, the equations given here should, strictly
speaking, be generalized to allow for the effects of differential Galactic
rotation. This generalization is in principle straightforward, but lies outside the
scope of our discussion. 
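
As with the secular parallax, equation (2.25) can be checked on synthetic data. The sketch below makes the same assumptions as the text (a common distance and isotropic random velocities) and adds the unit-conversion factor 4.74; the Gaussian velocity distribution, the names and the parameter values are hypothetical choices made for the illustration.

```python
import math
import random

K = 4.74  # km/s corresponding to 1 arcsec/yr at 1 pc

def dot(a, b):
    return sum(p * q for p, q in zip(a, b))

def cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def random_unit_vector(rng):
    """Direction drawn uniformly over the sphere."""
    z = rng.uniform(-1.0, 1.0)
    phi = rng.uniform(0.0, 2.0 * math.pi)
    s = math.sqrt(1.0 - z * z)
    return (s * math.cos(phi), s * math.sin(phi), z)

def simulate_statistical_parallax(n_stars=20000, dist_pc=500.0,
                                  v_sun_mag=20.0, sigma=15.0, seed=2):
    """Simulate a stellar group and return the parallax (arcsec) from eq. (2.25)."""
    rng = random.Random(seed)
    v_sun_hat = (0.0, 0.0, 1.0)  # direction of the solar motion (arbitrary axis)
    sum_mu_perp = 0.0
    sum_v = 0.0
    for _ in range(n_stars):
        x_hat = random_unit_vector(rng)
        v = tuple(rng.gauss(0.0, sigma) for _ in range(3))
        cos_psi = dot(x_hat, v_sun_hat)
        normal = cross(x_hat, v_sun_hat)
        sin_psi = math.sqrt(dot(normal, normal))
        if sin_psi < 1e-9:
            continue  # perpendicular direction undefined at apex and antapex
        # Unit vector perpendicular to the plane of the Sun, the star and v_sun.
        e_perp = tuple(c / sin_psi for c in normal)
        # Simulated observables; the solar motion has no component along e_perp.
        mu_perp = dot(v, e_perp) / (K * dist_pc)       # arcsec/yr
        v_r = dot(x_hat, v) - v_sun_mag * cos_psi      # equation (2.23)
        sum_mu_perp += abs(mu_perp)
        sum_v += abs(v_r + v_sun_mag * cos_psi)
    return K * sum_mu_perp / sum_v

estimate_arcsec = simulate_statistical_parallax()
true_arcsec = 1.0 / 500.0
```

Here the signal comes entirely from the random velocities, so the estimate would fail if the dispersion were set to zero, whereas the secular estimate would fail if the solar motion were zero.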

If $\langle|\hat{\mathbf{x}}\cdot\mathbf{v}|\rangle$ is greater than $v_\odot$, the statistical parallax is probably more
reliable than the secular one, because the latter exploits the solar motion,
while the former depends essentially on the stars’ random velocities. Conversely,
if $v_\odot$ is greater than $\langle|\hat{\mathbf{x}}\cdot\mathbf{v}|\rangle$, the secular parallax is probably the
more reliable.

The secular and statistical parallax methods have been used to estimate 
the distances, and hence the absolute magnitudes, of rare but luminous stars. 
Until Hipparcos completed its work, these stars, which play an important rôle
in setting the scale of the Universe, were found only beyond the range of the
trigonometric parallax method (d ~ 20 pc), whereas the secular and statistical
parallax methods could determine their distances out to d ~ 500 pc.

At the present, anomalous, moment in astronomy, when Hipparcos has 
provided positions and parallaxes at one epoch for which no comparable data 
exist from earlier epochs, the secular and statistical parallax methods are 
less important than they were, although the volume around the Sun within 
which they work is still an order of magnitude larger than that accessible 
to trigonometric parallaxes. They will probably suffer a renaissance if a 
second astrometric satellite flies early in the next century, and extremely 
high-precision proper motions become available. 


2.3 Magnitudes and colors 


Every astronomical body emits radiation over much of the electromagnetic
spectrum. Ideally, we would like to measure the complete spectral distribution
of this radiation. Then we could determine the energy received in terms
of the flux per unit frequency interval $f_\nu$ in W m$^{-2}$ Hz$^{-1}$, or the flux per
unit wavelength interval $f_\lambda$ in W m$^{-2}$ nm$^{-1}$, over the entire spectrum.

In practice, the measurement of such absolute energy distributions is
difficult, for two reasons. First, the absolute response of the observing equip- 
ment must be determined. Second, different frequencies of electromagnetic 
energy penetrate the Earth’s atmosphere to different depths. Consequently, 
at many frequencies observations can be made only from special sites such 
as the South Pole (Lane & Stark 1996) or even from space — see Figure 2.11. 
Hence, to obtain the absolute energy distribution of an object one has to 
combine data from many different instruments, and usually data for some 
frequency ranges will be lacking. 

For many purposes, we do not need to find the complete energy distri- 
bution of an object but merely the total energy received by a detector in 
some definite range of frequencies, that is, the integrated radiation flux $f$
measured in units of W m$^{-2}$, contained in a particular frequency range $\Delta\nu$.


[Figure 2.11: altitude plotted against log(λ/m) and log(ν/Hz); altitude labels: sea level, mountain, aircraft, balloon, rocket, satellite; wavebands: radio, IR, UV, X-ray, γ-ray.]

Figure 2.11 The depth in the Earth’s atmosphere to which electromagnetic
radiation penetrates as a function of frequency. The line
indicates the altitude at which the flux of an astronomical source
is attenuated by approximately a factor of e = 2.718.... The altitudes
for various observing locations and the standard division of
the spectrum into wavebands are marked.


In particular, we can use detectors such as CCDs to measure the apparent
brightnesses of objects in various bands in the optical and infrared regions 
of the spectrum; this procedure is called astronomical photometry — see 
Budding (1993), Sterken & Manfroid (1992) or Hayes, Pasinetti & Davis 
Philip (1985). 


2.3.1 Apparent magnitudes 


The apparent brightnesses of stars are expressed in terms of their apparent 
magnitudes. The ancient Greek astronomers divided stars into six magnitude 
groups judged to be separated by equal steps in brightness. The brighter stars 
were of the first magnitude, and the faintest that could be seen by the naked 
eye were of the sixth magnitude. From subsequent physiological studies, 
it was learned that equal steps of brightness sensed by the eye correspond 
fairly well to equal ratios of radiant energy; that is, the response of the eye 
to stimulus by light is essentially logarithmic in intensity. Thus, if $m_1$ and
$m_2$ denote the magnitudes assigned to stars with energy fluxes $f_1$ and $f_2$,
then

$$m_1 - m_2 = -k\log_{10}\left(\frac{f_1}{f_2}\right), \tag{2.26}$$

where the minus sign is chosen so as to assign smaller numerical values to
brighter stars.

Photometric studies in the nineteenth century showed that sixth-magnitude
stars are about a hundred times fainter than first-magnitude stars.
Hence, following the suggestion of N. Pogson, the magnitude system was
defined such that a difference of 5 mag corresponds exactly to a factor
of one hundred in the ratio of radiation fluxes. Thus, for $f_1/f_2 = 100$,
$m_2 - m_1 = 5$. Hence, in equation (2.26), $k = 2.5$, and, in general,

$$m_1 - m_2 = -2.5\log_{10}\left(\frac{f_1}{f_2}\right), \tag{2.27}$$

or

$$\frac{f_1}{f_2} = 10^{-0.4(m_1 - m_2)}. \tag{2.28}$$



Interestingly, $m_1 - m_2 = 1.086 \ln(f_2/f_1)$ (since $2.5/\ln 10 \simeq 1.086$), so that
the scale defined by the ancient Greeks was essentially one based on natural
logarithms. In particular, if $f_1/f_2 = 1 + \Delta f$ is close to unity
($\Delta f \ll 1$), then


$$\Delta m \equiv m_2 - m_1 \simeq 1.086\,\Delta f; \qquad (2.29)$$


that is, the magnitude difference (when small) between two objects is about 
equal to the fractional difference in their relative brightnesses. It follows 
from equation (2.28) that a magnitude difference of 1 mag corresponds to a 
flux ratio of $10^{0.4} \simeq 2.512$, and 2.5 mag corresponds to a flux ratio of 10.
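
The conversions in equations (2.27) to (2.29) are easy to check numerically. The following is a minimal Python sketch; the function names are illustrative choices, not notation from the text.

```python
import math

def magnitude_difference(f1, f2):
    """Return m1 - m2 for two fluxes, following equation (2.27)."""
    return -2.5 * math.log10(f1 / f2)

def flux_ratio(m1, m2):
    """Return f1/f2 for two magnitudes, following equation (2.28)."""
    return 10.0 ** (-0.4 * (m1 - m2))

# A star 1 mag brighter (smaller m) delivers about 2.512 times the flux.
ratio_1mag = flux_ratio(0.0, 1.0)
# A 5 mag difference is a factor of 100 by definition.
ratio_5mag = flux_ratio(1.0, 6.0)

# Small-difference approximation (2.29): star 1 has flux 1 + delta_f, star 2 has flux 1.
delta_f = 0.01
exact = magnitude_difference(1.0, 1.0 + delta_f)  # this is m2 - m1
approx = 1.086 * delta_f
```

For a 1 per cent flux difference the exact and approximate magnitude differences agree to better than 0.0001 mag.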

Since the atmosphere absorbs some photons, and others are lost in the 
telescope systems and are not detected, we do not directly measure the flux 
$f_\nu$ that reaches the solar system from the star. Instead, we measure


$$f = \int_0^\infty f_\nu T_\nu F_\nu R_\nu \, d\nu, \qquad (2.30)$$


where $T_\nu$ is the transmission of the atmosphere, $F_\nu$ is the transmission of
any filter that has been put into the system to isolate a particular range
of frequencies, and $R_\nu$ is the efficiency of the telescope system; that is,
$R_\nu$ = (energy detected/incident energy)$_\nu$.

The transmission of the atmosphere satisfies $T_\nu \propto e^{-a}$, where $a$ is
proportional to the column density of air along the line of sight. This is obviously
least at the zenith and increases towards the horizon. The ratio of the actual
value of $a$ to its value $a_0$ at the zenith is called the air mass, $a/a_0$. For
a plane-parallel atmosphere, air mass = sec z, where z is the zenith distance
of the object being observed. Thus atmospheric extinction produces a
dimming of starlight according to the relation


$$m(z) = k \sec z + \text{constant}, \qquad (2.31)$$


where $k$ is a constant and $m(z)$ is the observed stellar magnitude at zenith
distance z. We can correct for atmospheric extinction (at wavelengths where 
the atmosphere is not totally opaque) by observing the magnitude of a star at 
several values of z. A fit to these data then yields the constant k in equation 
(2.31) and allows us to correct not only to unit air mass at z = 0 but also 
to extrapolate to zero air mass (that is, “sec z = 0”) and thus obtain the
magnitude outside the Earth’s atmosphere — see Sterken & Manfroid (1992) 
for details. 
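
The fit described above is a straight-line regression of observed magnitude against air mass. The sketch below uses only the standard library and synthetic, noise-free data; the values k = 0.20 and an extra-atmospheric magnitude of 10.00 are illustrative assumptions, not measurements.

```python
import math

def fit_extinction(zenith_distances_deg, magnitudes):
    """Least-squares fit of m(z) = k * sec(z) + m0, equation (2.31).

    Returns (k, m0); m0 is the extrapolation to zero air mass.
    """
    x = [1.0 / math.cos(math.radians(z)) for z in zenith_distances_deg]  # air mass
    n = len(x)
    x_mean = sum(x) / n
    m_mean = sum(magnitudes) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxm = sum((xi - x_mean) * (mi - m_mean) for xi, mi in zip(x, magnitudes))
    k = sxm / sxx
    m0 = m_mean - k * x_mean
    return k, m0

# Synthetic observations of one star at four zenith distances (degrees).
z_obs = [0.0, 30.0, 45.0, 60.0]
m_obs = [10.00 + 0.20 / math.cos(math.radians(z)) for z in z_obs]

k_fit, m0_fit = fit_extinction(z_obs, m_obs)
m_zenith = m0_fit + k_fit  # magnitude at unit air mass (z = 0)
```

With real data the scatter about the fitted line gives a direct check on whether the sky was photometric during the observations.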

The filter transmission $F_\nu$ in equation (2.30) is readily determined and
can, in fact, be chosen at will to measure the energy contained in definite
wavelength intervals. Most photometric systems employ several different
filters, and the filter band used must always be stated when giving an apparent
magnitude. A plot of $F_\nu$ versus wavelength will typically show a hump that
may be crudely characterized by a central or effective wavelength $\lambda_{\rm eff}$
and a width. The latter is conventionally given as the wavelength difference
between the points at which $F_\nu$ falls to half its peak value; this quantity is
called the full width at half maximum, or FWHM for short. Tables 2.1 and 2.3
list these parameters for some widely used filter systems.

The instrumental efficiency $R_\nu$ in equation (2.30) is a composite of the
efficiency of the telescope’s optical system and the sensitivity of the photon 
receiver itself. While mirror reflectivities can be measured fairly easily, the 
other two factors just mentioned are extremely difficult to determine accu- 
rately. In practice, therefore, the system must be calibrated by measuring 
its response to a source of known brightness. At most observatories this is a 
standard star, whose brightness has been determined (with great difficulty) 
at some other observatory by using a telescope to observe both the star and 
a terrestrial source of known characteristics — see below. The determination 
of the brightnesses of standard stars is difficult, and the results correspond- 
ingly uncertain, so we know the relative brightnesses of stars much more 
accurately than their absolute brightnesses. 

Until the 1950s, most astronomical photometry was done photographically
with two systems: the blue-violet-sensitive international photographic
system, giving magnitudes $m_{\rm pg}$, and the photovisual system, giving
magnitudes $m_{\rm pv}$, whose wavelength sensitivity simulates that of the eye. Unfortunately,
the photographic plate has a nonlinear response to different levels of light in- 
tensity, and the relation between incident intensity and photographic darken- 
ing must be determined empirically by calibrating each plate. Furthermore, 
the dynamic response range of a plate is only about a factor of twenty in 
intensity; to cover larger ranges, a set of several plates must be used. Given 
such properties of the detector, it becomes very difficult to do absolute pho- 
tometry and to extend a magnitude scale over a wide range of intensity 
without making large systematic errors. Many of the results in the older
photographic photometric catalogs (see Appendix B) are, unfortunately,
seriously affected by such errors. On the other hand, if some stars on the plate
have accurately known brightnesses, then it is usually possible to make
differential measurements to ±0.1 mag, which is adequate for many purposes.
In the best cases, such differential measurements may be accurate to ±0.03
mag.

Astronomical photometry was revolutionized with the advent of photo- 
electric photometers. These instruments have strictly linear responses, they 
have an enormous dynamic range, and they are capable of a factor of ten better
precision than photographic plates. Consequently, they yield magnitudes
accurate to ±0.01 mag and magnitude differences often accurate to ±0.002
mag. At this level of accuracy, color-magnitude diagrams of star clusters
become sensitive diagnostic tools (see §3.5). The advent in the early 1980s 
of CCD detectors was a further important advance since, like photoelec- 
tric detectors, CCDs are linear, but, unlike photoelectric detectors, they are 
imaging devices. Thus, with a CCD many stars can be measured simultane- 
ously with high accuracy. Moreover, CCD photometry can be done for stars 
in a very crowded field, such as the centre of a globular cluster, which are 
inaccessible to conventional photoelectric photometry. 

The standard photometric system today is based on the ultraviolet- 
blue-visual (UBV) system of Johnson & Morgan (1953). As measure- 
ments in the infrared became more important, this system was extended by 
adding seven bands reaching out to beyond 1000 nm (see Table 2.1). Magnitudes
in the extended system are denoted by the capital letters designating
the filter; thus V magnitudes are now the standard visual magnitudes, K 
magnitudes the most important near-infrared ones, and M magnitudes the 
longest-wavelength ones in common use. 

A photometric system is ideally defined by a particular filter-detector 
combination. Using this equipment an observer anywhere can reproduce 
accurately a given spectral response. Of course different observers have tele- 
scopes of varying quality, so no two observers would measure exactly the 
same brightness for a given star. However, so long as they use the same 
filter-detector combination, they should always agree on the ratio of the 
brightnesses of any two given stars. Consequently, a photometric system 
such as the UBV system is in practice established by one particular ob- 
server assigning magnitudes to stars that are well distributed over the sky. 
Any other observer can then measure the brightness of any star by using 
the standard filter-detector combination to compare the star of interest to a 
nearby standard star. 

Unfortunately, no photometric system satisfies the ideal just described 
because the steady advance of technology soon renders any given filter-detector
combination obsolete. Consequently, astronomers are forever trying
to measure, say, Johnson-Morgan B-band magnitudes with equipment that
does not have the B-band spectral response implicitly specified by Johnson
& Morgan (1953). Strictly speaking, this measurement is possible only if the
star being observed has exactly the same spectrum as one of the Johnson & 
Morgan standard stars; the contemporary astronomer’s equipment and that 
of Johnson & Morgan would measure different portions of the spectra of the 
program and standard stars, and so intensity ratios measured with each in- 
strument will agree only if the spectra of the two stars are proportional to 
one another. 

In practice, UBV values accurate to about 0.01 mag can be obtained
by choosing a filter that matches a contemporary detector to the
Johnson-Morgan filter-detector combination.¹¹ Moreover, brightnesses measured with
several different filters usually provide sufficient knowledge of any deviations
between the spectra of the program and standard stars to enable accurate
UBV values to be determined even when none of the adopted filters provides
a match to the Johnson-Morgan filter-detector combination. Suppose
an object has been observed through filters that do not precisely coincide
with ones that match the standard filter-detector combinations, but provide
reasonable approximations to those combinations. Let magnitudes in
the observed bands be denoted U′, B′, V′, etc., and those in the UBV system
U, B, V, etc.¹² The observed magnitudes are ‘transformed’ to standard
UBV magnitudes by relations of the type


$$V = \alpha_V V' + \beta_V (B' - V') + \gamma_V. \qquad (2.32)$$


The constants $\alpha_V$, $\beta_V$ and $\gamma_V$ are determined such that the transformed
UBV-magnitudes of standard stars agree as closely as possible with the pub- 
lished UBV-magnitudes of these stars. Leggett (1992) gives transformation 
equations of this type between different photometric systems. 
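
One way to determine the three constants in equation (2.32) is an ordinary least-squares fit to a set of standard stars. The sketch below is only an illustration under that assumption: the instrumental magnitudes are synthetic, the 'true' coefficients (1.0, 0.05, −0.30) are invented for the test, and the helper names are hypothetical.

```python
def solve3(a, b):
    """Solve a 3x3 linear system by Gaussian elimination with partial pivoting."""
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, 3):
            factor = m[r][col] / m[col][col]
            for c in range(col, 4):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * 3
    for r in (2, 1, 0):
        x[r] = (m[r][3] - sum(m[r][c] * x[c] for c in range(r + 1, 3))) / m[r][r]
    return x

def fit_transformation(v_inst, b_inst, v_std):
    """Fit alpha, beta, gamma in V = alpha*V' + beta*(B' - V') + gamma."""
    rows = [(vp, bp - vp, 1.0) for vp, bp in zip(v_inst, b_inst)]
    # Normal equations (A^T A) x = A^T y for the three unknowns.
    ata = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    aty = [sum(r[i] * v for r, v in zip(rows, v_std)) for i in range(3)]
    return solve3(ata, aty)

# Synthetic instrumental magnitudes V', B' for five standard stars.
v_inst = [8.0, 9.5, 10.2, 11.7, 12.4]
b_inst = [8.3, 10.6, 10.1, 12.9, 13.0]
TRUE = (1.0, 0.05, -0.30)  # invented coefficients used to build the "published" V
v_std = [TRUE[0] * vp + TRUE[1] * (bp - vp) + TRUE[2]
         for vp, bp in zip(v_inst, b_inst)]

alpha, beta, gamma = fit_transformation(v_inst, b_inst, v_std)
```

The standards should span a wide range of color, otherwise the color coefficient is poorly constrained.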

The extension of the UBV-system into the infrared has not proceeded 
smoothly, with the result that the bands longward of V are not as well defined 
as is desirable. Johnson & Mitchell (1962), Johnson et al. (1966), Johnson 
(1965) added bands RIJKLMN that cover the range 700–10040 nm. Cousins
(1976) later redefined the RI bands, and when R and I magnitudes are now
cited, they are usually (but not invariably) on the Cousins rather than the 
Johnson system. Glass (1974) defined an H band that lies between the J 
and K bands, rather than coming before J. Other small adjustments to the 
Johnson infrared bands have been made by different observers with the result 
that there is now no truly standard system. A useful definition of JHKL 
magnitudes will be found in Elias et al. (1982). In the infrared, the Earth’s 
atmosphere is only transparent within certain wavelength ‘windows,’ and the 
choices of central wavelengths and bandpasses for the infrared extensions of
the UBV system have, to a large extent, been dictated by the locations of
these windows.


11 Mismatched filters are a significant cause of confusion, however. See Bessell (1993)
for a discussion of this problem.

12 We use the standard convention that $V = m_V$, etc., where $m_V$ is an object’s apparent
magnitude in the V band.

Table 2.1 gives the effective wavelengths and the full widths at half max- 
imum sensitivity for both the extended UBV-system and several other widely 
used filter systems. The Thuan-Gunn system was designed to avoid wave- 
lengths at which the spectrum of the night-sky shows prominent emission 
lines. The Hipparcos band was defined to be exceptionally broad so as to 
maximize the flux received from stars of every color. 

Table 2.2 lists the parameters of some of the filters used for observations 
with HST. These filters, which correspond approximately to Johnson band- 
passes, are the most widely used of the large set that is available on HST 
instruments. For the optical imager, WFPC2, any filter may be identified by 
a name such as F555W, which indicates a filter centered on 555 nm that has
a wide bandpass (other possible suffixes being ‘M’ for medium bandpass, ‘N’ 
for narrow bandpass, and ‘LP’ for a filter which passes all the light longward 
of a particular wavelength). The infrared imager, NICMOS, has a similar 
system for designating its filters except that the wavelength scale is shifted 
by a factor of ten: thus, for example, F175W in this instrument is a wide 
bandpass filter centered on 1.75 μm. Since HST is not restricted to parts
of the infrared where the Earth’s atmosphere is transparent, the NICMOS 
filters give more complete spectral coverage than the extended UBV system 
in the near infrared. 
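
The naming scheme just described can be decoded mechanically. The sketch below is a hypothetical helper, not part of any HST software; it returns the nominal wavelength encoded in the name, which need not equal the effective wavelength listed in Table 2.2.

```python
import re

# Suffix meanings as described in the text.
SUFFIXES = {"W": "wide", "M": "medium", "N": "narrow", "LP": "long pass"}

def parse_hst_filter(name, instrument="WFPC2"):
    """Return (nominal wavelength in nm, bandpass type) for a name such as 'F555W'.

    For an 'LP' filter the wavelength is the cut-on wavelength, not a center.
    """
    match = re.fullmatch(r"F(\d{3,4})(LP|W|M|N)", name)
    if match is None:
        raise ValueError(f"unrecognized filter name: {name}")
    digits, suffix = match.groups()
    # NICMOS names are scaled by ten: F175W is centered on 1.75 um = 1750 nm.
    scale = 10 if instrument == "NICMOS" else 1
    return int(digits) * scale, SUFFIXES[suffix]

optical = parse_hst_filter("F555W")
infrared = parse_hst_filter("F175W", instrument="NICMOS")
```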

The zero-point of each scale is usually chosen such that the star Vega
has $U = B = V = H_p = V_T = \ldots = 0$. Examples of V magnitudes are: Sun,
V = −26.74; Sirius (apparently brightest star), V = −1.45; faintest stars
measured, V = 27.¹³ The total range from the Sun to the faintest measurable
stars is about 54 mag, or a ratio of $3 \times 10^{21}$ in apparent brightness!


2.3.2 Colors 


Suppose we have a photometric system with several filter bands at different 
wavelengths. Then, by taking the difference in magnitudes measured in two 
different bands, we can form a color, or color index. That is, if A and B 
denote two different filters, we write 


$$({\rm C.I.})_{AB} \equiv m_A - m_B = {\rm const.} - 2.5 \log\left[\frac{\int_0^\infty d\lambda\, S_\lambda(A) f_\lambda}{\int_0^\infty d\lambda\, S_\lambda(B) f_\lambda}\right], \qquad (2.33)$$


where $S_\lambda$ denotes the combined telescope-receiver-filter sensitivity.¹⁴ A color
index is usually written using the letters that denote the different filters
involved; that is, (A − B) for the hypothetical example just given or (B − V)
and (U − B) for the standard UBVRI system. As is clear from equation
(2.33), a color index essentially measures the ratio of stellar fluxes near the
effective wavelengths of the two bands. Since this quantity depends only
on a ratio of fluxes, it measures a property of an object’s spectrum that is
independent of its distance (which is highly desirable, given the uncertainties
in the distances to many astronomical objects).


13 Remarkably, for historical reasons, the apparent magnitude of Vega, V = +0.04,
differs significantly from zero.

14 $S_\lambda$, which is usually normalized so that its peak value is 1, is tabulated for several
systems in Lamla (1982).


Table 2.1 Filter characteristics of broad-band photometric systems

System       Band   λ_eff/nm   FWHM/nm   f_X(A0V)/Jy   L_⊙/10^25 W   M_⊙
UBVRI        U         365        66        1780           1.86       5.61
             B         445        94        4000           4.67       5.48
             V         551        88        3600           4.64       4.83
             R         658       138        3060           6.94       4.42
             I         806       149        2420           4.71       4.08
             J        1220       213        1570           2.49       3.64
             H        1630       307        1020           1.81       3.32
             K        2190       390         636           0.82       3.28
             L        3450       472         281           0.17       3.25
             M        4750       460         154            -          -
Hipparcos    H_p       550       225          -             -          -
Tycho        B_T       420        75          -             -          -
             V_T       510       100          -             -          -
Thuan-Gunn   g         512       120          -             -          -
             r         668       100          -             -          -
             i         792       150          -             -          -
             z         912       140          -             -          -
SDSS         u'        352        63          -             -          -
             g'        480       141          -             -          -
             r'        625       139          -             -          -
             i'        769       154          -             -          -
             z'        911       141          -             -          -

SOURCE: From data published in Bessell (1993), Perryman et al. (1997),
Thuan & Gunn (1976), Schneider, Gunn & Hoessel (1983) and Fukugita
et al. (1996). The UBVRI bands are for the combined Johnson-Cousins-Glass
system. f_X(A0V) is the X-band flux density [equation (2.40)] of a
V = 0 A0V star. The solar luminosities and solar magnitudes to the red
of V were derived from Buser & Kurucz (1992) solar models by W. Vacca.
Note that 1 Jy = $10^{-26}$ W m$^{-2}$ Hz$^{-1}$.

Table 2.2 Broad-band HST filters

Instrument   Name     λ_eff/nm   FWHM/nm   UBV analog
WFPC2        F336W      332.7       37.1    U
             F439W      429.2       46.4    B
             F555W      525.2      122.3    V
             F675W      673.5       88.9    R
             F814W      826.9      175.8    I
             F300W      292.4       72.8    Wide U
             F450W      444.5       92.5    Wide B
             F606W      584.3      157.9    Wide V
NICMOS       F110W     1100        300      J
             F140W     1400        500
             F160W     1600        200      Narrow H
             F175W     1750        550      Wide H
             F187W     1875        125
             F205W     2050        300

SOURCE: From Biretta et al. (1996) and MacKenty et al. (1997); see also
http://www.stsci.edu/instrument-news/handbooks/


Table 2.3 Filter characteristics of intermediate-band photometric systems

System       Band   λ_eff/nm   FWHM/nm
Strömgren    u        349        30
             v        411        19
             b        467        18
             y        547        23
             β_w      489        15
             β_n      486         3
DDO          45       451.7       7.6
             42       425.7       7.3
             41       416.6       8.3
             38       380.0      17.2
             35       349.0      37.0

SOURCE: From data published in Crawford (1966) and
McClure & van den Bergh (1968)


The detailed shape of the energy distribution emitted by a star is determined
by a few basic physical parameters such as the stellar temperature,
surface gravity, and chemical composition. By measuring broad-band colors,
we can, to some extent, probe these physical properties. For example, since
cool stars are red and hot stars are blue, a color index such as (B − V) that
measures the flux ratio between a short and a long wavelength provides a
measure of the stellar temperature. As another example, the ultraviolet spectra
of some stars contain large numbers of absorption lines whose strengths
reflect the abundances of the heavier elements in their atmospheres, while the
B band is relatively line free. Thus, (U − B) is sensitive to stellar chemical
composition. Unfortunately, there is another important physical parameter
that also affects the observed value of (U − B). There is a sharp drop
in continuum intensity in some hot stars at wavelengths shorter than about
370 nm; this drop arises from the sudden onset of continuum absorption from
the n = 2 level of hydrogen, and it is called the Balmer jump. Since 370 nm
lies right in the middle of the U band, (U − B) is sensitive to the strength
of the Balmer jump as well as to chemical composition.

As this example illustrates, broad-band filters provide too crude a mea- 
sure of a star’s spectral energy distribution to distinguish between the effects 
of quite disparate physical phenomena. In order to lift such degeneracies, we 
need to use more finely-tuned filters, which pick out individual features in a 
star’s spectrum. Although it is tempting to design a filter set with very nar- 
row bandpasses in order to isolate spectral features, such filters will transmit 
so little light that their use is not practicable for any but the brightest stars. 
One is therefore forced to compromise, and adopt intermediate-bandpass systems
with filter bandwidths in the range 10 nm ≲ FWHM ≲ 30 nm. Once
such a filter system has been constructed, we can derive correlations between
the chosen color indices and the physical parameters of stars, so that
we can ultimately use the observed colors as diagnostic tools to infer the
physical properties of stellar atmospheres. Table 2.3 describes two important
intermediate-band filter sets, the Strömgren and DDO systems, which
have been developed for this purpose.

The Strömgren system [Strömgren (1966), Crawford (1966)] has four
intermediate-band filters, uvby, and a narrow-band pair, (β_n, β_w), measuring
the Hβ line and adjacent continuum. The usual color indices employed in
this system are (b − y), (u − b), c_1 = (u − v) − (v − b), and m_1 = (v − b) − (b − y).
Both (b − y) and (u − b) serve as temperature indicators; c_1 is a temperature
indicator for hot stars and a luminosity indicator for cooler stars; (β_n − β_w)
is a luminosity indicator for hot stars and a temperature indicator for cooler
stars. For cooler stars m_1 measures the amount of line-blanketing, the
dimming of the blue part of the spectrum caused by millions of heavy-element
absorption lines. The parameter m_1 also acts as an indicator of spectral
peculiarity in some stars. For recent work on the calibration of this system,
see Strömgren (1987), Olsen (1987) and Edvardsson et al. (1993).
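
The Strömgren indices are simple differences of the four intermediate-band magnitudes. The sketch below uses the standard definitions c_1 = (u − v) − (v − b) and m_1 = (v − b) − (b − y); the input magnitudes are made-up values chosen only to make the arithmetic easy to follow.

```python
def stromgren_indices(u, v, b, y):
    """Return (b - y, c1, m1) from uvby magnitudes."""
    b_y = b - y
    c1 = (u - v) - (v - b)   # measures the Balmer discontinuity
    m1 = (v - b) - (b - y)   # measures line blanketing
    return b_y, c1, m1

# Illustrative magnitudes, not data for a real star.
b_y, c1, m1 = stromgren_indices(u=7.50, v=6.60, b=6.20, y=6.00)
```

Because both c_1 and m_1 are differences of two colors, each is largely insensitive to a smooth change in the overall slope of the spectrum.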

The DDO system employs five filters. The C(42 − 45) color index is
primarily a measure of effective temperature (§3.4), while the C(45 − 48)
color is sensitive to surface gravity. The C(41 − 42) index is sensitive to the
strength of spectral lines due to CN molecules and Fe atoms, and is thus a
measure of the abundances of heavier elements. Details of the calibration of 
the DDO system can be found in Claria et al. (1994). 

Finally, it must be remembered that the effects of interstellar extinction
on colors have been ignored in this discussion. Because the interstellar
medium scatters light more efficiently at short wavelengths than at long
wavelengths, transmitted starlight is not only dimmed by interstellar extinction
but is also reddened. It is therefore necessary to correct observed colors for 
interstellar reddening in order to derive intrinsic colors. As we shall see in 
§3.7.1, it is fortunate that interstellar extinction is accompanied by redden- 
ing, for it is possible to devise combinations of color indices that allow us to 
determine the amount of reddening present, and hence to correct for both 
reddening and extinction. 


2.3.3 Absolute magnitudes 


The energy flux we receive at the Earth from an object depends on both its 
intrinsic brightness and its distance. If F is the flux received when the object 
is at distance D, the flux f that would be received if it were at some other 
distance d is given by the inverse square law, 


$$f = \left(\frac{D}{d}\right)^2 F. \qquad (2.34)$$


Obviously, the farther away an object is, the fainter it will appear, and to 
obtain information about the relative intrinsic brightnesses of objects, we 
must account for differences in their distances from us. We therefore define 
the absolute magnitude M to be the apparent magnitude an object would 
have if it were located at some standard distance D. From equations (2.27) 
and (2.34), we see that 


$$m - M = -2.5 \log\left(\frac{f}{F}\right) = 5 \log\left(\frac{d}{D}\right). \qquad (2.35)$$

The standard distance D is always taken to be 10 pc, so if d is measured in 
parsecs, then 

$$m - M = 5 \log d - 5. \qquad (2.36)$$


The quantity (m — M) is called the distance modulus of the object. If we 
know m and d, we can immediately correct for the nonstandard distance of 
the object and reduce the apparent magnitude m to the absolute magnitude 
M via equation (2.36). Conversely, if we know m and M, we can infer d. 
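
Equation (2.36) and its inverse can be sketched in a few lines of Python. The function names are illustrative; the solar example uses the apparent magnitude V = −26.74 quoted in §2.3.1 and the standard conversion 1 AU = 1/206 265 pc.

```python
import math

def distance_modulus(d_pc):
    """m - M for a distance in parsecs, equation (2.36)."""
    return 5.0 * math.log10(d_pc) - 5.0

def absolute_magnitude(m, d_pc):
    """Reduce an apparent magnitude to the standard distance of 10 pc."""
    return m - distance_modulus(d_pc)

def distance_pc(m, M):
    """Invert equation (2.36) for the distance in parsecs."""
    return 10.0 ** ((m - M + 5.0) / 5.0)

AU_IN_PC = 1.0 / 206265.0
sun_modulus = distance_modulus(AU_IN_PC)       # close to -31.57 mag
sun_M_V = absolute_magnitude(-26.74, AU_IN_PC)  # close to +4.83
d_example = distance_pc(5.0, 0.0)               # a modulus of 5 mag is 100 pc
```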

Absolute magnitudes are normally quoted for the visual band and denoted
$M_V$. It should be noted that the absolute magnitude is not a direct
measure of the total energy output (luminosity) of an object, but only of 
the energy in the V band. To measure total energy output, we use so-called 
‘bolometric magnitudes,’ which will be discussed in §2.3.4. 

The distance from the Earth to the Sun is 1 AU = (1/206 265) pc. Thus,
we know immediately that the distance modulus of the Sun is −31.57 mag
and therefore that the absolute magnitude of the Sun is $M_V(\odot) = +4.83$. We
shall see in §3.5 that the Sun is a star of rather average intrinsic brightness:
the most luminous stars are about $10^6$ times brighter, and the least luminous
are more than $10^4$ times fainter.


Table 2.4 Redshifts at which one UBV band is shifted to another

      U     B     V     R     I     J     H     K     L     M
U     0    0.22  0.51  0.80  1.21  2.34  3.47  5.00  8.45  12.01
B           0    0.24  0.48  0.81  1.74  2.66  3.92  6.75   9.67
V                 0    0.19  0.46  1.21  1.96  2.97  5.26   7.62
R                       0    0.22  0.85  1.48  2.33  4.24   6.22

Finally, we must caution that absorption and scattering of photons in 
interstellar space cause stars to appear dimmer than they would be from 
distance effects alone, and hence increase their apparent magnitudes — see 
§3.7.1. This dimming is usually expressed in magnitudes. If there are A 
magnitudes of interstellar extinction, equation (2.36) must be rewritten as 


$$m - M = 5 \log d - 5 + A. \qquad (2.37)$$


In §3.7.1 we shall see that the value of A depends strongly on waveband in 
the sense that it decreases towards longer wavelengths. 

If the object is moving rapidly away from us, perhaps because it lies at 
a cosmologically-significant distance, the photons received in the V band, for 
example, will have been emitted at shorter wavelengths. Consequently, the 
observed value of V is not directly connected to the value $M_V$ that we would
measure if the object were stationary at 10 pc. If we know the shape of the
object’s spectrum, we can infer a relationship between V and $M_V$, however.
This relationship is written


$$m - M = 5 \log d - 5 + A + K, \qquad (2.38)$$


where K is called the K-correction. For several different photometric sys- 
tems and galaxy types, Frei & Gunn (1994) tabulate a quantity k(z), which 
is related to K by K = k+2.5log(1+z), where z is the redshift of the galaxy 
[equation (2.11)]. 
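
To see how the extinction and K-correction terms enter a distance estimate, equation (2.38) can be inverted for d. The sketch below is illustrative only: the function names are hypothetical, and the value of k passed to `k_correction` would in practice be read from a tabulation such as that of Frei & Gunn (1994).

```python
import math

def k_correction(k, z):
    """K = k + 2.5 log10(1 + z), as stated in the text."""
    return k + 2.5 * math.log10(1.0 + z)

def distance_pc(m, M, A=0.0, K=0.0):
    """Invert equation (2.38) for the distance d in parsecs."""
    return 10.0 ** ((m - M - A - K + 5.0) / 5.0)

# Ignoring one magnitude of extinction overestimates the distance
# by a factor of 10**0.2, about 1.58.
d_no_extinction = distance_pc(15.0, 0.0)
d_with_extinction = distance_pc(15.0, 0.0, A=1.0)

# With k = 0 the K-correction reduces to the 2.5 log10(1 + z) term alone.
K_z1 = k_correction(0.0, 1.0)
```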

Since we have good spectral information only for nearby galaxies and the 
spectra of galaxies evolve in time, K-corrections are reliable only for nearby 
galaxies, for which $k \simeq 0$. In reality we should compare the brightness of
a distant galaxy with the brightness of a nearby galaxy in the waveband at
which the photons collected from the distant galaxy were emitted. Table 2.4
gives the redshift z for which light emitted at the center of either the U, B,
V or R band will be observed to lie at the center of another band in the
UBV system. For example, suppose that we make J-band measurements of a galaxy
at z = 1.21. Then the collected photons were emitted in the V band and we
can accurately infer the V magnitude that would be measured by an observer 
located at the Sun for whom the galaxy was at rest. 
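
The entries of Table 2.4 follow from the effective wavelengths in Table 2.1, assuming the usual relation 1 + z = λ_observed/λ_emitted between redshift and wavelength. A minimal sketch, with a hypothetical function name:

```python
# Effective wavelengths in nm, taken from Table 2.1.
LAMBDA_EFF = {"U": 365, "B": 445, "V": 551, "R": 658, "I": 806,
              "J": 1220, "H": 1630, "K": 2190, "L": 3450, "M": 4750}

def shift_redshift(emitted_band, observed_band):
    """Redshift at which the center of emitted_band lands on the center of observed_band."""
    return LAMBDA_EFF[observed_band] / LAMBDA_EFF[emitted_band] - 1.0

z_V_to_J = shift_redshift("V", "J")   # the example discussed in the text
z_U_to_B = shift_redshift("U", "B")
z_U_to_K = shift_redshift("U", "K")
```

Rounded to two decimal places, these values reproduce the corresponding entries of the table.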
2.3.4 Absolute energy distributions and bolometric magnitudes


Color indices provide a good deal of useful information about stellar energy 
distributions, but, as was mentioned earlier, we would ideally like to know 
the detailed variation of the energy flux per unit wavelength (or frequency) 
interval throughout the entire spectrum. If we have such observations, we 
can apply the theory of stellar atmospheres, and analyze the data to infer 
physical characteristics of stars [see Mihalas (1978), §7-4]. Furthermore, we 
can use absolute energy distributions to obtain total luminosities and stellar 
temperatures — see §3.4. 

The basic problem in this work is the measurement of the absolute 
efficiency of the telescope-receiver system; that is, the determination of the 
amount of energy that must be put into the system in order to produce unit 
instrumental response. The only practical way to derive the efficiency is to 
use the instrument to observe a source whose absolute energy distribution 
is known in advance, for then we know the amount of incident energy that 
produces the measured response. There are only two types of sources for 
which we can specify the rate of energy emission from unchallengeable theory: 
(i) a blackbody source at a known temperature, the energy distribution of 
which is given by the Planck function $B_\nu(T)$, and (ii) a synchrotron-radiation
source, from which the emission by relativistic electrons can be calculated 
as a function of their energy. The procedure is to observe such an absolute 
reference source with a given telescope-spectrometer system and then, with 
the same system, to observe a standard star. Observations of the absolute 
reference source calibrate the instrumental efficiency at each wavelength, 
and thus provide the factor required to convert the observed responses to 
the stellar radiation into an absolute energy emission from the star (after 
correction, of course, for atmospheric extinction). 

The literature now contains measurements of the absolute energy distribution
of α Vir,¹⁵ η UMa, and α Leo in the range 137 nm < λ < 292 nm
(Bless et al. 1976), of α Lyr in the range 130 nm < λ < 200 nm (Tanaka et al.
1984), of Vega in the range 330 nm < λ < 1080 nm (Hayes & Latham 1975),
and of α Tau in the range 1200 nm < λ < 35000 nm (Cohen, Walker & Witteborn
1992). The ultraviolet and infrared data are usually given directly
in Janskys.¹⁶ Optical data are often given in terms of a relative absolute
energy distribution, written in monochromatic magnitudes,


$$m_\nu = -2.5 \log\left(f_\nu / f_{\nu_0}\right), \qquad (2.39)$$


plus the absolute flux $f_{\nu_0}$ at the reference frequency $\nu_0$. With $\nu_0$ chosen
to correspond to λ = 555.6 nm, the present best estimate of $f_{\nu_0}$ for Vega is
$f_{\nu_0} = 3560 \pm 25$ Jy (Mégessier 1997). This format is chosen because relative
monochromatic magnitudes are easier to determine, and probably more accurately
known, than the absolute flux normalization. They are therefore likely
to remain unchanged even if the absolute flux requires future adjustment.


15 Bright stars are designated by the name of the constellation within which they lie,
preceded by a Greek letter; an α indicates that the star is the brightest in its constellation,
β the second-brightest, etc.

16 See the Notes to Table 2.1 for the definition of a Jansky.

Once the absolute energy distribution of even a single star is known 
precisely, it can henceforth be used as a reference standard directly on the 
sky. Observations of the absolute energy distributions of other stars are 
then relatively simple - one merely measures program stars and the refer- 
ence star with the same equipment, corrects for differences in atmospheric 
extinction, and then uses the known absolute distribution of the standard to 
obtain absolute distributions for the program stars. Catalogs of spectropho- 
tometric data for hundreds of stars of various spectral types are available 
(see Appendix B). These data are indispensable for performing astrophysical 
analyses both of individual stars and of the integrated energy distributions 
from stellar systems (clusters and galaxies), which are composites of contri- 
butions from many different spectral types. 

One often wants to know the flux from a star averaged over one of the
standard photometric bands. In some band X (X = U, B, V, ...), the X-band
flux density is


$$f_X \equiv \frac{\int_0^\infty d\nu\, S_X(\nu) f_\nu}{\int_0^\infty d\nu\, S_X(\nu)}, \qquad (2.40)$$


where $S_X(\nu)$ is the response of the standard X-band filter-photometer
combination. This quantity is determined by observing sources of known spectral
energy distribution as described above in connection with the determination
of $f_\nu$. Once $f_X$ has been determined from (2.40) for a single standard star,
we can immediately infer $f_X$ for any other star through


$$\log[f_X(*)] = \log[f_X(\text{standard})] - 0.4[X(*) - X(\text{standard})]. \qquad (2.41)$$


$f_X$ is tabulated for an A0V star in Table 2.1.
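
As a quick illustration of equation (2.41), the sketch below converts a V magnitude into a V-band flux density using the V = 0 A0V value of 3600 Jy from Table 2.1 as the standard. The function name is an illustrative choice.

```python
def band_flux_density(mag, mag_standard, flux_standard):
    """Equation (2.41): f_X(*) = f_X(standard) * 10**(-0.4 * [X(*) - X(standard)])."""
    return flux_standard * 10.0 ** (-0.4 * (mag - mag_standard))

F_V_A0V = 3600.0  # Jy, V-band flux density of a V = 0 A0V star (Table 2.1)

# A V = 5 star is 100 times fainter than the V = 0 standard.
f_v_star = band_flux_density(5.0, 0.0, F_V_A0V)
```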

The integral of the absolute energy distribution over all frequencies is 
an important quantity and is usually reported as a magnitude by defining 
the apparent bolometric magnitude $m_{\rm bol}$ as


$$m_{\rm bol} \equiv -2.5 \log\left(\int_0^\infty d\nu\, f_\nu\right) + C_{\rm bol}. \qquad (2.42)$$


Here $C_{\rm bol}$ is a constant that is determined by convention. Unfortunately, two
conventions are in widespread use, as will be explained below. Independent of
the choice of $C_{\rm bol}$, the difference between the absolute bolometric magnitudes
of two stars gives the ratio of their luminosities.

The difference between $m_{\rm bol}$ and the star’s magnitude $X$ in a band X (for
example, V) is called the bolometric correction $BC_X$:


$$BC_X \equiv m_{\rm bol} - X. \qquad (2.43)$$

If two stars have identical spectra, they will have identical values of $BC_X$,
even if their luminosities are different. Consequently, we may consider $BC_X$
to be a function of spectral type only, and $BC_X$ only needs to be determined
for one standard of each spectral type and the results tabulated, as
in Table 3.7.

The definition (2.43) of $BC_X$ involves the constant $C_{\rm bol}$ of equation
(2.42). One scale for bolometric magnitudes and corrections is determined
by choosing $C_{\rm bol}$ such that the Sun’s V-band bolometric correction is zero:
$BC_V(\odot) = 0$. In MKS units, the corresponding value of $C_{\rm bol}$ turns out
to be $C_{\rm bol} = -18.90$ (Problem 2.3). In the other widespread convention,
$C_{\rm bol}$ is chosen such that all bolometric corrections are negative and
$BC_V(\odot) = -0.19$.
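
The value of the constant can be checked by applying equations (2.42) and (2.43) to the Sun. The sketch below assumes a total solar flux at the Earth of 1367 W m⁻², a figure that is not given in the text, together with the solar apparent magnitude V = −26.74 from §2.3.1.

```python
import math

SOLAR_V = -26.74     # apparent V magnitude of the Sun (§2.3.1)
SOLAR_FLUX = 1367.0  # W m^-2; assumed solar constant, not taken from the text

def c_bol(bc_v_sun):
    """C_bol implied by m_bol(Sun) = V(Sun) + BC_V(Sun)."""
    m_bol_sun = SOLAR_V + bc_v_sun
    return m_bol_sun + 2.5 * math.log10(SOLAR_FLUX)

def apparent_bolometric_magnitude(total_flux, c):
    """Equation (2.42) for a frequency-integrated flux in W m^-2."""
    return -2.5 * math.log10(total_flux) + c

c_zero = c_bol(0.0)     # convention with BC_V(Sun) = 0
c_neg = c_bol(-0.19)    # convention with BC_V(Sun) = -0.19

# Magnitude differences, and hence luminosity ratios, do not depend on the convention.
diff_zero = apparent_bolometric_magnitude(1.0, c_zero) - apparent_bolometric_magnitude(100.0, c_zero)
diff_neg = apparent_bolometric_magnitude(1.0, c_neg) - apparent_bolometric_magnitude(100.0, c_neg)
```

Under these assumptions the first convention gives a constant close to the −18.90 quoted above, and the two conventions differ by exactly 0.19 mag.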


2.3.5 Mass-to-light ratios 


The brightness of an astronomical object is often quantified by stating how
many Suns, placed at the object's distance, would be required to produce
the observed flux. For the waveband $X$, this number is

$$ N = L_X/L_{\odot X} = 10^{-0.4(M_X - M_{\odot X})}, \tag{2.44} $$

where $L_X$ and $M_X$ are, respectively, luminosity and absolute magnitude
in the $X$ band. If the object's mass, $\mathcal{M}$, can be somehow measured,
it is interesting to compare $\mathcal{M}$ with the mass $N\mathcal{M}_\odot$
by calculating the mass-to-light ratio

$$ \Upsilon_X \equiv \frac{\mathcal{M}}{N\mathcal{M}_\odot} = \frac{\mathcal{M}/\mathcal{M}_\odot}{L_X/L_{\odot X}} = 10^{0.4(M_X - M_{\odot X})}\,\frac{\mathcal{M}}{\mathcal{M}_\odot}. \tag{2.45} $$


If the object's spectral energy distribution is the same as that of the Sun,
$\Upsilon_X$ is the same for every waveband $X$. Equation (2.45) shows that
in the general case $\Upsilon_X$ varies with $X$. Consider, for example, the
case of a giant elliptical galaxy, which might have $B - K = 3.3$, whereas
the Sun has $B - K = 2.2$. Thus for this galaxy
$\Upsilon_B/\Upsilon_K = 10^{0.4(3.3-2.2)} = 2.75$.
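The arithmetic of this example can be checked with a short sketch. The function name and argument names are ours; the relation used is simply the ratio of equation (2.45) evaluated in two bands, for which the mass cancels.

```python
def ml_ratio_between_bands(color_obj: float, color_sun: float) -> float:
    """Return Upsilon_X / Upsilon_Y for an object whose color is
    X - Y = color_obj, given the Sun's color X - Y = color_sun
    (both in magnitudes). The object's mass cancels in the ratio."""
    return 10.0 ** (0.4 * (color_obj - color_sun))


# Giant elliptical with B - K = 3.3 compared with the Sun's B - K = 2.2.
ratio = ml_ratio_between_bands(3.3, 2.2)
print(f"Upsilon_B / Upsilon_K = {ratio:.2f}")  # Upsilon_B / Upsilon_K = 2.75
```

An object redder than the Sun therefore has a larger mass-to-light ratio in the bluer band, and one with the Sun's colors has the same ratio in every band.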

The waveband dependence of $\Upsilon_X$ has been the source of considerable
confusion. Consequently astronomers sometimes try to estimate the bolometric
mass-to-light ratio

$$ \Upsilon_{\rm bol} \equiv \frac{\mathcal{M}/\mathcal{M}_\odot}{L_{\rm bol}/L_{\odot\,{\rm bol}}}. \tag{2.46} $$

Unfortunately, $\Upsilon_{\rm bol}$ can only be directly measured by observing
over the entire electromagnetic spectrum, which is rarely possible. Hence
estimates of $\Upsilon_{\rm bol}$ tend to rely on assumptions about the
object's spectral energy distribution that may be seriously in error.
2.3.6 Surface brightness and isophotal radii


To this point we have been concerned with total brightnesses and luminosi- 
ties. Galaxies, unlike most stars, are resolved objects, so that in addition 
to measuring the total radiative flux from a galaxy, we can, in principle, 
measure for each point in its image the surface brightness, which is the 
radiative flux per unit solid angle of the image. To a first approximation, the 
surface brightness of an extended object is independent of its distance from 
us. As one removes the object to a distance $r$ from the observer, the amount
of light received falls off as $1/r^2$, but the solid angle subtended by the
object, over which its light is distributed, falls off in the same proportion,
so that the amount of light per square second of arc remains constant. Thus
the surface brightness at a given point on the surface of the galaxy is a
well-defined, distance-independent quantity.¹⁷

Optical astronomers measure surface brightnesses in magnitudes per
square arcsecond. The following example will demonstrate that these are
strange units. Suppose that the surface brightness $I_1$ at a given point in
the image of a nearby elliptical galaxy is 21 mag arcsec$^{-2}$. Then a square
one arcsecond on a side around that point emits as much light as a star
of magnitude 21. Now suppose that this galaxy lies in front of a more
distant spiral galaxy and that the given point coincides with the center of
that galaxy, where the surface brightness $I_2$ is also 21 mag arcsec$^{-2}$.
Then what is the combined surface brightness $I_1 + I_2$ of the two galaxies?
The answer is not 42 mag arcsec$^{-2}$ but 20.25 mag arcsec$^{-2}$ because the
amount of light coming from the two galaxies is
$2 \times 10^{-21/2.5} = 10^{-20.25/2.5}$ arbitrary units.
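The same bookkeeping applies to any number of overlapping sources. A minimal sketch, in which the function name is ours and the flux units are arbitrary as in the text:

```python
import math


def combine_surface_brightness(*mu: float) -> float:
    """Combine surface brightnesses given in mag arcsec^-2.

    Magnitudes are logarithmic, so each value is converted to a linear
    flux per square arcsecond (arbitrary units), the fluxes are added,
    and the sum is converted back to magnitudes.
    """
    total_flux = sum(10.0 ** (-m / 2.5) for m in mu)
    return -2.5 * math.log10(total_flux)


# Two superposed galaxies, each with 21 mag arcsec^-2 at the given point.
print(f"{combine_surface_brightness(21.0, 21.0):.2f}")  # 20.25
```

Doubling the light always brightens the surface brightness by $2.5\log 2 \simeq 0.75$ mag, whatever the starting value.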

In this book we employ the notation $I = 21\mu_B$ to indicate a surface
brightness in the B band equal to 21 mag arcsec$^{-2}$.

Radio astronomers frequently measure surface brightnesses in degrees
Kelvin, and say that they have measured the object's 'brightness
temperature' $T_B$. If $T_B = 20\,$K, then the object's surface brightness at
the wavelength of the measurement is the same as that of a black body whose
temperature is 20 K; see §8.1.4 for more details.¹⁸

Since galaxies do not have sharp edges, it is usual to characterize the
physical size of a galaxy by quoting an isophotal radius or isophotal
diameter, that is, the radius or diameter at which some particular
surface-brightness level is reached. For example, the Reference Catalogues
(de Vaucouleurs, de Vaucouleurs & Corwin 1976, de Vaucouleurs et al. 1991)
cite $D_{25}$, the diameter that one estimates the $I = 25\mu_B$ isophote
would have if the galaxy were seen face on and unobscured by dust. We shall
frequently cite values of $R_{25} = \frac12 D_{25}$, which is frequently
called the de Vaucouleurs radius. An older, but still sometimes cited
isophotal radius is the Holmberg radius, which is the semimajor axis of the
$I = 26.5\mu_{\rm pg}$ isophote.


17 In fact, the greater the distance to a galaxy the greater its redshift, and
the redshift tends to cause the surface brightness at a given wavelength to
decrease with distance.

18 §8.1.4 also explains the other surface-brightness units commonly employed
by radio astronomers: Jy km s$^{-1}$.


2.4 Gravitational lensing 


Two of the fundamental problems of astronomy are (i) determining the phys- 
ical scale of the Universe and of the objects such as galaxies that lie within 
it, and (ii) determining the masses of these objects. Any physical effect that 
enables us to make progress with either of these tasks is precious. The phe- 
nomenon of gravitational lensing, which was first observed in 1979 (Walsh et 
al. 1979), enables us to make progress with both of these tasks. In view of this 
fact, a great deal of effort has been directed towards discovering instances of 
gravitational lensing over the last twenty years, and towards understanding 
the physical implications of such instances. In the last few years, these efforts 
have begun to bear a rich harvest so that gravitational lensing must now be 
reckoned to be a powerful probe of the structure of the Milky Way (§10.1) 
and of the Universe at large (§7.2.3). In the coming years, the astronomical 
importance of this phenomenon will surely grow. In this section we describe 
the theoretical framework that underlies these advances. 

A gravitational field can deflect a beam of light. In practical applica- 
tions this deflection is best understood by considering that gravity causes 
the refractive index of the vacuum to differ from unity. Quantitatively, the 
refractive index of the vacuum is given by (see Appendix A) 


$$ n = 1 + \frac{2|\Phi|}{c^2}. \tag{2.47} $$


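When a ray passes through a region within which the refractive index $n$ is
a function of position, it is deflected through some angle $\alpha$; see
Figure 2.12, which illustrates the simple case in which source and observer
both lie within the plane of the ray. Quantitatively, $\alpha$ is given by
(Appendix A)

$$ \boldsymbol{\alpha}(\mathbf{x}_\perp) = \frac{4}{c^2}\nabla_\perp \Phi_2(\mathbf{x}_\perp), \tag{2.48} $$

where $\mathbf{x}_\perp$ is the ray's impact parameter and the function
$\Phi_2$ is related to the deflecting mass density $\rho(\mathbf{x})$ by

$$ \Phi_2(\mathbf{x}_\perp) = G\int {\rm d}^2\mathbf{x}'_\perp\, \Sigma(\mathbf{x}'_\perp)\,\ln|\mathbf{x}_\perp - \mathbf{x}'_\perp|, \qquad \Sigma(\mathbf{x}_\perp) = \int {\rm d}z\,\rho(\mathbf{x}), \tag{2.49} $$

with $z$ the coordinate along the line of sight.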
Figure 2.12 The definition of the Einstein radius $r_E$.


Equation (2.49) has a simple physical interpretation: the deflection is given 
by the gradient of the gravitational potential that is generated (in two dimen- 
sions) by the surface density that one obtains by integrating the deflecting 
mass density along the line of sight. 

An important special case is that in which the matter distribution is
effectively that of a point mass $M$; that is, the deflecting matter
distribution is confined well inside the impact parameter $x_\perp$ of every
ray of interest. Then $\Phi_2(\mathbf{x}_\perp) = GM\ln|\mathbf{x}_\perp|$ and

$$ \alpha = \frac{4GM}{c^2 x_\perp}. \tag{2.50} $$
The situation when the source, mass and observer all lie on a straight line
is described by Figure 2.12: rays that encounter the mass at small impact
parameter cross the source–observer line in front of the observer, while rays
that encounter the mass at large impact parameters cross behind her. The
ray that encounters the mass with critical impact parameter $r_E$ reaches the
observer. Since, in the notation of Figure 2.12, $\theta_S \simeq r_E/D_{SL}$,
$\theta_O \simeq r_E/D_L$ and $\alpha = \theta_S + \theta_O$, it follows with
a little algebra from (2.50) that

$$ r_E = \sqrt{\frac{4GM}{c^2}\,\frac{D_{SL}D_L}{D_{SL}+D_L}}. $$


The critical radius $r_E$ is called the Einstein radius of the deflector.
Note that it depends on the relative positions of source, observer and
deflector in addition to the deflector's mass. If the source, deflector and
observer are collinear as in Figure 2.12, the observer sees a bright ring of
radius $r_E$ around the deflector. The angular radius of this ring is the
Einstein angle

$$ \theta_E = \sqrt{\frac{4GM}{c^2}\,\frac{D_{SL}}{D_L(D_{SL}+D_L)}}. \tag{2.51} $$


The Einstein radius is a dimensionally important quantity because lensing
significantly modifies the appearance of a source that lies within about
$r_E$ of the deflector–observer line, while a source that lies further than
$r_E$ from this line will be seen very much as it would be if the deflector
were not present. Objects that lie within $r_E$ of a deflector are said to be
strongly lensed, while any distortion of the images of objects that lie
further out is ascribed to weak lensing.

Figure 2.13 Images of a patch $S$ on the surface of a source are formed at
$I_+$ and $I_-$.

Imagine observing stars that lie at a common distance $D_{SL}+D_L$ through
a screen of point-mass deflectors of mass $M$ that all lie at distance $D_L$.
The probability that a given source is lensed is the probability that it lies
on the sky within one of the disks of radius $\theta_E$ that we may draw
around each deflecting mass. For astronomically interesting surface-densities
$N$ of deflectors, these disks have a negligible probability of overlapping,
so the fraction of the sky that they cover is $N\pi\theta_E^2$, and this is
the probability that any given source will be lensed. When we use equation
(2.51) to eliminate $\theta_E$ from our expression for this probability, we
find the latter is

$$ \tau = \frac{4\pi G N M}{c^2}\,\frac{D_{SL}}{D_L(D_{SL}+D_L)}. \tag{2.52} $$


The probability $\tau$ is called the optical depth to gravitational lensing
by the screen of deflectors. Notice that it is proportional to the surface
density of mass, $\Sigma = NM$, contributed by the deflectors; for given mass
surface density, $\tau$ is the same for a large number of low-mass deflectors
as for a smaller number of more massive deflectors.
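A small numerical sketch makes the last point concrete. The function names, the rounded physical constants, and the two deflector screens below are illustrative assumptions of ours rather than values taken from the text; $N$ is taken to be the number of deflectors per steradian.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2 (rounded)
C = 2.998e8        # speed of light, m s^-1 (rounded)
KPC = 3.086e19     # one kiloparsec in metres (rounded)
M_SUN = 1.989e30   # solar mass in kg (rounded)


def einstein_angle(mass: float, d_l: float, d_sl: float) -> float:
    """Einstein angle in radians from equation (2.51); SI inputs."""
    return math.sqrt(4.0 * G * mass / C**2 * d_sl / (d_l * (d_sl + d_l)))


def optical_depth(n_per_sr: float, mass: float, d_l: float, d_sl: float) -> float:
    """Fraction of sky covered by Einstein disks, N * pi * theta_E^2."""
    return n_per_sr * math.pi * einstein_angle(mass, d_l, d_sl) ** 2


# Two hypothetical screens with the same product N*M: many light
# deflectors versus a hundred times fewer, a hundred times heavier ones.
d_l = d_sl = 4.0 * KPC
tau_light = optical_depth(1e10, 0.1 * M_SUN, d_l, d_sl)
tau_heavy = optical_depth(1e8, 10.0 * M_SUN, d_l, d_sl)
print(tau_light, tau_heavy)
```

The two optical depths agree to rounding error, because $\theta_E^2 \propto M$ exactly compensates the smaller number of deflectors.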

What does depend on the mass of the deflectors is the angle through 
which light is typically deflected when lensing takes place. For example, for 
$D_{SL} = D_L = 10\,$kpc, $\theta_E = 0.9(M/M_\odot)^{1/2}$ mas. Thus when the deflector is
a star and the source and deflector lie within the Milky Way, the deflection 
angle is too small to be measured even with HST. We can none the less detect 
such a lensing event by monitoring the luminosity of the star as it is lensed: 
we shall show that this temporarily increases when a deflector passes close 
to the line of sight to the source in very much the same way that a burning 
glass locally increases the brightness of the Sun. 

We now calculate the amount by which a point-mass deflector brightens
an unresolved background star. The key to the calculation is the fact that
gravitational deflection leaves unchanged the surface brightness of a lensed
object. Consequently, the luminosity of an image is simply proportional to
the area of the sky covered by it. Figure 2.13 shows the relevant geometry
when a source is imaged by a spherically-symmetric deflector such as a point
mass. The true position and angular extent of a portion of the source is shown
at $S$, while $I_+$ and $I_-$ mark the locations and extents of the two images
of $S$. Each of these images is formed by taking each point of $S$ and moving
it along the line through that point and the center of the deflector to a new
location. This location is determined by consideration of Figure 2.14, which
shows the plane defined by the source point $S$, the deflector $D$ and the
observer $O$. Since the marked angles are in reality very small, we have¹⁹

$$ \eta = D_S\beta = D_S\theta - D_{SL}\alpha \quad\Rightarrow\quad \beta = \theta - \frac{D_{SL}}{D_S}\,\alpha(\theta). \tag{2.53} $$

Figure 2.14 A ray from the source $S$ is deflected through angle $\alpha$ by
the gravitational field of the object $D$.


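Here $\alpha(\theta)$ is the function obtained by substituting
$x_\perp = D_L\theta$ into equation (2.50). Making this substitution and then
using (2.51), with $D_S = D_{SL} + D_L$, we find

$$ \beta = \theta - \frac{D_{SL}}{D_S}\,\frac{4GM}{c^2 D_L\theta} = \theta - \frac{\theta_E^2}{\theta}. \tag{2.54} $$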


For given source location $\beta$ this is a quadratic equation in $\theta$
with solutions

$$ \theta_\pm = \tfrac12\left(\beta \pm \sqrt{\beta^2 + 4\theta_E^2}\right). \tag{2.55} $$


Returning to Figure 2.13, light from the patch $S$ of the source is received
in the two patches $I_+$ and $I_-$, which correspond to the two roots
$\theta_\pm$ of equation (2.55). Their areas $A_\pm$ are seen to be related to
the area $A_S$ of $S$ by

$$ \frac{A_\pm}{A_S} = \frac{\theta_\pm\,{\rm d}\theta_\pm}{\beta\,{\rm d}\beta} = \frac{1}{4\beta}\left(\beta \pm \sqrt{\beta^2+4\theta_E^2}\right)\left(1 \pm \frac{\beta}{\sqrt{\beta^2+4\theta_E^2}}\right). \tag{2.56} $$

19 Equation (2.53) is often called the lensing equation.

Figure 2.15 The geometry on the sky of a lensing event.


When we use (2.55) to express everything as a function of
$u = \beta/\theta_E$, we find

$$ \frac{A_\pm}{A_S} = \frac{\left(u \pm \sqrt{u^2+4}\right)\left(\sqrt{u^2+4} \pm u\right)}{4u\sqrt{u^2+4}} = \frac12 \pm \frac{u^2+2}{2u\sqrt{u^2+4}}. \tag{2.57} $$

The second term on the right of equation (2.57) is numerically larger than
$\frac12$ so $A_-/A_S$ is negative. This result arises because the image at
$\theta_-$ is inverted: the points that lie nearest the deflector are imaged
to points that lie furthest from the deflector. Consequently,
${\rm d}|\theta_-|/{\rm d}\beta < 0$ and equation (2.56) yields a negative
value for $A_-/A_S$.

Since gravitational lensing preserves surface brightness, in each image
lensing amplifies the intrinsic brightness of the source $S$ by a factor
$a_\pm = |A_\pm/A_S|$. When the individual images are unresolved, the observed
magnification is the sum

$$ a = a_+ + a_- = \frac{u^2+2}{u\sqrt{u^2+4}}. \tag{2.58} $$
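The chain of results from the lens equation to the total magnification can be checked numerically. The sketch below is ours, not part of the original derivation: angles are measured in units of the Einstein angle, the function names are illustrative, and the derivative in equation (2.56) is estimated by a central finite difference.

```python
import math


def image_positions(beta: float, theta_e: float = 1.0) -> tuple[float, float]:
    """Roots theta_+ and theta_- of the lens equation, equation (2.55)."""
    s = math.sqrt(beta**2 + 4.0 * theta_e**2)
    return 0.5 * (beta + s), 0.5 * (beta - s)


def signed_area_ratios(u: float) -> tuple[float, float]:
    """A_+/A_S and A_-/A_S from equation (2.57), with u = beta/theta_E."""
    term = (u**2 + 2.0) / (2.0 * u * math.sqrt(u**2 + 4.0))
    return 0.5 + term, 0.5 - term


def magnification(u: float) -> float:
    """Total magnification a = a_+ + a_-, equation (2.58)."""
    return (u**2 + 2.0) / (u * math.sqrt(u**2 + 4.0))


u = 1.0  # source one Einstein angle from the deflector
# Both roots satisfy the lens equation beta = theta - theta_E^2/theta.
for theta in image_positions(u):
    assert abs(theta - 1.0 / theta - u) < 1e-12

# Check (2.56) numerically: A/A_S = (theta/beta) * dtheta/dbeta.
h = 1e-6
for i, closed_form in enumerate(signed_area_ratios(u)):
    dtheta = (image_positions(u + h)[i] - image_positions(u - h)[i]) / (2.0 * h)
    numeric = image_positions(u)[i] / u * dtheta
    assert abs(numeric - closed_form) < 1e-8

a_plus, a_minus = (abs(r) for r in signed_area_ratios(u))
print(f"a = {a_plus + a_minus:.3f}")  # a = 1.342
```

For $u = 1$ the total is $3/\sqrt5$, so a source lying on the Einstein ring of a point deflector is brightened by about a third.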


As a deflector passes close to our line of sight to a background source,
$\beta$ decreases from a large value and then increases again, with the
result that the brightness of the source is amplified by a time-dependent
factor $a(t)$. From Figure 2.15, which shows the path on the sky of the
deflector relative to the source, it is clear that

$$ \beta(t) = \left[(\mu t)^2 + \beta_{\min}^2\right]^{1/2}, \tag{2.59} $$

where $\mu$ is the magnitude of the relative proper motion, $\beta_{\min}$ is
the closest approach of the deflector to the source, and $t$ is the time to
the point of closest approach. Figure 2.16 shows plots of the amplification
factor $a(t)$ that one obtains by inserting (2.59) into equation (2.58) for
several values of $u_{\min} = \beta_{\min}/\theta_E$.
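Light curves of this kind are straightforward to tabulate. In the sketch below, which is ours and only illustrative, time is expressed through the dimensionless variable $x = \mu t/\theta_E$, and the sampled values of $u_{\min}$ are arbitrary choices rather than those plotted in Figure 2.16.

```python
import math


def magnification(u: float) -> float:
    """Point-lens magnification, equation (2.58)."""
    return (u * u + 2.0) / (u * math.sqrt(u * u + 4.0))


def light_curve(x: float, u_min: float) -> float:
    """Amplification a(t), with x = mu*t/theta_E.

    Dividing equation (2.59) by theta_E gives u(t) = sqrt(x^2 + u_min^2).
    """
    return magnification(math.sqrt(x * x + u_min * u_min))


for u_min in (0.1, 0.5, 1.0):  # illustrative impact parameters
    samples = [light_curve(x, u_min) for x in (-2.0, -1.0, 0.0, 1.0, 2.0)]
    print(u_min, [round(a, 2) for a in samples])
```

Each curve is symmetric about the time of closest approach and peaks there, and the peak grows rapidly as $u_{\min}$ decreases. The curve depends only on $u_{\min}$ and on the timescale $\theta_E/\mu$, not on the waveband of observation.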

Each year of order a hundred stars are observed to suffer episodes of
brightening that can be fitted by the function $a(t)$ of Figure 2.16 for
suitable values of $\mu$ and $u_{\min}$. We can be pretty sure that these
stars are being lensed gravitationally because the light curves of a given
star in different wavebands can be fitted by the same values of $\mu$ and
$u_{\min}$ just as lensing theory predicts. By contrast, intrinsic variability
in stars is almost always associated with changes in color, with the
consequence that each waveband yields a different light curve.

Figure 2.16 As a deflector drifts past the line of sight to a background
star, the star brightens by the factor $a(t)$ that is shown here for several
values of $u_{\min} = \beta_{\min}/\theta_E$. (Horizontal axis:
$\mu t/\theta_E$.)


2.5 Archival data and catalogs 


The raw materials of any astronomical study are observational data. From 
these data, the first interpretive step is the production of data products. 
For example, from raw images of star fields, one could derive a catalog of 
stellar positions, magnitudes, colors, and so on. Such data products form the 
backbone of most astronomical research, and the importance of the painstak- 
ing work required for their assembly is impossible to overestimate. Since 
astronomical catalogs may have applications to a wide range of research 
problems, it is important that they are widely disseminated. Such catalogs 
also frequently have a very long “shelf-life” — the NGC and IC catalogs of 
galaxies and star clusters are more than a hundred years old, but still form 
the basis for many studies. The long-term archiving of these products is 
therefore also an issue. The original observations from which the products 
were derived can also sometimes be put to uses other than those for which 
they were obtained. It is therefore important that the raw data, as well as 
the processed results, be archived for future use. 

In this section, we discuss some of the archival resources available to 
astronomers. A list of astronomical catalogs is given in Appendix B; this list 
is by no means complete, but includes the more important catalogs that the 
reader is likely to encounter in the literature. Some of these data products 
are included primarily for historical reasons, but it is important to realize 
that historic catalogs can remain of some importance even when more cur- 
rent data are available. For example, even though a more recent spectral 
classification may well have been made, a star is still commonly identified by 
the HD number that tags it in the Henry Draper Catalog. One area where
historical precedent is particularly important is in specifying an object’s co- 
ordinates. As we saw in §2.1, there have been a number of revisions in the 
way in which the celestial coordinate system is defined. An object’s position 
is usually calculated by measuring its offset from an object in one of the 
catalogs in Appendix B. The coordinates adopted in these surveys therefore 
define the celestial coordinates of subsequent observations. Indeed, some of 
the revisions to the astrometric catalogs listed in Appendix B, such as FK4 
to FK5, were motivated by the need to improve the consistency and accuracy 
of the celestial coordinate system. The older catalogs will remain important 
even after the world-wide adoption of the ICRS (§2.1.6) as the standard 
astrometric system, since we will still need to be able to cross-reference be- 
tween the various coordinate systems to convert older observations to the 
new system. 

Simple data products such as positions and magnitudes of stars can be 
straightforwardly presented in tabular form. However, when it comes to more 
complex data such as spectra, tables do not provide a very efficient medium 
for presenting the results, so such data are usually published as graphical 
atlases. Similarly, the mass of information contained in an image of a galaxy 
cannot readily be reduced to a few numbers, so a catalog of the morphology 
of galaxies is often supplemented by an atlas of images. Table 2.5 gives 
a short list of atlases that contain galaxy images; browsing through such 
atlases is probably the best way to obtain an overall impression of the range 
of structures that one finds in galaxies, their similarities and differences. 
Our location within the Milky Way means that studies of the large-scale 
structure of our own galaxy need data from all around the sky. Table 2.6 
lists some of the more important optical sky surveys for the study of Galactic 
structure, while Table 2.7 lists some of the all-sky data sets available at other 
wavelengths (further details on HI and CO surveys of the Milky Way can be 
found in Tables 9.1 and 9.2, respectively). 

It is clear from these tables that there is a vast amount of data available 
for studies of galactic structure, and the amount of information is growing 
extremely rapidly. The main reason for this “data explosion” has been the 
advent of digital techniques for data acquisition and storage. Digital data 
are easy to reproduce and disseminate, and the ease with which such data 
can be manipulated by computer makes it possible to analyze and interpret 
even the largest catalogs. 

The proliferation of large data products led to the setting up in the 
1970s of a number of specialized institutes to collect and store data. First 
amongst these astronomical data centers (ADCs) was the Centre de 
Données Astronomiques de Strasbourg (CDS), and this institution still serves 
as the primary clearing house for data, culling results from the literature and 
distributing them to users. 

Another consequence of the widespread sharing of data has been the
need to develop standard formats for data files, so that data written by
one computer can be read by another. The most commonly-used format for
astronomical data is the Flexible Image Transport System, or FITS
protocol. As its name suggests, this file format was originally created to
allow images to be moved between machines, but the format is now widely
used to store and transmit a wide variety of types of data, including catalogs
and other tables (Schlesinger 1994).

Table 2.5 Galaxy atlases

| Atlas | Principal content | Number of objects | Reference |
|---|---|---|---|
| Hubble Atlas of Galaxies | Description, classification, photographs of galaxies | 180 | Sandage (1961) |
| Atlas of peculiar galaxies | Photographs of peculiar interacting galaxies | 338 | Arp (1966) |
| Atlas of southern peculiar galaxies | Photographs of peculiar interacting galaxies | 6445 | Arp, Madore & Roberton (1987) |
| NASA Atlas of Galaxies | Photographs of spiral galaxies useful for establishing the cosmic distance scale | 322 | Sandage & Bedke (1988) |
| Carnegie Atlas of Galaxies | Description, classification, photographs of galaxies | 1225 | Sandage & Bedke (1994) |
| Digital Catalog of Galaxies | CCD images in several colors of nearby galaxies | 113 | Frei et al. (1996) |

Table 2.6 Optical sky surveys

| Survey | Telescope | Sky coverage | Limiting magnitude (and band) |
|---|---|---|---|
| Franklin-Adams Survey, 1911 | 10-inch Astrograph | All-sky | 15 |
| Palomar–National Geographic Society Sky Survey (POSS-I), 1960 | Palomar 48-inch Schmidt | −33° < δ < 90° | 21(B), 20(R) |
| ESO Atlas of Southern Milky Way, 1969 | Boyden 10-inch Astrograph | Southern Milky Way | 16 |
| Canterbury Sky Atlas, 1972 | Ross 5-inch Astrograph | −90° < δ < 22° | 16 |
| ESO/SERC Southern Sky Survey, 1980 | UK 1.2m Schmidt & ESO 1.0m Schmidt | −90° < δ < −17° | 23(B), 22(R) |
| SERC Equatorial J Extension, 1991 (R-band on-going) | UK 1.2m Schmidt | −17° < δ < 3° | 23(B), 22(R) |
| Second Palomar Sky Survey (POSS-II), on-going | Palomar 48-inch Schmidt | −3° < δ < 90° | 22.5(B), 21(R), 19.5(I) |
| Second Southern Sky Survey, on-going | UK 1.2m Schmidt | −90° < δ < −17° | 21.5(R) |
| Sloan Digital Sky Survey (SDSS), on-going | SDSS 2.5m (5 band imaging + spectra for redshifts) | b > 30°, all l, plus a 2° × 50° strip through SGP | 23-25(R) |

NOTES: Since many of these surveys were recorded photographically through
non-Johnson filters, the bandpasses listed should only be taken as indicative.

Table 2.7 Non-optical sky surveys

| Waveband | Instrument | Sky coverage | Reference |
|---|---|---|---|
| HI (21 cm) | Hat Creek 85-foot dish | −10° < b < 10°, 10° < l < 250° | Weaver & Williams (1973) |
| | CSIRO 18m dish | −10° < b < 10°, 240° < l < 350° | Kerr et al. (1986) |
| | AT&T 20-foot horn | δ > −40° | Stark et al. (1992) |
| | Dwingeloo 25m dish | δ > −30° | Hartmann & Burton (1997) |
| 1.4 GHz radio continuum | Very Large Array (VLA) | δ > −40° | Condon et al. (1997) |
| | Very Large Array (VLA) | Same as SDSS | Becker et al. (1995) |
| CO (2.6 mm) | 1.2m dishes (USA & Chile) | −15° < b < 15°, all l | Dame et al. (1987) |
| Infrared | Infrared Astronomical Satellite (IRAS) | All-sky | Wheelock et al. (1991) |
| | Cosmic Background Explorer (COBE) | All-sky | Hauser et al. (1997) |
| Ultraviolet | Roentgen Satellite (ROSAT) | All-sky | Pounds et al. (1993) |
| | Extreme Ultraviolet Explorer (EUVE) | All-sky | Bowyer et al. (1994) |
| X-ray | High Energy Astrophysics Observatory (HEAO-I) | All-sky | Wood et al. (1984) |
| | Roentgen Satellite (ROSAT) | All-sky | Snowden et al. (1995), Voges et al. (1996) |
| γ-ray | COS-B | Galactic plane | Bloemen (1989) |
| | Compton Gamma-Ray Observatory (CGRO) | All-sky | Fichtel et al. (1994), Schönfelder et al. (1996) |

Increasing computer power and storage capacity have given a number 
of older resources a new lease of life. The Space Telescope Science Institute 
(STScI) took on the mammoth task of digitizing plates from the POSS-I 
and SERC optical surveys (see Table 2.6) to produce a Digitized Sky Sur- 
vey (DSS) of the entire sky. The amount of data involved is huge — some 
600 Gbytes; even after passing the images through sophisticated data com- 
pression algorithms, the DSS still occupies more than a hundred CD-ROMs. 
A second-generation DSS is now being produced by scanning the plates of
the POSS-II, the Second Southern Sky Survey, and the red equatorial
extension (see Table 2.6). The second-generation DSS will have higher
resolution than its predecessor (1 arcsec per pixel as opposed to 1.7 arcsec
per pixel), so it will occupy over 3 Terabytes.

Full exploitation of the second-generation DSS requires photometric cal- 
ibration of the original survey plates (Lasker et al. 1990). This calibration 
is tricky because the blackness of a photographic plate is a highly non-linear 
function of the intensity of the light to which it has been exposed. This 
difficult calibration will be largely avoided by the next generation of surveys, 
such as the Sloan Digital Sky Survey (SDSS), which will use large-format 
CCDs rather than photographic plates to image the sky. 

Archival data are also of critical importance to satellite-based obser- 
vations. Due to the great expense and limited lifetimes of such missions, 
data obtained by satellites are even more valuable than data from ground- 
based instruments, and careful archiving of data is vital. Fortunately, their 
archiving is facilitated by the constraints that are inherent in satellite obser- 
vatories: data must be digital so that they can be relayed to the ground, and 
the details of each observation are inevitably carefully documented when the 
observation is scheduled. Further, since the space-based instruments have to 
work reliably in a “hands-off” environment, the modes in which they operate 
are relatively simple and well-documented. Consequently, data taken with 
satellites can readily be interpreted by an astronomer who was not involved 
in the original observations. 

By contrast, ground-based instruments have lagged in producing useful 
data archives. Particularly at optical wavelengths, there has been a tradition 
of the data “belonging” to the observer. Thus, even though most facilities 
now release all data after a proprietary period (typically a year), relatively 
little use has been made of such public-domain data. The more interactive 
nature of ground-based observing can make it difficult to interpret data taken 
by someone else, since it is sometimes hard to reconstruct exactly how the 
data were taken. Nonetheless, the importance of archival data as a resource is 
also being recognized at ground-based facilities, and many are now producing 
usable data archives. 


On-line resources Happily, the huge expansion in the quantity of archival 
data available has been accompanied by a similar explosion in the use of the 
Internet. It is no longer necessary for researchers to maintain their own 
copies of all the databases that they may use in the course of their research, 
since they can access the information across the network from wherever in 
the world it may be physically stored. 

Although the Internet has been used by astronomers for e-mail since the 
early 1980s, it was initially not much used to access data remotely; files could 
be transferred with the file transfer protocol (ftp), but information could 
not be extracted from a file without downloading the whole thing. Further, 
unless one knew the exact name and location of the data file in question, it 
was almost impossible to track down the information one required. 
The situation improved somewhat with the setting up of telnet services,
which enable a user to log on to a remote machine and query databases
stored there. One of the most widely used of these services, called
SIMBAD,²⁰ is provided by the CDS to give access to the basic information and
bibliographic references that they hold on more than a million astronomi- 
cal objects. A similar service for purely extragalactic objects is provided by 
the NASA/IPAC Extragalactic Database (NED). These services allow quite 
sophisticated searches of their databases, but the user interface is a fairly 
rudimentary text-based one. 

The use of on-line resources really took off in the early 1990s with the 
development of the World Wide Web (WWW). This paradigm for access- 
ing data on the Internet emerged from the high-energy physics community, 
but is also well suited to the needs of astronomers for several reasons. First, 
the organization of information into structured hypertext documents rather 
than simple data files makes it straightforward to navigate one’s way to the 
information required, even when the location and name of the file containing 
the relevant data is initially unknown. Second, the WWW allows sophisti- 
cated interactions between the client (the person/machine that is accessing a 
database) and the server (the machine that holds the data), so one can easily 
extract just the data that one requires and in a form that is suited to one’s 
project. Third, the ability of the WWW to transfer and display graphical 
information is ideally suited to image-based astronomical data. 

New on-line resources and tools are currently being developed at a dra- 
matic rate. For example, the Java programming language, designed for run- 
ning complete programs across the Internet, offers a whole new mechanism 
for the remote manipulation of data. Any attempt to provide a definitive 
description of the electronic resources available is therefore doomed to be out 
of date long before it gets into print. In fact, the only way to keep abreast of 
developments is by using the existing resources regularly and trying out new 
ones as they emerge. A good starting place for finding the available
astronomical resources is provided by the AstroWeb Database,²¹ which contains
information on thousands of astronomical resources that are available over
the WWW. The various search engines²² that automatically index the entire
WWW are also valuable tools for locating particular pieces of information.

Although we make no attempt to provide a definitive list of on-line re- 
sources, Table 2.8 gives the URLs for a number of services which have proved 
particularly useful, and which are sufficiently well-established that they are 
unlikely to disappear overnight. This table also gives some indication of the 


20 The name is the rather contrived acronym formed from ‘Set of Identifications, Mea- 
surements and Bibliography for Astronomical Data.’ 

21 Bither http: //ecf.hq.eso.org/astroweb/yp-astro_resources.html or 
http://www.cv.nrao.edu/fits/www/astronomy.html — these Unified Resource Loca- 
tors (URLs) give the location of the information being accessed, and the protocol by 
which it is to be accessed. In this text, URLs are set in Courier font. 

22 For example, http: //home netscape. com/home/internet-search.html. 


Table 2.8 Selection of on-line astronomical research resources

| Resource | Description | Uniform Resource Locator (URL) |
| --- | --- | --- |
| Data resources | | |
| SIMBAD | Basic data and bibliographic references for $\sim 10^6$ objects | telnet://simbad.u-strasbg.fr/ ; http://cdsweb.u-strasbg.fr/Simbad.html |
| NASA/IPAC Extragalactic Database (NED) | Basic data and bibliographic references for $\sim 750{,}000$ extragalactic objects | telnet://ned@ned.ipac.caltech.edu/ ; http://nedwww.ipac.caltech.edu/ |
| ADC catalog databases | On-line access to $\sim 2000$ catalogs of data and tables from journals | http://vizier.u-strasbg.fr/ ; http://adc.gsfc.nasa.gov/ ; http://adac.mtk.nao.ac.jp/ |
| HEASARC archive browser | Archive of data and catalogs, mostly obtained by high energy satellites | http://heasarc.gsfc.nasa.gov/w3browse/ |
| SkyView | On-line access to sky survey images at a wide range of wavelengths | http://skyview.gsfc.nasa.gov/ |
| STScI Digitized Sky Survey (DSS) | On-line access to the optical sky surveys digitized by STScI | http://stdatu.stsci.edu/dss/ ; http://archive.eso.org/cgi-bin/dss/ ; http://cadcwww.dao.nrc.ca/cadcbin/getdss |
| Literature resources | | |
| Los Alamos National Laboratory (LANL) e-print archive | Electronic astronomy preprint archive | http://xxx.lanl.gov/archive/astro-ph (see http://xxx.lanl.gov/servers.html for worldwide mirror services) |
| Astrophysics Data System (ADS) Abstract Service | Abstracts and complete texts of many astronomical journals | http://adsabs.harvard.edu/abstract_service.html ; http://cdsads.u-strasbg.fr/abstract_service.html ; http://ads.nao.ac.jp/abstract_service.html |
| Miscellaneous resources | | |
| CADC meetings database | Listing of up-coming conferences | http://cadcwww.dao.nrc.ca/meetings/meetings.html |
| AAS job register | Astronomical situations vacant | http://www.aas.org/JobRegister/aasjobs.html |

NOTES: An on-line version of this table can be found at http://www.astro.soton.ac.uk/astrores.html

range of services now available. As well as the basic information provided
by services such as SIMBAD and NED, it is now possible to access a great 
deal of raw astronomical data from various instruments. A good example 
of an on-line data archive is the High Energy Astrophysics Science Archive 
Research Center (HEASARC), which provides easy access to the data from a 
large number of X-ray and γ-ray satellites. Access to the astronomical liter-
ature has also been revolutionized by the WWW. The ADS Abstract Service
allows one to search through all the common astronomical journals for par- 
ticular text in their abstracts or for particular objects, and even to retrieve 
the complete texts of many articles. The LANL E-print Archive maintains 
an on-line database of preprint copies of astronomical papers, and by brows- 
ing through the recent additions one can keep up with current developments 
in the field worldwide. 

This burgeoning of on-line resources has caused radical changes in the 
way in which astronomical research is undertaken. For any project, it is 
now straightforward to see what applicable data already exist in the pub- 
lic domain, and what work has already been done on the subject. It is 
also practicable to sift through vast amounts of data reasonably quickly, so 
large survey-type projects can be undertaken with relative ease. Finally, be- 
cause these on-line resources can be accessed from anywhere in the World, 
it is now possible to do leading-edge research away from the large institu- 
tions which have traditionally dominated the astronomical community. Quite 
where these changes will end remains to be seen, but it is already clear that 
a major revolution is underway. 


Problems 


2.1 In projection, a point in the bulge of M31 lies at a small distance from
the center of a spherical globular cluster. The measured surface brightness
at the given point is $19\,\mathrm{mag\,arcsec^{-2}}$. From the structure of the globular
cluster, it is estimated that the intrinsic surface brightness of the cluster at
that point is $20\,\mathrm{mag\,arcsec^{-2}}$. What is the surface brightness of the bulge at
that point?


2.2 $I_B$ is the B-band surface brightness of a galactic disk measured in
$L_\odot\,\mathrm{pc}^{-2}$. Show that, when viewed face-on, the B-band surface brightness of
the disk will be measured to be $\mu_B$ magnitudes per square arcsecond, where

$$\mu_B = 27.05 - 2.5\log(I_B). \tag{2.60}$$

[You will need to use the fact that the B-band absolute magnitude of the
Sun is $M_B(\odot) = 5.48$.]


2.3 From equation (2.42), show that

$$C_{\rm bol} = M_{\rm bol} + 2.5\log(L/\mathrm{W}) - 90.195, \tag{2.61}$$

where $L/\mathrm{W}$ is the star's luminosity in watts. Show further that, if the Sun's
V-band bolometric correction is taken to be zero, $C_{\rm bol} = -18.90$.


2.4 When bolometric corrections are defined such that $BC_V(\odot) = -0.19$,
show that

$$M_{\rm bol} = -2.5\log(L/L_\odot) + 4.64 = -5\log(R/R_\odot) - 10\log(T_{\rm eff}/T_\odot) + 4.64.$$


3 
The Properties of Stars 


The study of galaxies is founded upon our knowledge of stars. Stars provide 
nearly all the light by which we see galaxies, and through their births and 
deaths they profoundly influence the structures of the galaxies that they 
inhabit. In this chapter we discuss the phenomenology of stars: their masses, 
radii, the characteristics of their spectra. We describe how these observable 
quantities are connected to a star’s chemical composition and the stage in 
its life-cycle that a star has reached. Then we take a census of the stars near
the Sun: how common are stars of each type? Finally, we explain how the
knowledge of the intrinsic properties of stars that we have gathered in this
chapter enables us to correct for the insidious effects of the interstellar dust
through which we are obliged to peer at many remote objects.


3.1 The masses of stars 


Fundamental determinations of stellar masses are almost invariably based 
on an application of Kepler’s third law to the orbits of binary stars. This 
law states that 

$$G(\mathcal{M}_1 + \mathcal{M}_2)P^2 = 4\pi^2 a^3, \tag{3.1}$$

where $\mathcal{M}_1$ and $\mathcal{M}_2$ are the masses of two bodies in mutual revolution on a
relative orbit with semimajor axis $a$ and period $P$, and $G$ is the gravitational
constant.

Kepler's third law applies both to planets revolving around the Sun
and to stars revolving around one another. Indeed, we can use the Earth's
motion around the Sun to make a convenient choice of units for binary stars.
Because $\mathcal{M}_\odot$, the mass of the Sun, is much larger than $\mathcal{M}_\oplus$, the mass of the
Earth, we can write $\mathcal{M}_\odot + \mathcal{M}_\oplus \simeq \mathcal{M}_\odot$ to a high degree of approximation.
Therefore,

$$G\mathcal{M}_\odot P_\oplus^2 = 4\pi^2 a_\oplus^3, \tag{3.2}$$

where $P_\oplus$ is the period of revolution of the Earth around the Sun (a year),
and $a_\oplus$ is the mean distance from the Earth to the Sun (an astronomical
unit). Taking the ratio of equation (3.1) to (3.2), the numerical constants
cancel, and we can write

$$\frac{(\mathcal{M}_1 + \mathcal{M}_2)P^2}{\mathcal{M}_\odot P_\oplus^2} = \frac{a^3}{a_\oplus^3}, \tag{3.3}$$

so that, if we express masses in solar masses, periods in years, and distances
in astronomical units, we have simply

$$(\mathcal{M}_1 + \mathcal{M}_2)P^2 = a^3. \tag{3.4}$$

By the definition of a star's parallax $\varpi$ (§2.1.3), we have

$$\frac{a}{\mathrm{AU}} = \frac{a''}{\varpi/\mathrm{arcsec}}, \tag{3.5}$$

where $a''$ is the observed semimajor axis in arcseconds, and $\varpi$ is the parallax
of the star.
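As a minimal sketch of how equations (3.4) and (3.5) combine in practice, the following Python fragment converts an angular semimajor axis and a parallax into a total mass. The function name and the numerical values of the example binary are hypothetical, chosen only for illustration.

```python
def total_mass_solar(a_arcsec, parallax_arcsec, period_yr):
    """Sum of the two masses in solar masses.

    a_arcsec        -- observed semimajor axis a'' in arcseconds
    parallax_arcsec -- parallax of the system in arcseconds
    period_yr       -- orbital period P in years
    """
    a_au = a_arcsec / parallax_arcsec   # equation (3.5): semimajor axis in AU
    return a_au ** 3 / period_yr ** 2   # equation (3.4): (M1 + M2) = a^3 / P^2


# Hypothetical visual binary: a'' = 2.0 arcsec, parallax = 0.1 arcsec, P = 80 yr.
m_total = total_mass_solar(2.0, 0.1, 80.0)

# The parallax enters as the third power, so a 10% parallax error
# changes the inferred total mass by roughly 25%.
m_total_biased = total_mass_solar(2.0, 0.11, 80.0)
print(m_total, m_total_biased / m_total)
```

With these illustrative inputs the semimajor axis is 20 AU, and the ratio printed at the end shows how strongly the result depends on the adopted parallax.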


3.1.1 The Mass of the Sun 


To convert stellar masses expressed in solar units to physical units, we
obviously need to know the mass of the Sun. We first determine the ratio of the
mass of the Sun to the mass of the Earth by applying equation (3.3) to the
orbit of the Moon around the Earth. Thus

$$\frac{(\mathcal{M}_\oplus + \mathcal{M}_-)P_-^2}{\mathcal{M}_\odot P_\oplus^2} = \frac{a_-^3}{a_\oplus^3}, \tag{3.6}$$

or

$$\frac{\mathcal{M}_\odot}{\mathcal{M}_\oplus} = \left(\frac{P_-}{P_\oplus}\right)^2 \left(\frac{a_\oplus}{a_-}\right)^3 \left(1 + \frac{\mathcal{M}_-}{\mathcal{M}_\oplus}\right), \tag{3.7}$$

where terms subscripted with a dash refer to the Moon. All the terms on the
right-hand side are known from observation, including the ratio $(\mathcal{M}_-/\mathcal{M}_\oplus)$,
which is obtained by measuring the parallax in the position of a nearby
planet that is produced by the Earth's monthly motion around the center
of mass of the Earth-Moon system [see Abell (1964), Chapter 8]. Using the
result $(\mathcal{M}_-/\mathcal{M}_\oplus) = 1/81.3$ and inserting the measured values for $(P_-/P_\oplus)$
and $(a_\oplus/a_-)$ into equation (3.7), we find $(\mathcal{M}_\odot/\mathcal{M}_\oplus) = 332\,945$. The mass
of the Earth can be measured directly [see Abell (1964), Chapter 5] and is
found to be $5.98 \times 10^{24}$ kg; hence we derive $\mathcal{M}_\odot = 1.99 \times 10^{30}$ kg.
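The arithmetic behind this estimate can be sketched in a few lines of Python. The orbital periods and distances used below are rounded values assumed for illustration (they are not quoted in the text), so the result should agree with the figure given above only to within about a percent.

```python
def sun_to_earth_mass_ratio(p_moon, p_earth, a_earth, a_moon, moon_to_earth):
    """Ratio M_sun / M_earth from Kepler's third law applied to the Moon's orbit.

    Periods must share one unit and distances another; only ratios enter.
    """
    return (p_moon / p_earth) ** 2 * (a_earth / a_moon) ** 3 * (1.0 + moon_to_earth)


# Assumed rounded inputs: sidereal month and year in days, distances in km.
ratio = sun_to_earth_mass_ratio(
    p_moon=27.32, p_earth=365.26,
    a_earth=1.496e8, a_moon=3.844e5,
    moon_to_earth=1.0 / 81.3,
)

# Multiply by the Earth's mass quoted in the text to get the Sun's mass in kg.
m_sun_kg = ratio * 5.98e24
print(ratio, m_sun_kg)
```

Because only ratios of periods and of distances appear, the choice of days and kilometers is arbitrary; any consistent units would do.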
3.1.2 Masses of binary stars


Binary and multiple stars are quite common: at least 50% of the “stars” 
within 5 pc of the Sun are actually double or multiple, so that a large majority 
of stars are members of such systems. For our purposes it is important to 
distinguish (i) visual binaries, in which the individual components can 
be seen directly, (ii) spectroscopic binaries, which reveal their orbital 
motion by shifts in the wavelengths of their spectral lines, and (iii) eclipsing 
binaries, which show evidence of orbital motion by periodic variations in 
their apparent brightnesses due to one star passing in front of the other. We 
can obtain different kinds of astrophysical information from each of these 
groups (which are not mutually exclusive). An excellent general discussion of 
binary and multiple stars can be found in Batten (1973), while an important 
review is that of Popper (1980). 


Visual binaries When stars are examined telescopically, relatively close 
pairs are frequently found. Some of these stars are mere optical doubles; 
that is, two stars that happen to lie, by coincidence, along nearly the same 
line of sight. Herschel searched for optical doubles in the hope of measuring 
the parallax of the nearer star relative to the further one but in 1781 he 
concluded that most of the systems he had found were actually physical 
binaries: stars that orbit around one another. Careful measurements of the 
separation and position angle of the two stars make it possible to determine 
the projection onto the plane of the sky of the relative orbit of one star 
around the other. The true orbit is an ellipse, and one can show that the 
projected orbit is also an ellipse. 

Because the two components of a visual binary must be resolved, we 
tend to select systems that are widely separated in linear distance. This 
separation implies that they will have long periods, and in fact many known 
visual binaries have completed only a fraction of an orbital revolution in the 
entire time they have been observed (sometimes a century or more). The 
phenomenon of ‘seeing’ (§3.2.3) hinders measurement of binary-star sepa- 
rations smaller than about 0.15 arcsec, although interferometric techniques 
(see §3.2) now allow smaller separations to be studied. For example, Pan et 
al. (1992) have resolved the orbit of the spectroscopic binary α And, which
has a major-axis separation of 24 mas.

From a geometric analysis of the apparent orbit [for details see Green 
(1985), Chapter 19], one can obtain the ‘elements’ that describe the shape 
and orientation of the true relative orbit of a visual binary. In particular, 
one can measure the semimajor axis $a''$ in arcseconds. Now, if we know the
distance to the system and hence $\varpi$, we can convert the observed value of $a''$
to linear measure via equation (3.5), and, knowing $P$, we can find the sum
of the masses $(\mathcal{M}_1 + \mathcal{M}_2)$ in solar units from Kepler's third law, equation
(3.4). In practice, the lack of reliable parallaxes, which enter as the third
power in the expression for $(\mathcal{M}_1 + \mathcal{M}_2)$, poses a very severe difficulty for the
determination of accurate stellar masses.

To find the individual masses ($\mathcal{M}_1$ and $\mathcal{M}_2$), we need to know not
only the relative orbit of the two stars but also the orbit of each of the 
components around their common center of mass. The center of mass of a 
binary system moves nearly on a straight line. The ratio of the two masses 
is easily determined as the inverse of the ratio of the amplitudes of the 
individual motions relative to the straight-line path of the center of mass. 

Reliable masses are available for only about fifty visual binaries because 
of the difficulties of obtaining accurate distances and accurate apparent orbits 
— an easily resolved system usually has a long period, and hence only a 
fraction of its orbit has been observed, whereas systems with short periods 
are often at the limit of resolution. Lists of well-determined visual-binary 
masses and a discussion of the practical difficulties are given in Popper (1980) 
and Henry & McCarthy (1993). 

We have just noted that masses found from visual binaries depend sen- 
sitively on the parallax of the system. This fact suggests that it can be 
advantageous to turn the procedure around and use it to estimate the parallax
instead of the stellar masses. Thus, if a reasonably good guess can be
made for $\mathcal{M}_1$ and $\mathcal{M}_2$, for example by assuming that the stars' masses are
normal for their spectral types, then we can determine the distance $d$ to the
system. Clearly, if the system's semi-major axis $a$ subtends an angle $\theta$ at the
Sun, then $d = a/\theta$. If $\mathcal{M}_1$ and $\mathcal{M}_2$ are given in solar units and the period $P$
in years, equation (3.4) gives $a$ in AU. Hence

$$\frac{d}{\mathrm{pc}} = \frac{(\mathcal{M}_1 + \mathcal{M}_2)^{1/3} P^{2/3}}{\theta/\mathrm{arcsec}}. \tag{3.8}$$

Such a distance estimate is referred to as a dynamical distance. Because
only the cube root of the total mass enters in this formula, even a crude
estimate of $\mathcal{M}_1$ and $\mathcal{M}_2$ will yield $d$ with good accuracy. Once we have an
estimate of $d$ we can estimate the absolute magnitude of each star. We can
then obtain a more refined estimate of $\mathcal{M}_1$ and $\mathcal{M}_2$ from the appropriate
mass-luminosity relation (see §3.5.2). With improved values for the masses,
a better value for $d$ can be derived from equation (3.8). If necessary, the
procedure may be iterated; it converges rapidly.
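The iteration just described can be sketched as follows. This is only an illustration under stated assumptions: the mass-luminosity relation used here ($L \propto \mathcal{M}^4$, applied directly to V-band magnitudes with an assumed solar $M_V = 4.83$) is a crude hypothetical stand-in for the relation of §3.5.2, and the apparent magnitudes, period and angular semimajor axis are invented.

```python
import math

M_V_SUN = 4.83  # assumed absolute V magnitude of the Sun


def dynamical_distance_pc(m1, m2, period_yr, theta_arcsec):
    """Equation (3.8): masses in solar units, P in years, theta in arcsec."""
    return (m1 + m2) ** (1.0 / 3.0) * period_yr ** (2.0 / 3.0) / theta_arcsec


def mass_from_abs_mag(abs_mag):
    """Hypothetical mass-luminosity relation L ~ M^4 (illustration only)."""
    lum_solar = 10.0 ** (-0.4 * (abs_mag - M_V_SUN))
    return lum_solar ** 0.25


def iterate_distance(app_mags, period_yr, theta_arcsec, n_iter=10):
    """Alternate between equation (3.8) and the mass-luminosity relation."""
    m1 = m2 = 1.0  # crude first guess: two solar-mass stars
    d = dynamical_distance_pc(m1, m2, period_yr, theta_arcsec)
    for _ in range(n_iter):
        # Absolute magnitudes implied by the current distance estimate.
        abs_mags = [m - 5.0 * math.log10(d / 10.0) for m in app_mags]
        m1, m2 = (mass_from_abs_mag(mag) for mag in abs_mags)
        d = dynamical_distance_pc(m1, m2, period_yr, theta_arcsec)
    return d, m1, m2


# Invented example: V = 6.0 and 7.0, P = 50 yr, theta = 1.0 arcsec.
d_first = dynamical_distance_pc(1.0, 1.0, 50.0, 1.0)
d_final, m1, m2 = iterate_distance([6.0, 7.0], 50.0, 1.0)
print(d_first, d_final, m1, m2)
```

With this assumed relation a fractional error in the total mass shrinks by roughly a factor of six on each pass, which is why a handful of iterations suffices.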

Distances obtained from the dynamical parallax method are not funda- 
mental as they are based on the assumption that the stars in the binary are 
normal. Nevertheless, they are often very useful estimates and occasionally 
provide an important check on other methods. For example, in the 1960s, the 
dynamical parallaxes of binaries in the Hyades were discordant with the dis- 
tance determined by the moving-cluster method (see §2.2.2), and ultimately 
led to a revision of the accepted distance to the Hyades. 


Spectroscopic binaries If the orbital velocities of the components of a
binary are large enough, we can measure them by the Doppler effect with
reasonable accuracy (§2.2.1). From these measurements we can make de-
ductions regarding the masses of the components. Generally, such binaries
have short periods and small separations because these imply large mutual
velocities. The periods may be as short as 0.1 day, in which case the stars 
may touch one another, or they can be as long as 7500 days, in which case 
the stars may be so widely separated as to be resolved as visual binaries. 
However, most of the stars for which we can determine velocities spectro- 
scopically are too close to be resolved from the ground. Such systems are 
called spectroscopic binaries. 

There are two types of spectroscopic binary. In the spectrum of a 
double-line spectroscopic binary the lines split into two components. 
These components show a periodic back-and-forth shift in wavelength but 
are in antiphase with one another. In these systems we can measure the 
velocities of both components. In the spectrum of a single-line spectro- 
scopic binary all the measurable lines move in phase with one another. In 
these systems one component is so much brighter than the other (typically,
$\Delta m \gtrsim 2.5$ mag) that it swamps the spectrum of the secondary and renders
it invisible. For a given $\Delta m$, it is generally easier to detect both spectra if
the two components are of very different colors.

From an analysis of the radial velocity of one or both of the components 
as a function of time, one can determine the elements of the binary orbit 
[see Green (1985), Chapter 19 for detailed methods]. The orbital plane is 
inclined to the plane of the sky by some angle $i$, which in general is unknown
and cannot be determined from the spectroscopic data alone, because the
observed radial velocity $v_r$ yields only the projection of the orbital velocity
$v$ along the line of sight; that is, $v_r = v\sin i$. As we shall see, this fact limits
the information we can obtain about the system unless $i$ can be determined
in some other way.

Individual masses can be determined only for the favorable case of a 
double-line binary, so we restrict our attention to this case. Suppose for 
simplicity that the orbits of both stars are circular. Then, from the observed 
radial velocities, we can immediately determine the projected radii (in linear 
units, that is, kilometers or astronomical units) of the two orbits: equating 
distance travelled to speed times time, we have $2\pi a_i = v_i P$, or

$$a_1 \sin i = \frac{v_{r1}P}{2\pi}, \qquad a_2 \sin i = \frac{v_{r2}P}{2\pi}, \tag{3.9}$$

where $v_{r1}$ is the amplitude of the observed oscillations in the radial velocity
of the first star, etc. The mass ratio is thus

$$\frac{\mathcal{M}_1}{\mathcal{M}_2} = \frac{a_2}{a_1} = \frac{v_{r2}}{v_{r1}}. \tag{3.10}$$

Because we do not know the semi-major axis $a = a_1 + a_2$ but only its
projection $a\sin i = (v_{r1} + v_{r2})P/(2\pi)$, we cannot determine $(\mathcal{M}_1 + \mathcal{M}_2)$
from Kepler's third law [equation (3.1)] but only

$$(\mathcal{M}_1 + \mathcal{M}_2)\sin^3 i = \frac{P}{2\pi G}(v_{r1} + v_{r2})^3. \tag{3.11}$$

Figure 3.1 Absolute visual magnitude versus mass for components
of nearby binary stars. Visual and eclipsing binaries are marked by
squares and circles, respectively. [After Latham (1998) courtesy of
D. Latham]


Thus from equations (3.10) and (3.11) we can find $\mathcal{M}_1\sin^3 i$ and $\mathcal{M}_2\sin^3 i$
separately, but we obtain $\mathcal{M}_1$ and $\mathcal{M}_2$ only if we can determine the inclina-
tion by some other means.
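A short Python sketch of this calculation for a double-line binary with circular orbits is given below. It assumes that the total mass follows from inserting $a\sin i = (v_{r1} + v_{r2})P/(2\pi)$ into Kepler's third law, so that $(\mathcal{M}_1 + \mathcal{M}_2)\sin^3 i = P(v_{r1} + v_{r2})^3/(2\pi G)$; the physical constants are rounded, and the velocity amplitudes and period are invented.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2 (rounded)
M_SUN = 1.989e30   # solar mass in kg (rounded)
DAY = 86400.0      # seconds per day


def msin3i(v_r1_kms, v_r2_kms, period_days):
    """Return (M1 sin^3 i, M2 sin^3 i) in solar masses for circular orbits."""
    v1 = v_r1_kms * 1.0e3          # velocity amplitudes in m/s
    v2 = v_r2_kms * 1.0e3
    period = period_days * DAY     # period in seconds
    # Kepler's third law with a sin i in place of a.
    total = period * (v1 + v2) ** 3 / (2.0 * math.pi * G) / M_SUN
    ratio = v2 / v1                # M1 / M2, equation (3.10)
    m2 = total / (1.0 + ratio)
    m1 = total - m2
    return m1, m2


# Invented example: amplitudes of 100 and 50 km/s, period of 10 days.
m1, m2 = msin3i(100.0, 50.0, 10.0)
print(m1, m2)
```

The star with the smaller velocity amplitude is the more massive one, and both values are lower bounds on the true masses until the inclination is known.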

Because we know that $\sin^3 i$ must always be less than or equal to unity,
from spectroscopic data alone we can determine a lower bound on the masses
of the binary components. A statistical correction for inclination effects can
be made by using the result that $\langle\sin^3 i\rangle = 0.59$ for orbital planes oriented
randomly with respect to the plane of the sky (Problem 3.1). This procedure
is open to question, however, as selection effects obviously favor the discovery
of systems with large values of $\sin i$: these have larger radial velocities for a
given orbital velocity.
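The quoted average can be checked numerically. The sketch below assumes that, for randomly oriented orbital planes, the inclination is distributed with probability density proportional to $\sin i$ on $0 \le i \le \pi/2$, and evaluates the average with a simple midpoint rule; it is a numerical check, not a substitute for the derivation asked for in Problem 3.1.

```python
import math


def mean_sin3(n_steps=100_000):
    """Average of sin^3(i) weighted by sin(i) over 0 <= i <= pi/2."""
    h = (math.pi / 2.0) / n_steps
    numerator = 0.0
    denominator = 0.0
    for k in range(n_steps):
        i = (k + 0.5) * h                 # midpoint of the k-th interval
        weight = math.sin(i)              # density of random orientations
        numerator += math.sin(i) ** 3 * weight
        denominator += weight
    return numerator / denominator


average = mean_sin3()
print(round(average, 2))
```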

In this section we have studied the constraints one can impose on the 
masses of components of binary stars. The problem with visual binaries is 
that the masses depend sensitively on the system's distance. If the radial
velocity $v_{r1}$ of one component is measurable, we can determine the physi-
cal length $a_1$ from equation (3.9), and thus obtain the distance $d$ from our
astrometric measurement of the angle $a_1/d$.

The problem with spectroscopic binaries is ignorance of $\sin i$. In the
case of an eclipsing binary, $\sin i$ can be measured: if the stars are reasonably
well separated, the occurrence of eclipses implies that the orbital plane lies
nearly in the line of sight, and hence we know that $i \simeq 90°$ and $\sin i \simeq 1$;
in close binaries, eclipses can still occur for inclinations significantly smaller
than 90°, but even for these systems an analysis of high-quality photometric
data enables one to determine $i$.

From visual binaries with accurate distances and eclipsing binaries, ac-
curate masses have been determined for over 100 stars. Figure 3.1 shows
these masses plotted against $M_V$. It is clear that these two variables are
tightly correlated.


3.2 The radii of stars 


The radius of a star is a parameter of great importance because (i) for a 
given mass, it sets the stellar surface gravity and mean density (which have 
important implications in terms of stellar structure and evolution; see Chap-
ter 5), and (ii) for a given luminosity, it determines the effective temperature
of the stellar atmosphere (see §3.4) and hence the appearance of the star's spec-
trum. To measure radii, we can either (i) measure a star's angular diameter,
which, when combined with the star’s distance, gives its physical size, or 
(ii) use observations of the light curves and orbital velocities in an eclipsing 
spectroscopic binary to obtain the component stars’ radii in physical units, 
independent of knowledge of the system’s distance. McAlister (1985) reviews 
this topic. 

If we can resolve the disk of a star, we can measure its angular diameter 
directly. At present, this can be done for precisely one star: the Sun. The Sun
has an angular diameter of 0.5331°, which implies that $R_\odot = 0.004652\,\mathrm{AU} =
6.96 \times 10^5$ km. Viewed from a distance of 2 pc, the Sun would have an angular
diameter of only 5 mas, which is much smaller than the lower limit, $\sim 0.3$ arcsec,
on the diameter of telescope images that is set by turbulence in the Earth's
atmosphere. Hence, the diameters of stars that are as small as the Sun cannot
be measured by simple imaging. Some stars have radii of the order of a few
hundred solar radii (see §3.5 below) but typically they lie at distances of 10
to 20 pc. Hence, even they have angular diameters $\lesssim 50$ mas which cannot
be measured by ground-based imaging (although they are in principle within 
the range of the HST). However, we now discuss a number of cunning ploys 
that have enabled astronomers to measure the diameters of some dozens of 
nearby stars. 
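The solar figures quoted above are easy to reproduce. The following sketch assumes a rounded value for the astronomical unit in kilometers and uses the fact that a length of 1 AU subtends 1 arcsec at a distance of 1 pc; the function names are illustrative.

```python
import math

AU_KM = 1.495979e8  # assumed length of the astronomical unit in km


def radius_au_from_angular_diameter(diameter_deg):
    """Radius in AU of a sphere seen from 1 AU with the given angular diameter."""
    # For a sphere, sin(theta / 2) = R / d, with d = 1 AU here.
    return math.sin(math.radians(diameter_deg) / 2.0)


def angular_diameter_mas(radius_au, distance_pc):
    """Small-angle angular diameter in milliarcseconds."""
    # 1 AU at 1 pc subtends 1 arcsec, so 2R/d is already in arcsec.
    return 2.0 * radius_au / distance_pc * 1.0e3


r_sun_au = radius_au_from_angular_diameter(0.5331)
r_sun_km = r_sun_au * AU_KM
sun_at_2pc_mas = angular_diameter_mas(r_sun_au, 2.0)
print(r_sun_au, r_sun_km, sun_at_2pc_mas)
```

The last number comes out a little under 5 mas, which is the rounded value used in the text.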


3.2.1 Phase interferometry 


Around 1920, A.A. Michelson and F.G. Pease measured the angular diam- 
eters of several nearby stars by observing interference patterns produced in 
an interferometer formed by mounting a 25-foot (~ 8m) track bearing two 
small mirrors (equivalent to the slits of a Young’s double-slit experiment) 
on the 100-inch Mount Wilson telescope. The visibility of the fringes set 
up when this system is directed at a star of angular diameter $\theta$ declines
as the distance $D$ between the entrance mirrors is increased. The visibility
effectively vanishes when the fringes produced by points on opposite sides
of the star get 180° out of phase; this happens when $D \sim \tfrac{1}{2}\lambda/\theta \sim 5$ m for
$\theta = 20$ mas. In this way, Michelson and Pease obtained the angular diame-
ters of about ten red giants. These angular diameters, when combined with
distances, give stellar radii in physical units. A purpose-built optical inter-
ferometer operates on Mount Wilson and has measured angular diameters to
an accuracy of $\lesssim 0.1$ mas (Mozurkewich et al. 1991). Another purpose-built
optical interferometer is operated by the University of Sydney (Davis 1994). 
Since ‘seeing’ (§3.2.3) is a smaller effect at infrared wavelengths, it is advan- 
tageous to measure angular diameters at the longest feasible wavelengths. Di 
Benedetto & Rabbia (1987) have used an infrared interferometer to measure 
angular diameters. 


3.2.2 Intensity interferometry 


Brown & Twiss (1956) developed a new interferometric technique that ex- 
ploits the fact that the fluctuations in the intensities of the signals received 
from a star by two instruments separated by distance D are correlated in a 
way that depends on D and the angular diameter of the star. In intensity 
interferometry, phase coherence is not required, and it becomes possible to 
use separate telescopes and large baselines (Brown 1968). An interferome-
ter consisting of two 6.5 m mosaic “light-buckets” separated by baselines of
length 10 to 190 m operated for several years at Narrabri, Australia, and
produced angular-diameter measurements for about thirty stars (Brown et 
al. 1974b, Brown et al. 1974a). 

Angular diameters as small as 0.5 mas have been measured with this
technique. However, the method is restricted to hot stars, which (with the 
Narrabri interferometer) must be brighter than V ~ 2.5. The list of such 
stars observable from the southern hemisphere has already been exhausted, 
and new results will emerge only from northern-hemisphere observations, or 
from the use of a larger instrument that will reach fainter stars. 


3.2.3 Speckle interferometry 


In 1970 A. Labeyrie pointed out that, at any instant, the pattern of light 
produced by a star at the focus of a large telescope is speckled. Each speckle 
is an independent, diffraction-limited stellar image. Turbulence in the air 
above the telescope causes these images to dance around by introducing 
fluctuations in the phase of the wave front entering the telescope. In an 
exposure lasting longer than a fraction of a second, all these speckles are 
superimposed to produce a single splodge of brightness. One says that the 
image has been degraded by the phenomenon of ‘seeing’, and refers to the 
splodge’s full width at half maximum as the seeing. 

The technique of speckle interferometry involves making a very large 
number of short exposures of the field. An individual exposure is so short 
that the speckle pattern does not move appreciably as it is taken, and thus 
contains information on the angular size of the star. On the other hand,
even when the imaged star is fairly bright, the number of photons in an
individual exposure is not great, and therefore the quality of the image it
provides is poor. However, by combining information from a large number
of exposures, it is possible to compensate for their poor individual quality.
With a 5 m telescope imaging at optical wavelengths, this method yields
reliable angular diameters down to about 20 mas. A review of this method
and its accomplishments is given in Labeyrie (1978). 


3.2.4 Lunar occultations 


As the Moon (or a planet) moves on the sky, it occasionally passes in front of 
a star and occults it. Typically, the light from a star is extinguished within 
a few milliseconds. To the extent that we can idealize a star as a point source 
and the Moon’s disk as a semi-infinite plane, we expect a Fresnel diffraction 
pattern to sweep over the Earth as the Moon passes in front of the star.
This pattern can be measured by recording the brightness of the star as a 
function of time during the occultation. Because the star is not actually 
a point source but has a finite (if small) angular diameter, the diffraction 
pattern differs slightly from a Fresnel pattern, and, by a careful analysis of 
the data, one can infer the star’s angular diameter (e.g., Brown 1968, Nather 
& McCants 1970). White & Feierman (1987) list diameters obtained in this 
way for 124 stars, including several fainter than $V = 7$ and with angular
diameters $\lesssim 1$ mas. The method is obviously restricted to stars near the
plane of the ecliptic, and it is limited in part by imperfect knowledge of the 
irregularities of the Moon’s limb. 


3.2.5 Eclipsing binaries 


It sometimes happens that the orbital plane of a binary system is observed 
nearly edge-on, so that, as the stars revolve around their common center of 
mass, one star passes in front of the other (see Figure 3.2) and hence produces 
an eclipse. There will then be periodic variations in the light received from 
the system; a plot of the system’s apparent magnitude as a function of time is 
called its light curve. The first eclipsing binary to be discovered was Algol
(β Per), which was noticed to be variable in 1670 and explained physically by
J. Goodricke in 1782. Most known eclipsing systems have short periods (90%
less than 10 days), but a few have very long periods (for example, ε Aur has
period $P = 27$ years). The probability of discovering a long-period eclipsing
system is small, both because the orbital plane of a widely separated system 
must be inclined almost exactly 90° to the plane of the sky and because the 
eclipses occupy only a tiny fraction of the orbital period. 

The primary eclipse occurs when the star having the higher surface
brightness (hence higher effective temperature; see §3.4) is eclipsed, and the
secondary eclipse occurs when the component having the lower surface
brightness is eclipsed. These phenomena are illustrated in Figure 3.2 for the
case in which the smaller star is a hot dwarf and the larger star a cool giant.
The simplest case to analyze occurs if the orbits are circular and the orbital
plane is inclined at 90° to the plane of the sky. Then the eclipses are total
(primary eclipse in Figure 3.2) and annular (secondary eclipse in Figure 3.2).
Let $d_1$ be the diameter of the larger star, and $d_2$ the diameter of the smaller.
These diameters in units of the orbital radius $a$ can be determined from
the duration of the eclipses. For example, if $t_1,\ldots,t_4$ denote times of first
contact,...,fourth contact during the secondary eclipse (Figure 3.2), then

$$\frac{t_4 - t_1}{P} = \frac{d_1 + d_2}{2\pi a}, \qquad \frac{t_3 - t_2}{P} = \frac{d_1 - d_2}{2\pi a}, \tag{3.12}$$

where for convenience we have assumed that $d_1 \ll a$ and $d_2 \ll a$. Similar
formulae can be written for the primary eclipse, and they provide a check on
the results.

Figure 3.2 If the orbital plane of a
binary happens to be nearly edge-on
as seen from the Earth, then eclipses
will occur as one of the components
passes in front of the other. The
eclipse geometry is shown above, and
a schematic light curve for the case
in which the smaller star has the
greater surface brightness is shown
below. The binary period is P.

Furthermore, it is clear that, if $f_1$ and $f_2$ are the apparent brightnesses of
the larger and smaller stars, respectively, and $h_p$ is the residual brightness
of the system at primary eclipse in units of the total brightness outside of
eclipse, then for the case illustrated in Figure 3.2,

$$\frac{f_1}{f_1 + f_2} = h_p, \qquad \frac{f_2}{f_1 + f_2} = 1 - h_p. \tag{3.13}$$

Again, a check on the results can be obtained from the secondary eclipse
because then we know that

$$\frac{k^2 f_1}{f_1 + f_2} = 1 - h_s, \tag{3.14}$$

where $h_s$ is the residual brightness at secondary eclipse and $k^2 = (d_2/d_1)^2$;
$k^2$ should be found to be consistent with the results of equations (3.12).

If, in addition to measuring the light curve, we can also measure the 
radial velocities of the two stars (that is, the star is an eclipsing spectroscopic 
binary), then we can determine the orbital radius in linear units as $a =
v_{\max}P/(2\pi)$, where $v_{\max}$ is the observed maximum relative radial velocity of
the two components. We can then re-express the stellar radii in linear units.
In this favorable case, we determine both masses (§3.1.2) and radii (from the 
eclipses) for the stars. 
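For the idealized case of circular, exactly edge-on orbits with both stars much smaller than the orbit, the conversion from eclipse timings to linear diameters can be sketched as follows. The sketch assumes that between first and fourth contact the stars' relative displacement is $d_1 + d_2$, and between second and third contact $d_1 - d_2$, at relative speed $2\pi a/P$; the contact times, period and velocity are invented.

```python
import math

DAY = 86400.0  # seconds per day


def eclipse_diameters(t1, t2, t3, t4, period_days, v_max_kms):
    """Return (a, d1, d2) in km from the contact times (days) of an annular eclipse.

    Assumes circular, edge-on orbits and stars much smaller than the orbit.
    """
    a_km = v_max_kms * period_days * DAY / (2.0 * math.pi)    # a = v_max P / (2 pi)
    sum_over_a = 2.0 * math.pi * (t4 - t1) / period_days      # (d1 + d2) / a
    diff_over_a = 2.0 * math.pi * (t3 - t2) / period_days     # (d1 - d2) / a
    d1_km = 0.5 * (sum_over_a + diff_over_a) * a_km           # larger star
    d2_km = 0.5 * (sum_over_a - diff_over_a) * a_km           # smaller star
    return a_km, d1_km, d2_km


# Invented example: contacts at 0.00, 0.05, 0.15 and 0.20 days,
# a 10-day period, and a maximum relative radial velocity of 150 km/s.
a_km, d1_km, d2_km = eclipse_diameters(0.00, 0.05, 0.15, 0.20, 10.0, 150.0)
print(a_km, d1_km, d2_km)
```

Note that the orbital radius cancels when the two steps are combined: the diameters depend only on the relative velocity and the contact-time intervals.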

More generally, a detailed analysis of the light curve yields complete 
orbital elements for the system, including parameters that specify the shape 
and orientation of the orbit (in particular the inclination $i$) and that describe
the radii and surface-brightness distributions of the two component stars 
[see Green (1985), Chapter 19 for methods]. In practice, the analysis is 
difficult because the relationships between stellar and orbital parameters and 
the properties of the observed light curve are complex. The quality of the 
results depends sensitively on the accuracy of the data. Moreover, numerous 
physical complications can occur. For example, the eclipses may be only 
partial, the orbits may be elliptical, the stars may be deformed (oblate or 
even ellipsoidal), the light curves may contain reflection effects (light from 
one star illuminating the other), or mass exchange may occur in the system. 
Most of these complications occur in close binaries, which are therefore often 
extremely troublesome to analyze [see Kopal (1959) for details]. In view of 
the complexity of the problem, it is perhaps not surprising that accurate 
results are available for only a few hundred systems (Karetnikov 1991). 


3.2.6 Astrophysical estimates 


For many astrophysically interesting stellar types, none of the direct methods 
just described can be applied, and yet even an approximate estimate of the 
radii of these stars can be of enormous importance for an understanding of 
stellar structure and evolution. In such cases, we can estimate radii by means 
of Stefan’s law for black-body radiation if we know a star’s luminosity (from 
its apparent magnitude and distance) and its effective temperature (from
spectroscopic evidence); see Blackwell & Shallis (1977). Such estimates,
although not fundamental measurements, can nevertheless be quite accurate 
and useful. 
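Such an estimate amounts to inverting $L = 4\pi R^2 \sigma T_{\rm eff}^4$. Expressed relative to the Sun, the constants cancel, as in the sketch below; the solar effective temperature adopted here and the parameters of the example star are assumptions made for illustration.

```python
T_SUN = 5772.0  # assumed effective temperature of the Sun in K


def radius_solar(lum_solar, t_eff_k):
    """Radius in solar radii from luminosity (solar units) and T_eff (K).

    From L = 4 pi R^2 sigma T^4:  R / R_sun = sqrt(L / L_sun) * (T_sun / T)^2.
    """
    return lum_solar ** 0.5 * (T_SUN / t_eff_k) ** 2


# Hypothetical cool giant: 100 solar luminosities at 4000 K.
r_giant = radius_solar(100.0, 4000.0)
print(r_giant)
```

Because the temperature enters squared, a 5% error in $T_{\rm eff}$ produces about a 10% error in the radius, whereas a 10% error in luminosity produces only about 5%.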


3.3 Classification of stars 


When a star is mentioned in a conversation, we obviously want to know at 
the outset what type of star it is. Is it luminous or faint? Is it blue or red? Is 
it old or young? Is it a variable star? Does it have unusual emission at, say, 
X-ray wavelengths or in the infrared? The answers to many of these questions 
can be most concisely communicated by relating the star under discussion to 
other, better known stars. A powerful way of establishing such relations is 
to group stars into classes according to readily observable properties. Then, 
when we tell a colleague that some star is classified such-and-such, she will 
know that this star resembles certain other stars, with whose properties she 
is already familiar. Hence, an essential first step in the study of stars is the 
setting up of convenient classification schemes. 

Observations of a star rather straightforwardly yield two functions: the 
star’s spectrum, or intensity of radiation as a function of wavelength, and its 
light curve, or intensity of radiation as a function of time. Since the great 
majority of stars have rather boring light curves, most stars are classified 
by their spectra rather than their light curves, and most of this section is 
devoted to spectral classification. A few important stellar classes are defined 
in terms of light curves, however. In §§5.1.10 and 5.2.3 we shall encounter
several of these classes; here we mention only two.


Novae  A classical nova is a star that in a matter of hours brightens by
up to 11 mag to an absolute magnitude in the range −4.8 ≳ M_V ≳ −8.9.
The precursor object is usually observed to be a white dwarf. A typical nova 
has declined by ~ 6 mag six weeks after achieving peak brightness, but some 
novae fade much more slowly. 


Pulsars A pulsar is an object that emits remarkably regular pulses of 
electromagnetic waves. A pulsar light curve usually has two peaks per pe- 
riod. The shapes of these peaks, which can be very different, vary with the 
frequency at which the light curve is measured, but the period at which the 
light curve repeats itself is the same at all frequencies, and is usually as stable 
as the periods of the best terrestrial clocks. Some pulsars have periods
as short as a millisecond, while others have periods of tens of seconds. 

Soon after their discovery by Hewish et al. (1968) it was established 
that pulsars are rotating neutron stars: the cores of dead stars that have 
been crushed by gravity to densities larger than those in ordinary atomic 
nuclei; a typical neutron star has a mass of ~ 1.4 M⊙ (Figure 5.6) and a
radius of 10 km, so its mean density is an astonishing ~ 670 million tonnes per
cubic centimeter. Most of the volume of a neutron star is occupied by a fluid
that consists primarily of neutrons but contains a sprinkling of protons and
electrons. 

The mechanism by which pulsars pulse is not fully understood, but is 
believed to involve a magnetic field that is anchored within the star in the 
same way that the Earth’s magnetic field is anchored within the Earth, and 
rotates with the star. Two beams, or fans, of broad-band electromagnetic 
radiation sweep round as the star rotates, very much as two beams of light 
emerge from the rotating lantern of a lighthouse. Shklovskii (1978) gives a
very readable account of the discovery and physics of neutron stars. 


3.3.1 Classification of stellar spectra 


There are two very different ways in which to set about extracting astro- 
physical information from complex stellar spectra. We shall categorize them 
as ‘spectrum analysis’ and ‘spectral classification’. In this section, we shall 
deal mainly with the latter, but it is worthwhile to consider both in broad 
terms for the sake of orientation. 

When we examine the light from a star with a spectrograph, we observe 
a bright continuum of radiation, upon which are superimposed absorption 
lines (and, occasionally, emission lines). The distribution of energy in the 
continuum and the profiles of the lines are determined by the physical condi- 
tions in the stellar atmosphere — its temperatures, densities, velocity fields, 
magnetic fields, element abundance and so on. Hence, by a suitable theo- 
retical analysis of these features, we can derive a detailed physical picture of 
the structure and composition of the outer layers of stars. 

The basic data required for a spectrum analysis are quantitative de- 
scriptions of line profiles, and descriptions of the frequency variation of the 
continuum. These data are obtained from spectrophotometric measure- 
ments. The continuum information can be obtained from color indices or 
from absolute energy distributions (see §2.3). The line data are obtained 
from high-dispersion spectra and are usually quantified by determining the 
equivalent width of each line: this is the width of a section of the contin- 
uum near the line which contains as much light as is either contributed by 
the line (for an emission line) or blocked by the line (for an absorption line) 
— see Figure 3.3. 
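
The definition can be made concrete with a short numerical sketch. The Python fragment below integrates the fractional line depth, 1 − F/F_c, over wavelength with the trapezoidal rule; the function name, the sign convention (positive for absorption, negative for emission) and the synthetic Gaussian line are assumptions made for illustration only.

```python
import math


def equivalent_width(wavelengths, fluxes, continuum):
    """Trapezoidal integral of (1 - F/F_c) over wavelength.

    The result has the units of `wavelengths`; it is positive for an
    absorption line and negative for an emission line.
    """
    total = 0.0
    for i in range(len(wavelengths) - 1):
        d0 = 1.0 - fluxes[i] / continuum[i]
        d1 = 1.0 - fluxes[i + 1] / continuum[i + 1]
        total += 0.5 * (d0 + d1) * (wavelengths[i + 1] - wavelengths[i])
    return total


# Synthetic absorption line: a Gaussian dip on a flat continuum (illustrative values).
depth0, sigma, centre = 0.4, 0.3, 486.1           # central depth, width (nm), centre (nm)
wl = [480.0 + 0.005 * i for i in range(2401)]     # 480 nm to 492 nm
cont = [1.0] * len(wl)
flux = [1.0 - depth0 * math.exp(-0.5 * ((w - centre) / sigma) ** 2) for w in wl]

ew = equivalent_width(wl, flux, cont)
ew_analytic = depth0 * sigma * math.sqrt(2.0 * math.pi)   # exact area of a Gaussian dip
print(ew, ew_analytic)
```

For real data the difficult step is usually not the integration but the placement of the continuum, which this sketch simply takes as given.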

The essence of the procedure for determining the physical conditions 
in the outer layers of a star is to describe the star’s spectrum with a small 
number of numerical indices and then, by a theoretical calculation using es- 
tablished physical laws, to determine the physical conditions in the stellar 
atmosphere that are required to match these numerical values. The proce- 
dure is complicated and we shall not discuss it here [see Mihalas (1978)], 
although we shall use the results of such work in following chapters, par- 
ticularly Chapter 5. In practice, these analyses are very time consuming 
and are restricted to relatively bright stars for which suitable spectra can be 
obtained. 

Figure 3.3 The equivalent width of the emission line shown here is the
width of the shaded trapezium; this trapezium has the same area as the
diagonally hatched area.


In spectral classification, one simply groups together spectra that 
have similar structure (morphology), without, in the first instance, seeking 
to understand the physical significance of these features. In principle, one 
could classify in terms of the whole spectrum, but in practice most classifi- 
cations are based on only the part of the spectrum transmitted through the 
Earth’s atmosphere. One first chooses one or more sets of labels (dimen- 
sions) with which to categorize the spectrum. The standard system now in 
use (the ‘MK system’ discussed below) employs two sets of labels; that is, 
it is two-dimensional. For each choice of labels, one declares the name of 
a standard star whose spectrum defines what is meant by that particular 
classification. Successive choices of labels to which standards are assigned 
establish classification ‘boxes’, and the variation of the nature of the spectrum
from box to box is fixed by the properties of the standards in those
boxes. Each classification box corresponds to a unique spectral type of the 
system. When the whole system is defined, one classifies a star by finding the 
standard whose spectrum it most closely matches (or interpolates between 
standards, if necessary). In this way, each star is put into one of the boxes of 
the system. We shall see later that similar systems are used for classifying 
star clusters, galaxies, and clusters of galaxies. 

If an astute choice of dimensions has been made, and if the classifica- 
tion process itself is done with care using well-defined criteria, then (i) a 
group of stars having the same type will, in fact, be nearly identical to one 
another in those physical properties that produce variations of the classifica- 
tion criteria (but not necessarily in all properties), and (ii) the groups with 
different types will be distinguishable from one another in a significant way. 
After the system has been defined and the classification carried out, one 
then calibrates the system in terms of physical parameters, such as stellar 
temperatures, luminosities, compositions, and so on. Quantitative estimates
of these parameters are derived for each spectral type by performing a de- 
tailed spectrum analysis on a typical member of that type. At that point one 
can say that, if some star has a certain spectral type, then the temperature, 
luminosity, or other property appropriate to that type can be assigned to it
without further analysis. The spectral type thus gives a concise description 
of both the spectrum and the physical properties of a star. 

In practice, classification is normally done by visual inspection of spec- 
tra of moderate dispersion, and can be carried out for large numbers of stars. 
Pioneering work in spectral classification was done in the 1860s by A. Secchi, 
who divided stars into four broad spectral classes. Parallel efforts were made 
at about the same time by W. Huggins and H.C. Vogel. The first great 
steps toward our present system were made at Harvard College Observa- 
tory in 1890. Under the direction of E.C. Pickering, Williamina P. Fleming 
published a catalog of 10,000 stars grouped into a system of spectral types 
denoted by the letters A, B, C, and so on. In 1888, Antonia C. Maury, 
without benefit of astrophysical data (which were almost nonexistent at the 
time), rearranged these spectral types into the order that has been used 
ever since, solely by studying the progression of line patterns observed in 
the spectra. Subsequently, Annie J. Cannon introduced decimal subdivi- 
sions of the spectral types, and, in the four years from 1911 she classified 
~ 225000 stars on this system — these classifications were published as the 
classical Henry Draper Catalog [Cannon & Pickering (1918-1924), Cannon 
(1925-1936), Cannon & Mayall (1949) - see Appendix B]. 

From the observed variation of ratios of line strengths of successive ion- 
ization stages of the chemical elements, and from photometric data, this 
spectral sequence, O, B, A, F, G, K, M, was later recognized to be primarily 
a temperature sequence, listed here in order of decreasing temperature. At 
the cool end of the sequence, additional types R, N, and S were added to 
describe stars in the K-M temperature range that have markedly different 
compositions. Finally, with the advent of M. Saha’s ionization theory in 
1920 (Saha 1920-1921), quantitative analysis became possible, and in 1925 
a comprehensive theoretical interpretation of the Harvard spectral sequence 
appeared in Cecilia Payne’s book Stellar Atmospheres (Payne 1925). 

With the work of E. Hertzsprung and H.N. Russell in the years from 
1905 to 1913, it became evident that stars of a given spectral type could have 
vastly different luminosities (see §3.5), and this fact implied that they could 
have markedly different radii and atmospheric densities. The most common, 
fainter stars are called dwarfs; brighter, larger stars are called giants; and 
the brightest, largest stars are called supergiants. The effects of differing 
stellar sizes are reflected directly in changes in the spectrum, and therefore 
a second parameter is required to describe the spectrum completely. A start 
in this direction was made in the 1890s by Antonia Maury at Harvard, who 
added the symbols a, b, or c to some spectral types; we now know that her 
type c corresponds to supergiants and the others correspond to dwarfs. 


The MK system  A fully two-dimensional system, the MK system,
was developed by Morgan, Keenan & Kellman (1943). In addition to a
spectral class, they assigned stars to one of five luminosity classes, which
were labeled by Roman numerals I–V. In physical terms, a star’s luminosity
class reflects the star’s envelope size and atmospheric density and hence
correlates with the star’s surface gravity. The luminosity class is closely
correlated with stellar luminosity. The MK system, with subsequent revisions
and extensions, has become the standard classification system in use today.

The basic precepts of the MK system are: 

1. It is empirical; only directly observable features of the spectrum are 
used to determine a star’s classification. 

2. It is based on homogeneous material. It uses spectra of a standard 
dispersion that is high enough to provide sensitive criteria but low 
enough to allow one to reach faint stars, and hence ones at large 
distances in the Galaxy. 

3. It is defined by standards. Thus the classification system is au- 
tonomous, in the sense that it remains unchanged even when the 
interpretation of the types in terms of physical conditions in the 
stars changes, as models of stellar structure are refined. Further- 
more, observers using different spectrographs and dispersions can 
classify on the same system simply by reobserving the standard 
stars with their own particular equipment. 

The spectral types of the MK system are essentially those of the Harvard 
sequence, and some of the principal spectral features characterizing each of 
these types are listed in Table 3.1. The luminosity classes, and the stars to 
which they pertain, are listed in Table 3.2. The characteristics mentioned 
in these lists are only illustrative; the system is defined by standard stars. 
The complete spectral type is specified by both the spectral type and the 
luminosity class of a star as determined by comparison with the standards. 
In §3.5 we give detailed tables of the stellar physical properties (for example, 
temperatures and luminosities) that have been associated with MK spectral 
class through astrophysical calibrations. 

Spectral types are subdivided into decimal subtypes, running from 0 at
the hot end through 9 at the cool end: B0, B1, B2, ..., B9; A0, A1, A2, ...,
A9; F0, F1, F2, ..., F9, and so on. The luminosity classes are usually not
subdivided except for supergiants (Table 3.2). Examples of spectral types
are: Sun (G2 V), ε Ori (B0 Ia), α Lyr (A0 V), α Tau (K5 III). Stars hotter
than the Sun (types O, B, A, F) are commonly called early types, and
solar-type and cooler stars (types G, K, M) are called late types.¹ Although
it is explicitly two-dimensional, the MK system implies the existence of, 
and the need for, formal consideration of at least one more dimension in 
localized regions of the system in order to describe, say, weak-lined stars 
(for example, the ‘subdwarf’ HD 140283 or the variable star RR Lyrae) or 
peculiar stars (see the following discussion). 
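
The notation is regular enough to be split mechanically into its parts. The following Python sketch is one possible parser, not a standard tool: the class `MKType`, the function `parse_mk` and the exact set of accepted labels are assumptions made here for illustration, and composite or uncertain types found in real catalogs (for example ranges or types with colons) are deliberately not handled.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Class letter, optional decimal subtype, optional luminosity class (Table 3.2),
# optional lower-case suffix letters such as p, e, n.
_MK_PATTERN = re.compile(
    r"^(?P<cls>[OBAFGKMCS])"
    r"(?P<sub>\d(?:\.\d)?)?"
    r"\s*(?P<lum>Ia-0|Iab|Ia|Ib|III|II|IV|V)?"
    r"(?P<suffix>[a-z]*)$"
)


@dataclass(frozen=True)
class MKType:
    spectral_class: str
    subtype: Optional[float]
    luminosity_class: Optional[str]
    suffix: str

    @property
    def is_early(self) -> Optional[bool]:
        """True for types O, B, A, F; False for G, K, M; None otherwise."""
        if self.spectral_class in "OBAF":
            return True
        if self.spectral_class in "GKM":
            return False
        return None


def parse_mk(label: str) -> MKType:
    """Split a label such as 'G2V' or 'B0 Ia' into its components."""
    match = _MK_PATTERN.match(label.strip())
    if match is None:
        raise ValueError(f"not a recognizable MK type: {label!r}")
    sub = match.group("sub")
    return MKType(
        spectral_class=match.group("cls"),
        subtype=float(sub) if sub is not None else None,
        luminosity_class=match.group("lum"),
        suffix=match.group("suffix"),
    )


for name, label in [("Sun", "G2V"), ("eps Ori", "B0Ia"),
                    ("alpha Lyr", "A0V"), ("alpha Tau", "K5III")]:
    print(name, parse_mk(label))
```

The luminosity classes are listed longest first in the pattern so that, for instance, `Iab` is not read as `Ia` followed by a suffix `b`.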


¹ These designations are archaic remnants of an obsolete scheme of stellar evolution.
Although devoid of physical significance, they are universally used by astronomers.

Table 3.1 Principal characteristics of spectral types

Spectral type   Spectral features

O               He II lines visible; lines from highly ionized species, for
                example, C III, N III, O III, Si IV; H lines relatively weak;
                strong UV continuum

B               He I lines strong, attain maximum at B2; He II lines absent;
                H lines stronger; lower-excitation ions, for example,
                C II, O II, Si III

A               H lines attain maximum strength at A0 and decrease towards
                later types; Mg II, Si II strong; Ca II weak and
                increasing in strength

F               H weaker, Ca II stronger; lines of neutral atoms and first
                ionization states of metals appear prominently

G               Solar-type spectra; Ca II lines extremely strong; neutral
                metals prominent, ions weaker; G band (CH) strong; H
                lines weakening

K               Neutral metallic lines dominate; H quite weak; molecular
                bands (CH, CN) developing; continuum weak in blue

M               Strong molecular bands, particularly TiO; some neutral
                lines, for example, Ca I, quite strong; red continuum

C               Carbon stars; strong bands of carbon compounds C₂, CN,
                CO; TiO absent; temperatures in range of types K and M

S               Heavy-element stars; bands of ZrO, YO, LaO; neutral
                atoms strong as in types K and M; overlaps these types
                in temperature range


Table 3.2 MK luminosity-class designations

Ia-0   Most extreme supergiants
Ia     Luminous supergiants
Iab    Moderate supergiants
Ib     Less-luminous supergiants
II     Bright giants
III    Normal giants
IV     Subgiants
V      Dwarfs

Table 3.3 Distribution of stellar
types in HD catalog (V < 8.5) 


Spectral type Percent 


1 
10 
22 

Table 3.4 Distribution of MK classes of
apparently bright stars (%)


Spectral     Luminosity class
type         V   IV   III   II   I


More than 90% of all stars can be classified with standard MK spectral 
classes. The relative numbers of stars in different spectral types in the HD 
Catalog are listed in Table 3.3, and a similar distribution for MK classes is 
given in Table 3.4. Both of these tables refer to apparently bright stars, and 
they do not reflect true space densities, which are discussed in §3.6. 

Additional letters may be appended to spectral types to denote special
characteristics; for example, p (peculiar), e (emission lines), f (He II and N III
emission in O stars), and n (broad lines). Some stars have special notations.
Some examples are peculiar A stars (Ap), which show strong lines of
certain elements (Si, Mn, Cr, Sr, Eu) and have strong magnetic fields, and
metallic-line A stars (Am), which show abnormally strong metal lines²
and weak Ca II. The prefix w or D is often used to denote white dwarfs (wA,
wG, DA, and so on), and the prefix sd denotes subdwarfs, extremely metal- 
poor dwarfs (for example, sdG). In some work, particularly at low dispersion, 
it is not possible to derive accurate MK luminosity classes, although it is 
possible nevertheless to distinguish among dwarfs, giants, and supergiants; 
this is of interest because we can then assign intrinsic brightnesses to the 
stars. In such cases, the prefixes d, g, and c are often used to denote these 
three groups (for example, dK, gK, cA). 


² Astronomers often refer to all elements heavier than helium as metals. In particular,
carbon and oxygen are referred to as metals.

Additional spectral types are used for certain unusual stars: type C
(which replaces Harvard types R and N) for carbon stars, whose spectra
show strong bands of C₂ and CN; S for stars in whose spectra bands of
heavy-element molecules such as ZrO are more prominent than the usual TiO
bands; and WC and WN (or sometimes WR) for Wolf-Rayet stars.

Certain stars do not fit into the MK classification but nevertheless fall 
into classes. There is no systematic way of naming these classes; sometimes 
they are named after a prototypical star, sometimes after the authors of a 
catalog which identified early examples. Table 3.5 lists some of the more 
important such classes. 

The standard stars that define the MK system are given in Johnson &
Morgan (1953), Abt et al. (1969), Morgan & Keenan (1973), and Morgan, Abt
& Tapscott (1978). A fairly large number of
stars have been classified on this system, and extensive lists can be found in 
the literature (see Appendix B). A reworking of the HD Catalog has for many 
years been under way at the University of Michigan Observatory, using high- 
quality objective prism plates (Houk 1975-). This survey will yield almost a 
full two-dimensional classification, very nearly on the MK system, for most 
stars in the HD catalog. When completed, it will have enormous value for 
studies of galactic structure. 


3.4 Physical interpretation of stellar spectra 


The structure of a stellar atmosphere, and therefore the spectral energy 
distribution emitted by a star, is determined essentially by three atmospheric 
parameters: 
(1) The effective temperature,

\[ T_{\rm eff} = \left(\frac{L}{4\pi R^2\sigma}\right)^{1/4}, \qquad (3.15) \]

where σ is the Stefan-Boltzmann constant, L the total luminosity and
R the radius of the star. T_eff specifies the amount of energy passing
outward through unit area of the surface. Since the energy distributions
of stars are very roughly Planckian, T_eff is a measure of the temperature
of the gas that forms the bulk of the stellar atmosphere. Table 3.7 gives
values of T_eff as a function of MK spectral class.
(2) The surface gravity

\[ g = \frac{GM}{R^2}, \qquad (3.16) \]

where G is the gravitational constant and M is the mass of the star.
The surface gravity fixes the pressure gradient in the atmosphere and
largely determines the densities at which spectral lines are formed.


Table 3.5 Classes of unusual stars

Cepheids
  Defining characteristic: Massive, luminous, variable stars with characteristic
    asymmetric light curves, periods 1–10 days and F-type spectra.
  Remarks: Sometimes called δ Cepheids or classical Cepheids to distinguish
    them from W Virginis stars (see below). Their periods, luminosities and
    colors satisfy a period-luminosity-color relationship that plays an
    important role in determining the cosmic distance scale; see §§5.1.10
    and 7.3.1. Their initial masses exceed ~ 5 M⊙ and their ages are < 0.1 Gyr.

Miras
  Defining characteristic: Very red variable stars with periods 80–1000 d and
    amplitudes up to 7 mag. Most show emission lines.
  Remarks: Stars near their deaths. Satisfy a period-luminosity relation;
    typical M_bol = −5. Frequently surrounded by shells of gas detectable in
    the infrared and through mm emission by CO.

OH/IR stars
  Defining characteristic: Stars with strong line emission at 1612 MHz from
    masing OH radicals. Also luminous mid-infrared emitters.
  Remarks: Luminous end of the population of Miras. OH/IR emission comes
    from gas that has been ejected by the star. The ejection velocity of the
    gas can be measured and used to estimate the age of the star.

RR Lyrae
  Defining characteristic: Variable stars with characteristic asymmetric light
    curves showing periods ≲ 1 d, peak luminosity M_V ~ 0 and A-type spectra.
  Remarks: Stars of roughly solar mass. Occur in globular clusters and in
    other low-metallicity systems.

T-Tauri stars
  Defining characteristic: Irregular variable stars with Balmer lines and Ca
    H and K in emission. Frequently show Li I absorption. Spectral types F–M.
  Remarks: Pre-main-sequence stars (§5.1.8), perhaps with accretion disks.

Wolf-Rayet stars
  Defining characteristic: Luminous O or B stars showing strong broad emission
    lines with P-Cygni profiles (Figure 3.4). Also lines of highly ionized
    species.
  Remarks: Usually very massive, short-lived stars. WN stars show strong
    emission in lines of nitrogen, while carbon lines are prominent in the
    spectra of WC stars.

W Virginis stars
  Defining characteristic: Old variable stars with periods 7–60 d.
  Remarks: Also called Type II Cepheids; they are > 1.5 mag fainter than
    Cepheids at a given period.

Dwarf novae
  Defining characteristic: Short-period binary systems subject to occasional
    eruptions that brighten the system by < 6 mag in < 5 d. SS Cyg is the
    prototype. Quiescent G-type spectra.
  Remarks: Systems contain a white dwarf and a late-type star. Eruptions
    caused by mass being transferred on to the white dwarf.

Novae
  Defining characteristic: Star whose brightness suddenly increases by
    7–16 mag.
  Remarks: Close binary systems containing a cool red giant and a hot,
    less-massive companion that accretes material, which then feeds
    explosive nuclear burning. Novae expel gas at ≲ 3000 km s⁻¹.

Supernovae
  Defining characteristic: Violently exploding stars. Type I supernovae reach
    M_V ~ −19 and expel metal-rich gas at ~ 10 000 km s⁻¹. Type II supernovae
    reach M_V ~ −17 and expel hydrogen-rich gas at ~ 5000–20 000 km s⁻¹.
  Remarks: The most luminous of all stars, they are potentially important
    for determining the cosmic distance scale. The principal suppliers of
    heavy elements (~ 0.6 M⊙ of Fe per Type I supernova) and energy
    (~ 10⁴⁴ J per supernova) to the interstellar medium.

Low-mass X-ray binaries
  Defining characteristic: X-ray sources with hard spectra and low
    luminosities.
  Remarks: Short-period systems consisting of a compact object (usually
    a neutron star) and a conventional low-mass star that fills its
    Roche lobe. The X-ray emission is powered by accretion on to
    the compact object of material from the conventional star.

High-mass X-ray binaries
  Defining characteristic: X-ray sources with soft spectra and high
    luminosities.
  Remarks: Longer-period systems consisting of a compact object (usually
    a neutron star) and an early-type star. The X-ray emission is
    powered by accretion on to the compact object of material from
    the star.

Table 3.6 V − K versus T_eff for dwarfs and late-type
giants


Dwarfs Giants 


SOURCE: Dwarfs: data published in Peletier (1989); gi- 
ants: data published in Ridgway et al. (1980) 


(3) The chemical composition of the atmosphere, which determines the fre- 
quency variation of the opacity of the stellar material, and the relative 
strengths of spectral lines of the different atomic species present. We 
shall see that, to a first approximation, the chemical composition of a
star in the solar neighborhood may be characterized by quoting for it
the abundance relative to hydrogen of any heavy element, such as Fe,
because stars rich in Fe tend to be rich in other elements heavier than
helium. On account of this phenomenon, one often speaks of the metallicity
of a star as a shorthand for the complete chemical composition.


These atmospheric parameters play key roles in the theory of stellar structure
summarized in Chapter 5, so their empirical determination is of considerable 
interest. 
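
To give a feel for the sizes of the first two parameters, the Python sketch below evaluates the effective temperature and the surface gravity from a star's luminosity, mass and radius. The helper names are assumptions made for illustration, the solar constants are nominal values, and the "giant" is a hypothetical star of one solar mass and 25 solar radii chosen only to show how strongly g depends on radius.

```python
import math

G_NEWTON = 6.67430e-11      # gravitational constant, m^3 kg^-1 s^-2
SIGMA_SB = 5.670374419e-8   # Stefan-Boltzmann constant, W m^-2 K^-4
M_SUN = 1.98841e30          # solar mass, kg
R_SUN = 6.957e8             # nominal solar radius, m
L_SUN = 3.828e26            # nominal solar luminosity, W


def effective_temperature(luminosity_w, radius_m):
    """T_eff = (L / (4 pi R^2 sigma))^(1/4), in kelvin."""
    return (luminosity_w / (4.0 * math.pi * radius_m ** 2 * SIGMA_SB)) ** 0.25


def surface_gravity(mass_kg, radius_m):
    """g = G M / R^2, in m s^-2."""
    return G_NEWTON * mass_kg / radius_m ** 2


def log_g_cgs(mass_kg, radius_m):
    """log10 of g in cm s^-2, the form in which gravities are usually quoted."""
    return math.log10(100.0 * surface_gravity(mass_kg, radius_m))


teff_sun = effective_temperature(L_SUN, R_SUN)
logg_sun = log_g_cgs(M_SUN, R_SUN)
logg_giant = log_g_cgs(M_SUN, 25.0 * R_SUN)   # hypothetical giant, illustrative only

print(round(teff_sun), round(logg_sun, 2), round(logg_giant, 2))
```

A factor of 25 in radius at fixed mass lowers log g by nearly three dex, which is why pressure-sensitive spectral features discriminate so well between dwarfs and giants.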

The relationship between these parameters and the structure of a stellar 
spectrum is complex and can only be determined by large model-atmosphere 
programs. In practice, trial values of the parameters are adopted and used 
to calculate a theoretical spectrum, and then these values are adjusted to 
achieve the optimum fit between theory and observation. Different properties 

Table 3.7 The effective-temperature and bolometric-correction scales

                              Luminosity class
Spectral          V                  III                  I
type       T_eff/K   BC_V     T_eff/K   BC_V     T_eff/K   BC_V
O3          52500   −4.75      50000   −4.58      47300   −4.41
O5          44500   −4.40      42500   −4.05      40300   −3.87
O7          38000   −3.68      37000   −3.58      35700   −3.48
O9          33000   −3.33      32000   −3.13      32600   −3.18
B0          30000   −3.16      29000   −2.88      26500   −2.49
B2          22000   −2.35      20300   −2.02      18500   −1.58
B3          18700   −1.94      17100   −1.60      16200   −1.26
B5          15400   −1.46      15000   −1.30      13600   −0.95
B7          13000   −1.02      13200   −0.97      12200   −0.78
B8          11900   −0.80      12400   −0.82      11200   −0.66
A0           9520   −0.30      10100   −0.42       9730   −0.41
A5           8200   −0.15       8100   −0.14       8510   −0.13
F0           7200   −0.09       7150   −0.11       7700   −0.01
F5           6440   −0.14       6470   −0.14       6900   −0.03
G0           6030   −0.18       5850   −0.20       5550   −0.15
G2           5860   −0.20       5450   −0.27       5200   −0.21
G5           5770   −0.21       5150   −0.34       4850   −0.33
K0           5250   −0.31       4750   −0.50       4420   −0.50
K5           4350   −0.72       3950   −1.02       3850   −1.01
M0           3850   −1.28       3800   −1.25       3650   −1.29
M5           3240   −2.73       3330   −2.48       2800   −3.47
M8           2640   −4.1


SOURCE: From data published in Schmidt-Kaler (1982)



of a spectrum are sensitive to different atmospheric parameters. For example,
the Balmer lines are particularly pressure sensitive and provide insight into
the value of g. A good first estimate of T_eff is obtainable from a color index;
V − K is the favored index for all but the hottest stars on account of its
long baseline in frequency and its avoidance of the blue-ultraviolet part of
the spectrum that contains many spectral lines and is therefore sensitive
to metallicity. Table 3.6 lists the temperatures of dwarfs and giants as a
function of V − K. Clearly, the relative abundances of elements are directly
reflected in the relative strengths of their spectral lines.
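
When only a spectral type is known, a first guess at T_eff can be read from Table 3.7. The table lists selected subtypes only, so the Python sketch below interpolates linearly between neighboring entries of the dwarf column. Both the linear interpolation and the numerical encoding of the types (O = 0, B = 10, ..., M = 60, plus the decimal subtype) are assumptions made here for illustration, not part of the published calibration.

```python
from bisect import bisect_right

_CLASS_OFFSET = {"O": 0, "B": 10, "A": 20, "F": 30, "G": 40, "K": 50, "M": 60}

# Dwarf (luminosity class V) column of Table 3.7: spectral type and T_eff in K.
_DWARF_TEFF = [
    ("O3", 52500), ("O5", 44500), ("O7", 38000), ("O9", 33000),
    ("B0", 30000), ("B2", 22000), ("B3", 18700), ("B5", 15400),
    ("B7", 13000), ("B8", 11900), ("A0", 9520), ("A5", 8200),
    ("F0", 7200), ("F5", 6440), ("G0", 6030), ("G2", 5860),
    ("G5", 5770), ("K0", 5250), ("K5", 4350), ("M0", 3850),
    ("M5", 3240), ("M8", 2640),
]


def type_to_number(spectral_type):
    """Map e.g. 'G2' to 42.0 so that types can be ordered and interpolated."""
    return _CLASS_OFFSET[spectral_type[0]] + float(spectral_type[1:])


_X = [type_to_number(s) for s, _ in _DWARF_TEFF]
_Y = [float(t) for _, t in _DWARF_TEFF]


def dwarf_teff(spectral_type):
    """Linearly interpolated T_eff (K) for a dwarf of the given spectral type."""
    x = type_to_number(spectral_type)
    if not _X[0] <= x <= _X[-1]:
        raise ValueError("spectral type outside the tabulated range O3 to M8")
    i = min(bisect_right(_X, x), len(_X) - 1)   # index of the upper bracketing entry
    x0, x1, y0, y1 = _X[i - 1], _X[i], _Y[i - 1], _Y[i]
    return y0 + (y1 - y0) * (x - x0) / (x1 - x0)


print(dwarf_teff("G2"), dwarf_teff("K2"))
```

The interpolated values carry at least the uncertainty of the tabulated calibration itself, so they should be treated as starting guesses for a fit rather than as measurements.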

Unfortunately, a spectrum at good dispersion with high signal-to-noise 
is required if reliable results are to be obtained by fitting observed and theo- 
retical spectra in this way. Such spectra are only available for a small number 
of apparently bright stars. The best available spectra of the vast majority 
of astronomical objects are full of features that are pure noise, and a fit of a 
theoretical spectrum to such a noisy spectrum would be heavily influenced 

Table 3.8 Spectral indices defined by Faber et al. (1985)

Index   Feature               Central band (nm)     Side bands (nm)

G       CH                    428.325–431.700       426.825–428.325
                                                    432.075–433.575
Mg b    Mg b                  516.200–519.325       514.450–516.200
                                                    519.325–520.700
Fe₁     Fe⁰, Ca⁰              524.800–528.675       523.550–524.925
                                                    528.800–531.925
Fe₂     Fe⁰, Cr⁰, Ca⁰, Ti⁺    531.475–535.350       530.725–531.725
                                                    535.600–536.475
Na      Na D                  587.920–591.050       586.300–587.675
                                                    592.450–594.925
Hβ      Hβ                    484.950–487.700       482.950–484.825
                                                    487.825–489.200

CN      CN                    414.400–417.775       408.200–411.825
                                                    424.600–428.475
Mg₁     MgH                   507.100–513.475       489.700–495.825
                                                    530.300–536.675
Mg₂     MgH, Mg b             515.600–519.750       489.700–495.825
                                                    530.300–536.675
TiO₁    TiO                   593.900–599.525       581.900–585.025
                                                    604.100–610.475
TiO₂    TiO                   619.200–627.325       606.900–614.275
                                                    637.500–641.625

NOTES: Indices in the first group have the units of equivalent
widths (see Figure 3.3). Indices in the second group are measured
in magnitudes.


by chance fits of spectral features to noise features. Since the results of such
a fit are of little value, a less direct procedure is adopted for the measurement
of the atmospheric parameters of faint objects.

The first step is to discover by experimenting with theoretical models 
which relatively strong lines and line-ratios are sensitive to the atmospheric 
parameters. Next, one designs an index that quantifies each such interest- 
ing spectral feature. In designing this index, one aims to obtain a number 
which can be reliably estimated from a low-dispersion, high signal-to-noise 
spectrum, but which is nevertheless sensitive to the structure of the under- 
lying spectrum. Table 3.8 defines the widely employed spectral indices of 
Faber et al. (1985), while several other important indices are defined in Rose 
(1994) and Jones & Worthey (1995). Each index is defined by a central band
of width Δ₀ and two side bands. The ‘side-band level’ I_s is defined to be the
mean intensity over the two side bands, while the intensity of the central feature
I_c is the mean intensity in the central band. The values of the first group
of indices in Table 3.8 are reported as equivalent widths: w = (1 − I_c/I_s)Δ₀
(Figure 3.3). The values of indices in the second group are reported in magnitudes:
−2.5 log₁₀(I_c/I_s). All of these indices are moderately sensitive to
the temperature of the emitting star and the first step in the interpretation
of a set of indices is to estimate the value that each index would have if
the star had some standard V − K color rather than the measured value of
V − K. Also, the measured values of a star’s indices depend slightly on the
spectral resolution employed and should first be corrected to some standard
spectral resolution. Some of the corrected indices are primarily sensitive to
the star’s surface gravity g, while others are sensitive to its metallicity. The
primary metallicity indicator is


\[ \langle{\rm Fe}\rangle = \tfrac{1}{2}\left({\rm Fe}_1 + {\rm Fe}_2\right), \qquad (3.17) \]


where the values on the right are corrected ones. 
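
The band-based definitions translate directly into code. The Python sketch below measures an index from a sampled spectrum, taking the side-band level I_s as the mean over all samples in the two side bands and I_c as the mean over the central band. Several things here are assumptions rather than the published procedure: the function names, uniform wavelength sampling, the conventional magnitude form −2.5 log₁₀(I_c/I_s), and the synthetic test spectrum. The Mg b band limits are those of Table 3.8.

```python
import math


def _band_values(wavelengths, fluxes, band):
    """Flux samples whose wavelength lies inside the closed interval `band`."""
    lo, hi = band
    return [f for w, f in zip(wavelengths, fluxes) if lo <= w <= hi]


def spectral_index(wavelengths, fluxes, central, side_bands, in_magnitudes=False):
    """Index from mean intensities in a central band and two side bands.

    Returns (1 - I_c/I_s) * width of the central band by default, or
    -2.5 log10(I_c/I_s) when in_magnitudes is True.
    """
    centre_vals = _band_values(wavelengths, fluxes, central)
    side_vals = []
    for band in side_bands:
        side_vals.extend(_band_values(wavelengths, fluxes, band))
    if not centre_vals or not side_vals:
        raise ValueError("spectrum does not cover the index bands")
    i_c = sum(centre_vals) / len(centre_vals)
    i_s = sum(side_vals) / len(side_vals)
    if in_magnitudes:
        return -2.5 * math.log10(i_c / i_s)
    delta_0 = central[1] - central[0]
    return (1.0 - i_c / i_s) * delta_0


def mean_fe(fe1, fe2):
    """<Fe> = (Fe1 + Fe2) / 2, applied to corrected index values."""
    return 0.5 * (fe1 + fe2)


# Mg b bands from Table 3.8, in nm.
MGB_CENTRAL = (516.200, 519.325)
MGB_SIDES = [(514.450, 516.200), (519.325, 520.700)]

# Synthetic spectrum: flat continuum with a 10% depression in the central band.
wl = [514.0025 + 0.01 * i for i in range(700)]
fx = [0.9 if MGB_CENTRAL[0] < w < MGB_CENTRAL[1] else 1.0 for w in wl]

w_mgb = spectral_index(wl, fx, MGB_CENTRAL, MGB_SIDES)
m_mgb = spectral_index(wl, fx, MGB_CENTRAL, MGB_SIDES, in_magnitudes=True)
print(w_mgb, m_mgb)
```

Because the index depends only on band averages, it remains well defined at low dispersion, which is the property that makes such indices usable for faint objects.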

Next one determines the values of each index for a sample of apparently 
bright stars. A good-quality spectrum should be available for each of these 
stars so that its atmospheric parameters can be determined from a detailed 
spectral fit as described above, but the indices are measured for these stars 
using the same dispersions and equipment as are used to observe the faint 
program objects. This step enables one to assert that, say, the metallicity of 
this faint star is similar to that of this bright and well-studied star, because 
the two stars have similar values of all metallicity-sensitive spectral indices. 

The final step is to estimate the atmospheric parameters of each of the 
bright reference stars from a detailed fit of a high-quality spectrum with a 
theoretical spectrum. As more complete and reliable atomic-physics data be- 
come available and model-atmosphere computer codes become more sophis- 
ticated, the best available values of the atmospheric parameters of bright 
stars are constantly being revised. The scheme we have just outlined, in 
which faint objects are linked to bright stars by means of spectral indices, 
makes it possible to update the atmospheric parameters of faint stars along 
with those of bright stars without having to re-observe them. 

Hundreds of spectroscopic analyses of stellar spectra have been per- 
formed. The results are most reliable for stars of types A—K; the optical 
spectra of O and B stars have few strong metal lines because in their hot 
atmospheres the heavy elements are highly ionized, while the spectra of M 
stars show strong absorption by molecules, which is difficult to model
theoretically. To give some idea of the accuracy that can be achieved, a spectrum
of a G-dwarf at ~ 2 nm resolution typically yields uncertainties in $T_{\rm eff}$, g and
metallicity of 20 K, 0.03 dex and 0.02, respectively;³ if only Strömgren
photometry is available, the uncertainties in each quantity are larger by a factor
~ 2.5 (Cayrel 1992).

3 The notation dex indicates an increment in the logarithm to base 10.

Figure 3.4 A P-Cygni line profile is characterized by an absorption trough
on the blue wing of an emission line.

In the Sun the abundances by number of various elements relative to
hydrogen are roughly as follows: helium, 0.10; carbon, $3\times10^{-4}$; nitrogen,
$1\times10^{-4}$; oxygen, $6\times10^{-4}$; neon, $1\times10^{-4}$; magnesium, $3\times10^{-5}$; silicon,
$3\times10^{-5}$; and iron, $4\times10^{-5}$. Element abundances in other stars are generally
reported relative to these abundances. For example, the “iron abundance,”
denoted [Fe/H], is defined to be

$$[\mathrm{Fe/H}] = \log_{10}\left(\frac{n(\mathrm{Fe})}{n(\mathrm{H})}\right)_{\rm star} - \log_{10}\left(\frac{n(\mathrm{Fe})}{n(\mathrm{H})}\right)_{\odot}. \tag{3.18}$$

Similarly, the “relative abundance” of an element other than iron is often 
expressed in terms of a symbol such as [O/Fe], which is defined by 


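$$[\mathrm{O/Fe}] = \log_{10}\left(\frac{n(\mathrm{O})}{n(\mathrm{Fe})}\right)_{\rm star} - \log_{10}\left(\frac{n(\mathrm{O})}{n(\mathrm{Fe})}\right)_{\odot}. \tag{3.19}$$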


[Fe/H] varies from as low as $\lesssim -3$ to $> 0$. Ratios such as [O/Fe] cover a
much smaller range: that is, stars poor in Fe are generally also poor in other 
metals. Consequently it is useful to define parameters (X, Y, Z), which give 
the fractional abundances by weight of hydrogen, helium, and everything 
else, respectively. For the solar mixture, X = 0.70, Y = 0.28, and Z = 0.02. 
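
As a quick illustration of the bracket notation of equations (3.18) and (3.19), the following Python sketch converts number-abundance ratios into a logarithmic abundance relative to the Sun. The function name bracket and the example star are hypothetical; the solar iron abundance is the rough value quoted above.

```python
import math

# Rough solar abundance of iron by number relative to hydrogen (from the text).
SOLAR_FE_H = 4e-5

def bracket(ratio_star, ratio_sun):
    """Logarithmic abundance of a star relative to the Sun, in dex.

    Both arguments are number-density ratios such as n(Fe)/n(H).
    """
    return math.log10(ratio_star) - math.log10(ratio_sun)

# Hypothetical metal-poor star with one hundredth of the solar iron abundance.
feh = bracket(4e-7, SOLAR_FE_H)
print(f"[Fe/H] = {feh:+.2f}")
```

By construction a star with solar abundances has [Fe/H] = 0, and each factor of ten in n(Fe)/n(H) changes [Fe/H] by 1 dex.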

A small number of stars show anomalous line strengths that may be
indicative of unusual atmospheric abundances. Examples include Ap and Am
stars, carbon stars, S stars, and Wolf-Rayet stars (§3.3). Some of these anomalous
line strengths have been theoretically traced to credible abundance anoma- 
lies. Others remain puzzling and it is unclear whether they are simply due to 
anomalous abundances or will be eventually explained by more sophisticated 
model atmospheres (which include, for example, the effects of magnetic fields 
and stellar rotation) in terms of normal abundances. 

The shape of a spectral line, in addition to its strength, can convey 
important astrophysical information. A case in point is that of lines that 
have a P-Cygni profile. P Cygni is a 5th-magnitude B star whose spectrum 
contains strong emission lines that go over into absorption at their blue edges 
(Figure 3.4). The physical interpretation of this phenomenon is that along 
the line of sight to the star, there is matter flowing away from the star and 
towards us. This material absorbs radiation in the same spectral line that 
the star emits, but this absorption is blueshifted relative to the star. An 
analogous phenomenon is observed in many stars, and is usually interpreted 
as evidence that the star is losing mass in a strong stellar wind.

Figure 3.5 The CM-diagram for 10793 stars with good Hipparcos parallaxes.
The great majority of stars fall along the MS that runs diagonally from
bottom right to top left. The subgiant, red giant, and white-dwarf sequences
are also apparent, as is the red clump. The MS and WD stars were selected to
have parallaxes with errors smaller than 10%, while the giants were chosen to
have parallaxes in error by < 20%. [After Perryman et al. (1995), courtesy of
M. Perryman] Axes: B − V (mag) horizontally, absolute magnitude M(Hp)
vertically.


3.5 Color-magnitude diagrams 


Unquestionably the most important correlation among stellar properties
discovered to date is the Hertzsprung-Russell diagram (HR-diagram),
which was developed independently by the Dane E. Hertzsprung in 1911 and 
the American H. N. Russell in 1913. The HR-diagram has been of profound 
importance to the development of our understanding of stellar evolution; it 
provides both one of the most stringent tests of the theory of stellar evo- 
lution, and one of the most incisive tools for exploring the history of the 
Galaxy as a whole. 

In its original form, the HR-diagram was a plot of absolute visual magni- 
tude versus spectral type, but variants are now more commonly used in which 
the discrete spectral type is replaced by a continuous coordinate. Observa- 
tionally, the most useful form is the color-magnitude (CM) diagram, 
which is a plot of a color versus a magnitude, either absolute (for stars of 
known distances) or apparent (if the stars are known to be all at the same 
distance). Figure 3.5, in which each point represents a nearby field star of 
known distance, is an example of this genre. Insofar as stars of a given MK 
spectral class have a unique color, a CM-diagram is equivalent to the classical 
HR-diagram. 

From a theoretical point of view, the most convenient form of the
HR-diagram is a plot of $T_{\rm eff}$ versus $\log(L/L_\odot)$; this is usually called the
theoretical CM-diagram. Insofar as all stars of a given MK spectral class have
the same luminosity and effective temperature, the theoretical CM-diagram
is equivalent to both a classical HR-diagram and a CM-diagram. However,
to make a quantitative correspondence between the theoretical CM-diagram
and a CM-diagram, we must know the relations $T_{\rm eff}$(MK spectral class) and
BC(MK spectral class) with good accuracy. Establishing this mapping
between the two diagrams remains a significant astrophysical problem (§5.1.1).


3.5.1 Observed CM-diagrams 


Figure 3.5 shows the CM-diagram for nearly 11000 stars with Hipparcos 
parallaxes. Clearly the stars are not scattered at random in this figure, but 
fall into several distinctive groups or sequences. The vast majority of all 
stars lie along the main sequence (MS), which stretches from luminous, 
hot, blue, O stars to faint, cool, red, M stars; these stars are the dwarfs 
(MK luminosity class V). The subgiant branch (SGB) joins the MS at 
$B-V \simeq 0.7$ and $M_{Hp} \simeq 4$ and extends horizontally to $B-V \simeq 1$, where the
populated region turns sharply upwards into the red-giant branch (RGB).
The RGB stretches from $B-V \simeq 1$, at a luminosity about thirty times
(3.7 mag) brighter than MS stars of the same type, toward cooler and brighter
stars. These stars are the red giants; they correspond to MK luminosity class
III. The strong enhancement in the density of stars near $B-V = 1.1$ and
$M_{Hp} = 1$ is known as the red clump. Centered on $B-V \simeq 0$, $M_{Hp} \simeq 12$
we find a few faint, hot stars known as white dwarfs. The region between
the MS and the RGB at $M_{Hp} \simeq 1$ that is almost devoid of stars is known as
the Hertzsprung gap.
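
The statement that a factor of thirty in luminosity corresponds to about 3.7 mag can be checked with the standard magnitude scale, on which a factor of 100 in flux is exactly 5 mag. The sketch below is illustrative only; the function names are hypothetical.

```python
import math

def magnitude_difference(luminosity_ratio):
    """Magnitude difference corresponding to a given luminosity ratio."""
    return 2.5 * math.log10(luminosity_ratio)

def luminosity_ratio(delta_mag):
    """Luminosity ratio corresponding to a given magnitude difference."""
    return 10.0 ** (0.4 * delta_mag)

# A star thirty times more luminous than an MS star of the same color.
print(f"{magnitude_difference(30.0):.2f} mag")
```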

The difficult step in the construction of Figure 3.5 is the determination 
of the distance to each star that is plotted, in order to calculate the star’s 
absolute magnitude $M_{Hp}$ from its apparent magnitude. Since even Hipparcos
data only allow us to determine the distances to relatively nearby stars, it 
should not be surprising if some of the more unusual classes of star are 
unrepresented in this plot. 

Our knowledge of the CM-diagram can be rounded out by studying 
star clusters. As we shall see in Chapter 6, the Galaxy contains many such 
agglomerations of stars, which include a rich variety of stellar types. In 
general, we will not know the distance to a cluster, so we cannot plot its 
members on an absolute-magnitude scale as a CM diagram such as Figure 3.5 
requires. However, most clusters are sufficiently small relative to our distance 
from them that we can assume that all a cluster’s stars lie at the same 
distance. Thus, the difference between absolute and apparent magnitude 
will be the same for every star, so the structures that we have traced in 
Figure 3.5 should be apparent in a plot of apparent magnitude versus color 
for cluster members. In fact, as we show in Chapter 6, features in the CM- 
diagrams of clusters tend to be even more sharply defined than those in 
Figure 3.5. The absence of errors due to uncertainties in the distances of
stars reduces the scatter in the stellar sequences of a cluster compared to
that in the sequences of nearby stars. Further, it turns out that stars in 
clusters have very homogeneous properties, which also helps squeeze cluster 
stars into narrowly defined regions of the CM-diagram. 
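
The argument that a cluster's apparent-magnitude CM-diagram is simply its absolute-magnitude diagram shifted by a constant can be sketched as follows. The cluster distance is hypothetical, the absolute magnitudes are illustrative values, and interstellar extinction is neglected (an assumption of this sketch, not a statement from the text).

```python
import math

def distance_modulus(distance_pc):
    """m - M for a star at the given distance in parsecs, with no extinction."""
    return 5.0 * math.log10(distance_pc) - 5.0

# Hypothetical cluster at 1000 pc with a handful of illustrative members.
cluster_distance_pc = 1000.0
absolute_mags = [1.5, 2.8, 4.7, 6.7]
apparent_mags = [M + distance_modulus(cluster_distance_pc) for M in absolute_mags]

# Every member is offset by the same amount, so the shape of each sequence
# in the CM-diagram is unchanged; only the vertical zero point moves.
offsets = [m - M for m, M in zip(apparent_mags, absolute_mags)]
print(offsets)
```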


3.5.2 Luminosity and color as functions of spectral class 


As its name implies, the MK luminosity class that a star belongs to is an indi- 
cation of the star’s luminosity. The range of luminosities that are associated 
with a given luminosity class varies with spectral type, so the complete MK 
spectral classification of a star is required before one can predict the star’s 
luminosity from its spectrum. Figure 3.6 gives an approximate indication 
of the combined dependence of luminosity on spectral type and luminosity 
class; at a given spectral type, stars of each luminosity class will mostly lie 
within a magnitude or two of the appropriate curve in the figure. 

Since MK spectral type is tightly correlated with B —V color, and MK 
luminosity class is connected with luminosity, the complete MK classification 
of a star gives one a fair idea of where the star lies in the CM diagram. In 
so far as stars are confined to narrow sequences in a CM diagram such as 
Figure 3.5, even a rough prediction of a star’s location in the CM diagram 
enables us to predict its luminosity rather accurately. For example, Figure 3.5 
indicates that a MS star with $B-V = 0$ is likely to have $M_{Hp} \simeq 1.5$, and
in §3.6.4 we shall see that the error on this estimate is smaller than 0.5 mag.
Other cases may be more problematical. A giant star with $B-V = 1$ could lie
on the SGB at $M_{Hp} \simeq 3$, or in the red clump at $M_{Hp} \simeq 1$.⁴ It may or may not
be possible to distinguish these cases on the basis of additional information, 
such as the star’s MK luminosity class (III or IV), its value of a
gravity-sensitive spectral index (§3.4), or its apparent magnitude and likely distance.


4 See Table 2.1 for the definition of Hipparcos magnitudes, $M_{Hp}$.

In other cases an approximate location in the CM diagram does not enable
us to refine our original estimate of the star’s luminosity because the star 
in question is not expected to lie on a narrow sequence in the CM diagram. 
Consider, for example, a supergiant (MK luminosity class I). Figure 3.6 shows 
that supergiants occur over the whole of the upper part of the CM diagram, 
so we cannot argue that a particular object is likely to lie on some narrow 
sequence in the CM diagram, and cannot improve on the estimate of its 
luminosity that follows from its MK spectral class. Hence, the accuracy 
with which luminosity can be inferred from MK class varies considerably 
with location in the CM diagram. Nonetheless, relations between absolute 
magnitude and MK class play an important part in astronomical research. 
Figure 3.6 gives approximate relations for most classes. Tables 3.9 to 3.11 
repeat some of these data in tabular form, as well as specifying relations 
between MK spectral type and color for MK luminosity classes I, III and 
V. It should be borne in mind that Figure 3.6 and Tables 3.9 to 3.11, like 
Figure 3.5, refer to stars of approximately solar metallicity. 

The data used in this section rely on several types of observations. In 
§3.6 we shall find that low-luminosity stars are relatively numerous.
Consequently, spectral types fainter than $M_V \simeq 1$ usually have several nearby
representatives for which accurate trigonometric parallaxes, and hence
distances and luminosities, have been determined. For spectral types brighter
than $M_V \simeq 1$, there is often no representative with a good trigonometric
parallax. Determinations of the luminosities of such spectral types rely heavily
on secular and statistical distances (§2.2). Clusters also play an important
role in the determination of the luminosities of certain spectral types
because they contain both stars whose luminosities can be calibrated locally,
for example from trigonometric parallaxes, and rarer stars that cannot be
studied near the Sun. Since all the stars in a given cluster usually have very 
similar distances, knowledge of the luminosity of any star in the cluster in 
principle yields a good estimate of the luminosity of any other cluster star 
(Chapter 6). Moreover, in §2.2.2 we saw that there is an absolute method 
of determining the distances to, and hence luminosities of, stars in nearby 
clusters. 

The main techniques used for the calibration of various stellar types can 
be summarized as follows: 


1. supergiants: clusters; 
2. O-A stars: clusters, secular and statistical parallaxes;
3. F-M dwarfs: trigonometric parallaxes, moving cluster method; 
4. F-M giants: moving cluster method, clusters, secular and statistical 
parallaxes; 
5. White dwarfs: trigonometric parallaxes, binaries, clusters. 

Special calibration techniques may also be helpful in specific cases. For
example, absolute magnitudes can be estimated for several supergiants which
are components of noninteracting binaries containing another star of known
absolute magnitude (for example, a dwarf), by taking the difference in
apparent magnitudes of the two components to equal the difference in their
absolute magnitudes. Implicit in this method is the assumption that both
stars in the system are normal and unaffected by the presence of the
companion.

Table 3.9 The zero-age main sequence (ZAMS)

(B−V)   (U−B)    M_V   |  (B−V)   (U−B)    M_V

−0.30   −1.08   −3.25  |   0.30    0.03    2.8
−0.25   −0.90   −2.1   |   0.40   −0.01    3.4
−0.20   −0.69   −1.1   |   0.50    0.00    4.1
−0.15   −0.5    −0.2   |   0.60    0.08    4.7
−0.10   −0.30    0.6   |   0.70    0.23    5.2
−0.05   −0.10    1.1   |   0.80    0.42    5.8
 0.00    0.01    1.5   |   0.90    0.63    6.3
 0.05    0.05    1.7   |   1.00    0.86    6.7
 0.10    0.08    1.9   |   1.10    1.03    7.1
 0.20    0.10    2.4   |   1.20    1.13    7.5

SOURCE: From data published in Schmidt-Kaler (1982)

The stars on the MS in Figure 3.5 do not all have the same age, and 
in Chapter 5 we shall see that in consequence they have a significant spread 
in absolute magnitude at a given color. The stars of a hypothetical popu- 
lation of zero age would form a narrower MS, the so-called zero-age main 
sequence or ZAMS. Table 3.9 lists $U-B$ and $M_V$ as a function of $B-V$
along the ZAMS. Compiling this table would be straightforward if observa- 
tions of a nearby, populous and very young cluster were available. Sadly, no 
such cluster exists. Hence one has to make do with clusters of non-negligible 
age. In §6.2.1 we shall see that as a cluster ages, the most luminous stars 
move away from the ZAMS first, with the result that the older a cluster 
is, the fainter is the magnitude below which stars still lie near the ZAMS. 
Consequently, the standard strategy for determining the ZAMS has been 
to use nearby but old clusters at faint magnitudes, and more distant and 
younger clusters at bright magnitudes. Hence, data are used only for stars 
which lie near the ZAMS, and for such stars stellar-evolution theory (Chap- 
ter 5) can predict accurately by how much they lie above the ZAMS. The 
relative distances to the clusters employed are determined by requiring that
between them the clusters define a single ZAMS. Recently, data from the 
Hipparcos satellite have suggested that this crucial assumption may be in 
error (Mermilliod et al. 1997) but it is too early to assess the implications of 
this finding. 
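
One common use of a tabulated ZAMS is to estimate an absolute magnitude, and hence a distance, from a color. The sketch below linearly interpolates the (B−V, M_V) columns of Table 3.9. It assumes an unreddened star that lies exactly on the ZAMS, which is an idealization; the function names and the example star are hypothetical.

```python
# (B-V, M_V) pairs transcribed from Table 3.9.
ZAMS = [(-0.30, -3.25), (-0.25, -2.1), (-0.20, -1.1), (-0.15, -0.2),
        (-0.10, 0.6), (-0.05, 1.1), (0.00, 1.5), (0.05, 1.7), (0.10, 1.9),
        (0.20, 2.4), (0.30, 2.8), (0.40, 3.4), (0.50, 4.1), (0.60, 4.7),
        (0.70, 5.2), (0.80, 5.8), (0.90, 6.3), (1.00, 6.7), (1.10, 7.1),
        (1.20, 7.5)]

def zams_absolute_magnitude(b_minus_v):
    """Linearly interpolate M_V along the tabulated ZAMS."""
    if not ZAMS[0][0] <= b_minus_v <= ZAMS[-1][0]:
        raise ValueError("color lies outside the tabulated range")
    for (c0, m0), (c1, m1) in zip(ZAMS, ZAMS[1:]):
        if c0 <= b_minus_v <= c1:
            return m0 + (m1 - m0) * (b_minus_v - c0) / (c1 - c0)

def photometric_distance_pc(apparent_v, b_minus_v):
    """Distance implied by m - M = 5 log10(d / 10 pc), ignoring extinction."""
    absolute_v = zams_absolute_magnitude(b_minus_v)
    return 10.0 ** (0.2 * (apparent_v - absolute_v) + 1.0)

# Hypothetical star with V = 9.95 and B-V = 0.65.
print(zams_absolute_magnitude(0.65), photometric_distance_pc(9.95, 0.65))
```

Because real MS stars are somewhat evolved and lie above the ZAMS, a distance obtained this way would tend to be an underestimate for such stars.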

Table 3.10 gives values of $M_V$ and several colors for dwarfs and for
luminosity-class III giants where these lie on the RGB. As we saw above,
it is for dwarfs and these giants that the correlation between MK class and
luminosity is tightest. The absolute magnitudes tabulated for each spectral
class are averages for stars of that class. Thus the MS given in the
table includes an admixture of somewhat evolved stars, and it will therefore
necessarily be brighter than the ZAMS for stars earlier than type G. For
example, the ZAMS for O stars lies almost a magnitude below the MS given
in Table 3.10.

Table 3.10 Luminosities and colors of dwarfs and luminosity-class III giants

Dwarfs (luminosity class V)

MK class    M_V    (U−B)   (B−V)   (V−J)   (J−H)   (H−K)

O3         −6.0    −1.15   −0.33   −0.73   −0.19   −0.05
O5         −5.7    −1.15   −0.33   −0.73   −0.19   −0.05
O8         −4.9    −1.14   −0.32   −0.72   −0.17   −0.05
B0         −4.0    −1.08   −0.30   −0.68   −0.12   −0.05
B3         −1.6    −0.71   −0.20   −0.46   −0.08   −0.03
B5         −1.2    −0.60   −0.17   −0.38   −0.06   −0.02
B8         −0.2    −0.38   −0.11   −0.24   −0.03   −0.02
A0          0.6    −0.06   −0.02   −0.05    0.00    0.00
A5          1.9     0.11    0.15    0.28    0.08    0.01
F0          2.7     0.06    0.30    0.59    0.16    0.02
F5          3.5     0.01    0.44    0.79    0.20    0.03
G0          4.4     0.10    0.58    0.92    0.24    0.04
G5          5.1     0.23    0.68    1.19    0.31    0.05
K0          5.9     0.45    0.81    1.37    0.38    0.06
K5          7.4     1.03    1.15    2.08    0.57    0.10
M0          8.8     1.25    1.40    2.79    0.65    0.15
M5         12.3     1.22    1.64    4.48    0.52    0.28

Giants (luminosity class III)

MK class    M_V    (U−B)   (B−V)   (V−J)   (J−H)   (H−K)

G5          0.9     0.50    0.86    1.49    0.44    0.04
K0          0.7     0.82    1.00    1.72    0.50    0.07
K5         −0.2     1.68    1.50    2.83    0.74    0.16
M0         −0.4     1.81    1.56    2.95    0.76    0.17
M2         −0.6     1.84    1.59    3.09    0.76    0.18
M5         −0.3     1.88    1.63    3.42    0.78    0.19

SOURCE: From data published in Schmidt-Kaler (1982) and Peletier (1989)

Table 3.11 lists values of $M_V$ and colors for luminosity class I supergiants.
These stars are rare objects so they do not have nearby representatives for
which absolute distances are readily obtained. Moreover, they are not
confined to a narrow sequence in the CM diagram, but occur all over the upper
part of the CM diagram. These facts make it inevitable that there is
considerable scatter in the luminosities of these objects for any given spectral
class, and the numbers in Table 3.11 should be used with more caution than
the numbers in Table 3.10.

Table 3.11 Luminosities and colors of supergiants

SOURCE: From data published in Schmidt-Kaler (1982)

Table 3.12 Luminosities of white dwarfs

Class    DB      DA      DF      DG      DM
M_V      10-11   11-12   13-14   14-15   15

Table 3.11 shows that, for the hottest stars, both (B — V) and (U — B) 
reach limiting values. This is a result of the fact that, at high temperatures
($T \gtrsim 40{,}000$ K), the Planck function in the visible reduces to the
Rayleigh-Jeans form $B_\nu(T) = 2kT\nu^2/c^2$. It follows from equation (2.33) that, in
this limit, the color indices will be essentially independent of T. 
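
The approach to a limiting color can be illustrated numerically. The sketch below evaluates the Planck function at two representative wavelengths and forms a monochromatic "color" with no zero point; the choice of 440 nm and 550 nm as stand-ins for the B and V bands, and the neglect of filter profiles and of the zero points that enter equation (2.33), are simplifying assumptions of this sketch.

```python
import math

H = 6.62607015e-34    # Planck constant, J s
K_B = 1.380649e-23    # Boltzmann constant, J / K
C = 2.99792458e8      # speed of light, m / s

def planck_nu(nu, temperature):
    """Planck function B_nu(T) in SI units."""
    x = H * nu / (K_B * temperature)
    return 2.0 * H * nu**3 / C**2 / math.expm1(x)

def monochromatic_color(lambda_1, lambda_2, temperature):
    """-2.5 log10 of the ratio of B_nu at two wavelengths (no zero point)."""
    nu_1, nu_2 = C / lambda_1, C / lambda_2
    ratio = planck_nu(nu_1, temperature) / planck_nu(nu_2, temperature)
    return -2.5 * math.log10(ratio)

def rayleigh_jeans_limit(lambda_1, lambda_2):
    """Limiting color when B_nu is proportional to T nu^2 at both wavelengths."""
    return -5.0 * math.log10(lambda_2 / lambda_1)

LAMBDA_B, LAMBDA_V = 440e-9, 550e-9   # assumed representative wavelengths, m
for T in (1e4, 4e4, 1e5, 1e6):
    color = monochromatic_color(LAMBDA_B, LAMBDA_V, T)
    print(f"T = {T:9.0f} K   color = {color:+.3f}")
print(f"Rayleigh-Jeans limit = {rayleigh_jeans_limit(LAMBDA_B, LAMBDA_V):+.3f}")
```

In the Rayleigh-Jeans regime the temperature cancels from the flux ratio, which is why the color stops changing as T increases.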

Table 3.12 gives characteristic ranges of absolute magnitude for the var- 
ious white-dwarf spectral types. 

The data in Tables 3.10 and 3.11 can be used to construct color—color 
diagrams (or two-color diagrams), such as those shown in Figure 3.7. If 
adequate allowance can be made for interstellar reddening, stars of different 
spectral and luminosity classes can be distinguished on the basis of colors 
alone. Similar diagrams using other photometric indices can sometimes be 
used to carry out a two-dimensional photometric “spectral classification” 
within limited regions of the CM-diagram. This is valuable because we can 
then classify very faint stars.


3.5.3 The physical properties of stars on the MS and RGB


The spectrum of a star is essentially determined by its effective temperature 
and surface gravity (see §3.4). Furthermore, because a stellar spectrum can 
be characterized by giving its MK spectral class, to each MK spectral class
there should correspond definite values of $T_{\rm eff}$, BC, and the surface gravity
g. Table 3.7 gives $T_{\rm eff}$ and BC as functions of MK class. The most reliable
values given are for main-sequence B-K stars. The accuracy of the estimates
deteriorates for O and M stars and for giants and supergiants. Table 3.7
shows that MK spectral type correlates closely with $T_{\rm eff}$. The MK luminosity
classes are primarily sensitive to the surface gravity g.

Naively, one would expect massive stars to be very luminous and stars 
with small masses to be faint. Broadly speaking, it is true that faint stars 
have small masses, but it turns out that luminous stars are not necessarily 
massive. As we shall see in Chapter 5, the luminosity of a star tends to 
increase as it evolves, and this increase is very pronounced for low-mass 
stars. Thus, toward the end of its life, even a low-mass star can become a 
luminous giant. On the MS, however, both a mass—luminosity relation 
and a mass-radius relation hold. Table 3.13 gives these relations (cf. 
§5.1.7). On the RGB there is quite a well-defined relation between spectral 
type and both radius and luminosity (Table 3.14). 
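
The mass-radius relation fixes both the mean density and the surface gravity of a MS star, since the mean density scales as M/R³ and g scales as M/R². The sketch below evaluates these for a few rows of Table 3.13. The solar value log g = 4.44 (cgs) is an approximate figure supplied here, not taken from the text, and the function names are hypothetical.

```python
import math

LOG_G_SUN = 4.44   # approximate log10 of the solar surface gravity (cgs)

# (spectral type, M/Msun, R/Rsun) for a few rows of Table 3.13.
ROWS = [("O3", 120.0, 15.0), ("A0", 2.9, 2.4),
        ("G5", 0.92, 0.92), ("M5", 0.21, 0.27)]

def mean_density_ratio(mass, radius):
    """Mean density in solar units for mass and radius in solar units."""
    return mass / radius**3

def log_surface_gravity(mass, radius):
    """log10 g in cgs units for mass and radius in solar units."""
    return LOG_G_SUN + math.log10(mass) - 2.0 * math.log10(radius)

for sp_type, mass, radius in ROWS:
    print(f"{sp_type}: density ratio = {mean_density_ratio(mass, radius):.3g}, "
          f"log g = {log_surface_gravity(mass, radius):.2f}")
```

The density ratios computed this way can be compared directly with the last column of Table 3.13.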


3.6 The stellar luminosity function 


The density of stars clearly varies from point to point within the Galaxy. 
Moreover, within any volume of space there will be both luminous and faint 
stars. Let the number dN of stars with absolute magnitudes in (M+dM, M)
in the volume $d^3\mathbf{x}$ around the point $\mathbf{x}$ be

$$dN = \Phi(M,\mathbf{x})\,dM\,d^3\mathbf{x}. \tag{3.20}$$

Table 3.13 Physical properties of MS stars

Spectral
type     M/M⊙    log(L/L⊙)    M_bol     M_V     R/R⊙    ρ̄/ρ̄⊙

O3       120       6.15      −10.7     −6.0     15      0.035
O5        60       5.90      −10.1     −5.7     12      0.035
O8        23       5.23       −8.4     −4.9      8.5    0.037
B0        17.5     4.72       −7.1     −4.0      7.4    0.043
B3         7.6     3.28       −3.5     −1.6      4.8    0.069
B5         5.9     2.92       −2.7     −1.2      3.9    0.099
B8         3.8     2.26       −1.0     −0.2      3.0    0.14
A0         2.9     1.73        0.3      0.6      2.4    0.21
A5         2.0     1.15        1.7      1.9      1.7    0.41
F0         1.6     0.81        2.6      2.7      1.5    0.47
F5         1.3     0.51        3.4      3.5      1.3    0.59
G0         1.05    0.18        4.2      4.4      1.1    0.79
G5         0.92   −0.10        4.9      5.1      0.92   1.18
K0         0.79   −0.38        5.6      5.9      0.85   1.29
K5         0.67   −0.82        6.7      7.4      0.72   1.79
M0         0.51   −1.11        7.4      8.8      0.60   2.36
M5         0.21   −1.96        9.6     12.3      0.27  10.7
M7         0.12   −2.47       10.8     14.3      0.18  20.6
M8         0.06   −2.92       11.9     16.0      0.1   60

SOURCE: Data published in Schmidt-Kaler (1982)


Table 3.14 Radii and luminosities of red giants

              G0    G5    K0    K5    M0    M5
log(R/R⊙)  |  0.8   1.0   1.2   1.4   1.6   1.9
log(L/L⊙)  |  1.5   1.7   1.9   2.3   2.6   3.0

SOURCE: From data published in Allen (1973)


To a first approximation it is useful to imagine that the mix of stars of
different luminosities is the same everywhere. To express this idea mathematically,
we approximate the function $\Phi(M,\mathbf{x})$ defined above by the product of two
functions $\Phi(M)$ and $\nu(\mathbf{x})$. That is, we write

$$dN = [\Phi(M)\,dM]\,[\nu(\mathbf{x})\,d^3\mathbf{x}]. \tag{3.21}$$

$\Phi(M)$ is called a luminosity function, which measures the relative
fractions of stars of different luminosities, while $\nu(\mathbf{x})$ measures the total number
density of stars at the point $\mathbf{x}$. In its simplest form, $\Phi$ gives the distribution
over luminosity of stars irrespective of their spectral or physical types. In
this case we call $\Phi$ the general luminosity function. Frequently we are
interested in the distribution over luminosity of stars of a special type, for
example MS stars, or, more specifically, say, stars of MK spectral class G5
V. Hence, we shall encounter many different stellar luminosity functions.

The concept of a luminosity function generalizes, moreover, to
extragalactic studies. In particular, in equation (3.21) $\nu(\mathbf{x})$ could be the number
density of normal galaxies within the Universe and $\Phi(M)$ could be the
luminosity function of normal galaxies, which we shall study in §4.1.3. Similarly,
$\nu(\mathbf{x})$ might be the number density of some species of active galaxies, such as
Seyferts (see §4.6) and $\Phi(M)$ could be the luminosity function of Seyferts.

Equation (3.21) does not determine the normalization of $\Phi$ since we
can multiply $\Phi$ by any constant provided we divide $\nu$ by the same constant.
Conventionally, stellar luminosity functions for the solar neighborhood are
normalized such that $\Phi(M)\,dM$ gives the number of stars per cubic
parsec near the Sun with magnitudes in the range (M+dM,M). With this
normalization, $\nu(\mathbf{x})$ is the ratio of the stellar density at $\mathbf{x}$ to that near the
Sun.


3.6.1 Malmquist bias 


Most methods for determining a luminosity function involve counting the 
number dN/dm of objects that have apparent magnitudes in the range 
(m+dm,m) and that lie within some given area of the sky. The star-count 
function 


$$A(m) \equiv \frac{dN}{dm} \tag{3.22}$$


clearly depends on both the spatial distribution of the objects and on their 
luminosity function. Since it is impossible to determine A(m) to arbitrarily
faint magnitudes, there will be some limiting magnitude $m_l$ such that
A(m) is available only for $m < m_l$. The simplest sample of objects upon
which A(m) could be based is magnitude-limited in that it consists of all
objects brighter than $m_l$ that lie within a specified area of the sky. It is not
hard to see that the mean absolute magnitude of objects in such a sample will 
be brighter than the mean absolute magnitude of the population as a whole: 
the volume within which we can see the most luminous objects is larger than 
that within which we can also see the faintest objects. Consequently, lumi- 
nous objects are over-represented in a magnitude-limited sample. This effect 
is called Malmquist bias after the Swedish astronomer K.G. Malmquist 
(Malmquist 1922, Malmquist 1936). It plays an important role in nearly ev- 
ery field of astronomy. We now evaluate the effect of Malmquist bias for the 
case in which the luminosity function of the population is a Gaussian. This 
analysis leads to a relationship between the population and sample means, 
equation (3.34), which is often used to correct for Malmquist bias.
Unfortunately, this equation involves the variance $\sigma^2$ of the luminosity function, so
it is necessary to derive equation (3.39), which relates the sample variance
$\sigma_m^2$ to $\sigma^2$.

Let us calculate the number of stars in a magnitude-limited sample
that have absolute magnitudes in (M + dM, M) and apparent magnitudes
in (m+dm,m):

$$\frac{d^2N}{dm\,dM} = \Phi(M)\,\frac{dn}{ds}\left(\frac{\partial s}{\partial m}\right)_M, \tag{3.23}$$

where s(m, M) is the distance to a star of the given apparent and absolute
magnitudes and dn is the number of stars that lie within the surveyed region
of the sky at distances in (s+ds,s). If the survey covers a solid angle $\omega$, the
volume at distances in (s + ds, s) is $\omega s^2\,ds$ and we have that

$$\frac{dn}{ds} = \omega s^2\nu(s). \tag{3.24}$$


Inserting (3.24) into (3.23), multiplying both sides of the resulting equation 
by dM and integrating, we find that 


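$$A(m) = \int_{-\infty}^{\infty} dM\,\frac{d^2N}{dm\,dM} = \omega\int_{-\infty}^{\infty} dM\,\Phi(M)\left(\frac{\partial s}{\partial m}\right)_M s^2\nu(s). \tag{3.25}$$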


We now change the integration variable from M to s, which is a function of
M through equation (2.36).⁵ This operation yields

$$A(m) = \omega\int_0^{\infty} ds\,\Phi(M)\,s^2\nu(s), \tag{3.26}$$

where we have taken advantage of the relation $(\partial M/\partial s)_m = -(\partial m/\partial s)_M$,
which is an immediate consequence of equation (2.36).
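
The change of variable can be spelled out explicitly. The lines below assume that equation (2.36) is the usual distance-modulus relation $m - M = 5\log_{10}s - 5$ with s in parsecs; that equation is not reproduced in this section, so this form is an assumption.

```latex
\begin{align}
M = m - 5\log_{10}s + 5
  \quad&\Rightarrow\quad
  \left(\frac{\partial M}{\partial s}\right)_m = -\frac{5}{s\ln 10}
  = -\left(\frac{\partial m}{\partial s}\right)_M, \\
dM\left(\frac{\partial s}{\partial m}\right)_M
  &= \left(\frac{\partial M}{\partial s}\right)_m
     \left(\frac{\partial s}{\partial m}\right)_M ds = -ds, \\
\int_{M=-\infty}^{\infty} dM \left(\frac{\partial s}{\partial m}\right)_M (\cdots)
  &= \int_{s=\infty}^{0} (-ds)\,(\cdots)
   = \int_{0}^{\infty} ds\,(\cdots).
\end{align}
```

The limits are exchanged because, at fixed m, the most luminous stars (M tending to minus infinity) lie at the largest distances.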

Consider now the mean absolute magnitude of the stars in the sample 
that have apparent magnitude m: 


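$$\langle M\rangle_m \equiv \frac{\int_{-\infty}^{\infty} dM\,M\,\dfrac{d^2N}{dm\,dM}}{\int_{-\infty}^{\infty} dM\,\dfrac{d^2N}{dm\,dM}}. \tag{3.27}$$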


By the reasoning that carried us from equation (3.23) to equation (3.26), 
equation (3.27) is equivalent to 


$$\langle M\rangle_m = \frac{\int_0^{\infty} ds\,M\,\Phi(M)\,s^2\nu(s)}{\int_0^{\infty} ds\,\Phi(M)\,s^2\nu(s)}. \tag{3.28}$$


Notice that the bottom of this equation is the integral that appears in
equation (3.26) for A(m). Moreover, in this integral M is a function of m through
equation (2.36) with $(\partial M/\partial m)_s = 1$, so on differentiating (3.26) with respect
to m we obtain

$$\frac{dA}{dm} = \omega\int_0^{\infty} ds\,\frac{d\Phi}{dM}\,s^2\nu(s). \tag{3.29}$$

5 In equation (2.36) we must substitute s for d.


Comparing this result with equation (3.28), we see that 


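$$\frac{1}{A}\frac{dA}{dm} = \left\langle\frac{1}{\Phi}\frac{d\Phi}{dM}\right\rangle_m. \tag{3.30}$$

Differentiating equation (3.29) with respect to m we similarly find that

$$\frac{1}{A}\frac{d^2A}{dm^2} = \left\langle\frac{1}{\Phi}\frac{d^2\Phi}{dM^2}\right\rangle_m. \tag{3.31}$$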

To proceed further we must adopt a definite functional form for the
luminosity function $\Phi(M)$. Consider the case in which $\Phi(M)$ is a Gaussian
distribution:

$$\Phi(M) = \frac{1}{(2\pi\sigma^2)^{1/2}}\exp\left[-\frac{(M-M_0)^2}{2\sigma^2}\right], \tag{3.32}$$

where for simplicity we have set the local stellar density to unity. The
constants $M_0$ and $\sigma$ appearing in equation (3.32) are, respectively, the mean
absolute magnitude and the dispersion in absolute magnitude of a
volume-limited sample of the stars, that is, a sample formed by all the stars that lie
within some specified volume. When $\Phi$ is given by equation (3.32), equation
(3.30) becomes

$$\frac{1}{A}\frac{dA}{dm} = -\left\langle\frac{M-M_0}{\sigma^2}\right\rangle_m, \tag{3.33}$$

or

$$\langle M\rangle_m - M_0 = -\sigma^2\,\frac{d\ln A}{dm}. \tag{3.34}$$


Since the number of objects in a magnitude-limited survey almost invariably
increases towards fainter apparent magnitudes, dA/dm > 0. Consequently,
equation (3.34) implies that the objects in the survey that have any given
apparent magnitude m will, on the average, have a higher luminosity than
the mean luminosity of the population as a whole.

The quantity $\langle M\rangle_m$ is potentially measurable. For example, we could
determine trigonometric parallaxes for all the stars of a magnitude-limited
survey that have apparent magnitudes in some narrow, suitably bright range.
Once the parallaxes of these stars were known, their absolute magnitudes
could be calculated, and thus $\langle M\rangle_m$ determined. If the dispersion $\sigma$ of the
luminosity function were known, equation (3.34) could be used to ‘correct
$\langle M\rangle_m$ for Malmquist bias’, that is to infer $M_0$ from $\langle M\rangle_m$ and the observed
form of A(m). Unfortunately, $\sigma$ will, in general, have to be determined from
the same survey, and, as we now show, its measurement is also complicated
by Malmquist bias.
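
When we substitute equation (3.32) into equation (3.31), the latter
becomes

$$\frac{1}{A}\frac{d^2A}{dm^2} = \left\langle\left(\frac{M-M_0}{\sigma^2}\right)^2 - \frac{1}{\sigma^2}\right\rangle_m. \tag{3.35}$$

Hence

$$\frac{1}{\sigma^4}\left\langle(M-M_0)^2\right\rangle_m - \frac{1}{\sigma^2} = \frac{1}{A}\frac{d^2A}{dm^2} = \frac{d^2\ln A}{dm^2} + \left(\frac{1}{A}\frac{dA}{dm}\right)^2 = \frac{d^2\ln A}{dm^2} + \left(\frac{\langle M\rangle_m - M_0}{\sigma^2}\right)^2, \tag{3.36}$$

so that

$$\sigma^4\,\frac{d^2\ln A}{dm^2} = \left\langle(M-M_0)^2\right\rangle_m - \left(\langle M\rangle_m - M_0\right)^2 - \sigma^2 = \langle M^2\rangle_m - \langle M\rangle_m^2 - \sigma^2. \tag{3.37}$$

Now the variance of the measured absolute magnitudes is

$$\sigma_m^2 \equiv \left\langle\left(M - \langle M\rangle_m\right)^2\right\rangle_m = \langle M^2\rangle_m - \langle M\rangle_m^2, \tag{3.38}$$

so from equation (3.37) we have that

$$\sigma_m^2 = \sigma^2 + \sigma^4\,\frac{d^2\ln A}{dm^2}. \tag{3.39}$$
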

To assess the significance of this equation, consider the case of a spatially
uniform population: $\nu(s)$ = constant. From equation (2.36) it follows that
$(\partial s/\partial m)_M = (0.2\ln 10)\,s$, and with a little algebra we can show that for
constant $\nu$, equation (3.25) may be written

$$A(m) \propto \int_{-\infty}^{\infty} dM\,\exp\left\{-\frac{(M-M_0)^2}{2\sigma^2} + 0.6\ln 10\,(m-M)\right\} \propto \exp\left[0.6\ln 10\,(m-M_0)\right]. \tag{3.40}$$

Hence in the case of a homogeneously distributed population,

$$\frac{d\ln A}{dm} = 0.6\ln 10 \quad\text{and}\quad \frac{d^2\ln A}{dm^2} = 0. \tag{3.41}$$

Consequently, by equations (3.34) and (3.39), neither $\langle M\rangle_m - M_0$ nor
$\sigma_m^2 - \sigma^2$ depends upon m. This fact can be used as a test of the hypothesis
that the luminosity function is Gaussian: one can divide the sample by 
apparent magnitude, and for each subsample evaluate the mean and variance 
of M. If these quantities vary significantly between subsamples, either the 
Gaussian hypothesis or the assumption of homogeneity must be rejected. 
Since equations (3.34) and (3.39) are only valid for a Gaussian luminosity 
function, they must not be used to correct for Malmquist bias if there is 
evidence that the luminosity function is non-Gaussian. 
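
The two results for a homogeneous population, a mean shifted by $-0.6\ln 10\,\sigma^2$ and an unchanged variance, can be checked with a small Monte Carlo experiment. The sketch below is only an illustration under stated assumptions: stars are scattered uniformly in a sphere, the values of $M_0$, $\sigma$, the limiting magnitude and the sphere radius are hypothetical, and the radius is chosen large enough that the magnitude limit, not the edge of the sphere, sets the sample.

```python
import math
import random

def magnitude_limited_sample(m_lim, M0=5.0, sigma=0.5, n_stars=300_000,
                             s_max=1000.0, seed=42):
    """Absolute magnitudes of simulated stars with apparent magnitude < m_lim.

    Stars are placed uniformly in a sphere of radius s_max (pc) and given
    Gaussian absolute magnitudes with mean M0 and dispersion sigma.
    """
    rng = random.Random(seed)
    selected = []
    for _ in range(n_stars):
        # Cube root of a uniform deviate gives a uniform density in volume;
        # 1 - random() lies in (0, 1], which avoids log10(0).
        s = s_max * (1.0 - rng.random()) ** (1.0 / 3.0)
        M = rng.gauss(M0, sigma)
        m = M + 5.0 * math.log10(s) - 5.0
        if m < m_lim:
            selected.append(M)
    return selected

def mean_and_variance(values):
    """Sample mean and unbiased sample variance."""
    n = len(values)
    mean = sum(values) / n
    variance = sum((v - mean) ** 2 for v in values) / (n - 1)
    return mean, variance

M0, sigma = 5.0, 0.5
sample = magnitude_limited_sample(m_lim=11.5, M0=M0, sigma=sigma)
mean_M, var_M = mean_and_variance(sample)
predicted_shift = -sigma**2 * 0.6 * math.log(10.0)   # eqs (3.34) and (3.41)

print(f"stars in sample      : {len(sample)}")
print(f"measured <M> - M0    : {mean_M - M0:+.3f}")
print(f"predicted shift      : {predicted_shift:+.3f}")
print(f"sample variance      : {var_M:.3f}  (population {sigma**2:.3f})")
```

Because the shift does not depend on m for a homogeneous population, the whole magnitude-limited sample can be averaged at once; splitting it into bins of apparent magnitude would provide the consistency test described above.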

One case in which the assumptions of a Gaussian luminosity function 
and homogeneity are reasonably valid is that of MS stars with good Hipparcos 
parallaxes. As we will see in §3.6.4, in this case $\sigma_m \simeq 0.5$ mag, so by equations
(3.34) and (3.41), $\langle M\rangle_m - M_0 = -0.6\ln 10\,\sigma^2 \simeq -0.35$ mag.

Equations (3.39) and (3.41) show that in the case of a homogeneously
distributed population, the sample variance $\sigma_m^2$ is an unbiased estimator of
the population variance $\sigma^2$. However, it often happens that a population’s
density falls with distance from the Sun, perhaps because the surveyed
field lies towards a Galactic pole. In this case, the star-count function $A(m)$
will increase with $m$ less rapidly at faint than at bright magnitudes, with
the result that $d^2\ln A/dm^2 < 0$, and equation (3.39) then implies that the
sample variance will be smaller than the population variance.
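
The homogeneous case can be checked numerically. The sketch below is an illustration only: the function name, the integration grid and the test values of $M_0$ and $\sigma$ are assumptions, not taken from the text. It weights a Gaussian luminosity function by the volume factor $10^{0.6(m-M)}$ that appears in equation (3.40) and evaluates the mean and variance of $M$ at fixed $m$. For this weighting the mean is shifted by $-0.6\ln 10\,\sigma^2$, the variance stays equal to $\sigma^2$, and neither depends on $m$.

```python
import math

def moments_at_fixed_m(m, M0, sigma, n_steps=4001):
    """Mean and variance of absolute magnitude M among stars of apparent
    magnitude m, assuming a Gaussian luminosity function (mean M0,
    dispersion sigma) and a uniform space density."""
    a = 0.6 * math.log(10.0)
    lo, hi = M0 - 12.0 * sigma, M0 + 12.0 * sigma
    dM = (hi - lo) / (n_steps - 1)
    w_sum = wM_sum = wM2_sum = 0.0
    for i in range(n_steps):
        M = lo + i * dM
        # Gaussian luminosity function times the volume factor 10^(0.6 (m - M))
        w = math.exp(-(M - M0) ** 2 / (2.0 * sigma ** 2) + a * (m - M))
        w_sum += w
        wM_sum += w * M
        wM2_sum += w * M * M
    mean = wM_sum / w_sum
    variance = wM2_sum / w_sum - mean ** 2
    return mean, variance

# Illustrative values: the shift in the mean and the variance should be the
# same for a bright and a faint apparent-magnitude slice.
for m in (8.0, 12.0):
    mean, variance = moments_at_fixed_m(m, M0=5.0, sigma=0.4)
    print(m, mean - 5.0, variance)
```

Subsamples of a real catalog whose means or variances drift with $m$ would then point to a non-Gaussian luminosity function or an inhomogeneous population, as described above.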


3.6.2 Lutz-Kelker Bias 


The underlying cause of the Malmquist bias — that, for a survey of fixed 
solid angle, the volume studied increases with distance — gives rise to a sim- 
ilar effect in the analysis of trigonometric parallaxes. This effect, known as 
the Lutz-Kelker bias, causes an observed parallax to be on average higher 
than its true value (Lutz & Kelker 1973). This overestimate translates into 
an underestimate of distance, and hence an underestimate of an object’s lu- 
minosity as derived from its apparent brightness. Given the renewed interest 
in parallax determinations excited by Hipparcos, this bias and its eradication 
are of some importance. 

To quantify the importance of this effect, we now calculate the probability
$P(\varpi|\varpi')\,d\varpi$ that the true parallax of a given star lies in $(\varpi, \varpi + d\varpi)$
given that its measured parallax is $\varpi'$. Bayes’ theorem [equation (C.4)] tells
us that this probability is given by

$$P(\varpi|\varpi') = \frac{P(\varpi'|\varpi)\,P(\varpi)}{P(\varpi')}, \qquad (3.42)$$


where $P(\varpi'|\varpi)\,d\varpi'$ is the probability that observational errors will cause the
parallax of a star that has true parallax $\varpi$ to be measured to be $\varpi'$, and
$P(\varpi)\,d\varpi$ and $P(\varpi')\,d\varpi'$ are ‘prior probabilities’ which reflect the knowledge
that we have of the star before we measure its parallax. Specifically, $P(\varpi)\,d\varpi$
is the prior probability that the star has true parallax in $(\varpi, \varpi + d\varpi)$, and
$P(\varpi')\,d\varpi'$ is the prior probability of measuring a parallax in $(\varpi', \varpi' + d\varpi')$.
Since we are interested in the dependence of $P(\varpi|\varpi')$ on $\varpi$ at fixed $\varpi'$,
we do not need to determine the prior probability in the denominator of
equation (3.42). The prior probability in the numerator of equation (3.42),
by contrast, is of fundamental importance.

The prior $P(\varpi)$ will reflect (i) the known value of the star’s apparent
magnitude $m$, and (ii) the probabilities that we assign to the various possible
values of the star’s absolute magnitude $M$. Quantitatively, let $P(s|m)\,ds$ be
the probability of finding a star of magnitude $m$ at a distance in $(s, s + ds)$.
By Bayes’ theorem [equation (C.4)] this is given by

$$P(s|m) = \frac{P(m|s)\,P(s)}{P(m)}, \qquad (3.43)$$


where $P(m|s)\,dm$ is the probability that a star at distance $s$ has apparent
magnitude $m$ and $P(s)\,ds$ is the probability of finding a star at distance $s$.
From the definition of the luminosity function, equation (3.21), we have

$$P(m|s) \propto \Phi[M(s,m)] \quad\text{and}\quad P(s) \propto \nu(s)\,s^2, \qquad (3.44)$$

while $P(m)$, the probability density of observing a star of apparent magnitude
$m$, irrespective of distance, is proportional to the star-count function
$A(m)$ calculated above. Equations (3.43) and (3.44) now yield that the
prior probability that a star of apparent magnitude $m$ has true parallax in
$(\varpi, \varpi + d\varpi)$ is


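$$P(\varpi)\,d\varpi = P(s|m)\left|\frac{\partial s}{\partial\varpi}\right| d\varpi \propto \Phi(M)\,\nu(s)\,s^2\left|\frac{\partial s}{\partial\varpi}\right| d\varpi, \quad\text{where}\quad \begin{cases} M = m + 5\log(\varpi/10), \\ s = 1/\varpi. \end{cases} \qquad (3.45)$$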


When we evaluate the derivative in this equation and use the result to eliminate
$P(\varpi)$ from equation (3.42), we obtain

$$P(\varpi|\varpi') \propto P(\varpi'|\varpi)\,\Phi(M)\,\nu(s)\,\varpi^{-4}. \qquad (3.46)$$


To proceed further we must make assumptions regarding the distribution
of measuring errors, $P(\varpi'|\varpi)$, the luminosity function $\Phi(M)$ and the space-density
of stars $\nu(s)$. We assume that the probability of measuring a parallax
$\varpi'$ for a star whose true parallax is $\varpi$ is given by the Gaussian

$$P(\varpi'|\varpi) = \frac{1}{\sqrt{2\pi}\,\sigma_\varpi}\exp\left[-\frac{(\varpi' - \varpi)^2}{2\sigma_\varpi^2}\right], \qquad (3.47)$$




where $\sigma_\varpi$ is the standard deviation of the measurement errors. Since parallaxes
can only be measured for nearby stars, a reasonable assumption regarding
$\nu$ is that it is independent of $s$. With this assumption and equation
(3.47), equation (3.46) becomes

$$P(\varpi|\varpi') \propto \Phi(M)\,\varpi^{-4}\exp\left[-\frac{(\varpi' - \varpi)^2}{2\sigma_\varpi^2}\right], \qquad (3.48)$$

where $M = m + 5\log(\varpi/10)$. The noteworthy feature of equation (3.48) is
the factor $\varpi^{-4}$, which strongly biases the probability $P(\varpi|\varpi')$ that the star
has true parallax $\varpi$ towards small values of $\varpi$. This is Lutz-Kelker bias.
For given $m$, very small values of $\varpi$ are associated with very bright absolute
magnitudes $M$, at which the luminosity function $\Phi$ will be small. Hence
Lutz-Kelker bias is moderated by the luminosity function.

The form of $\Phi(M)$ that we should use in equation (3.48) depends on
what we know about the star before we measure its parallax. If we know the
MK spectral class of the star, $\Phi$ should be the luminosity function for that
MK class, which can be approximated by a Gaussian (see equation (3.57)
below). At sufficiently bright magnitudes, $\Phi$ then declines extremely strongly
and Lutz-Kelker bias is strongly moderated. If we only know the star’s
apparent magnitude, $\Phi$ should be the general luminosity function of nearby
stars. We shall see below that at bright absolute magnitudes this can be
approximated by a straight line in a plot of $\log\Phi$ versus $M$, with slope
$\simeq 0.65$:

$$\Phi(M) = \text{constant}\times 10^{\beta M} = \text{constant}\times 10^{\beta[m + 5\log(\varpi/10)]}, \quad\text{where } \beta \simeq 0.65. \qquad (3.49)$$


Substituting this simple form of $\Phi(M)$ into equation (3.48) we conclude that
when we know only the star’s apparent magnitude prior to measuring its
parallax $\varpi'$, the probability distribution of its true parallax is

$$P(\varpi|\varpi') \propto \exp\left[-\frac{(\varpi' - \varpi)^2}{2\sigma_\varpi^2}\right]\varpi^{5\beta - 4}. \qquad (3.50)$$


Figure 3.8 plots this probability distribution for three values of $\varpi'/\sigma_\varpi$ and
$\beta = 0.65$, when the exponent $5\beta - 4 \simeq -0.75$. When $\varpi'/\sigma_\varpi = 2$, the peak
in the probability density occurs at very small values of true parallax, $\varpi$.
Physically, this result implies that unless a parallax measurement is more
than a $2\sigma$ result, nothing useful can be inferred from it about the star’s
distance because it is at least as likely to have a negligible true parallax as a
value near the measured value. The curve of $P$ for the case $\varpi'/\sigma_\varpi = 3$ peaks
only once, near the measured parallax, but the peak occurs significantly to
the left of the measured parallax. In fact, by setting the derivative of equation
(3.50) to zero it is easy to show that the peak occurs at

$$\frac{\varpi}{\sigma_\varpi} = \frac{1}{2}\left[\frac{\varpi'}{\sigma_\varpi} + \sqrt{\frac{\varpi'^2}{\sigma_\varpi^2} + 4(5\beta - 4)}\right]. \qquad (3.51)$$

Figure 3.8 The probability distribution $P(\varpi|\varpi')$ given by equation (3.50) for $\beta = 0.65$ and $\varpi'/\sigma_\varpi = 2, 3, 4$. Each curve is labeled by its value of $\varpi'/\sigma_\varpi$.

Table 3.15 The Lutz-Kelker bias in parallax and absolute magnitude and its 90% confidence limits

(ϖ − ϖ′)/σ_ϖ   −0.016  −0.033  −0.043  −0.065  −0.082  −0.134  −0.235
ϖ_L/σ_ϖ          38.7    18.7    13.7     8.6     6.6     3.5     1.2
ϖ_U/σ_ϖ          41.3    21.2    16.2    11.2     9.2     6.1     4.0
M − M′          −0.001  −0.004  −0.006  −0.014  −0.022  −0.059  −0.177


The offset $\varpi - \varpi'$ between the location of this peak and the measured parallax,
which is a measure of the seriousness of Lutz-Kelker bias, is given
in Table 3.15, which also gives $\varpi_L$ and $\varpi_U$, the lower and upper 90% confidence
limits of $\varpi$. For example, if $\varpi'/\sigma_\varpi = 5$, the most probable value
of $\varpi$ is $\varpi' - 0.134\sigma_\varpi$ and 90% of the probability lies between $\varpi = 3.5\sigma_\varpi$
and $\varpi = 6.1\sigma_\varpi$. On account of the non-Gaussian nature of the distribution
$P(\varpi|\varpi')$, these confidence limits are asymmetric, and exceed those that
would be returned by a naive propagation of the observational errors.

Since parallaxes are often used to calibrate the absolute magnitudes
of objects, it is useful to define the correction that converts the
magnitude inferred from the raw parallax measurement, $M'$, into a better
estimate of the true absolute magnitude, $M = M' + 5\log(\varpi/\varpi')$ [see equations
(2.7) and (2.36)]. A plausible form of this correction is obtained by setting
$\varpi$ equal to the value given by equation (3.51). The last row of Table 3.15
gives the corresponding values of $M - M'$.
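
Equation (3.51) and the associated magnitude correction are simple to evaluate. The following sketch is illustrative: the function names are hypothetical, parallaxes are expressed in units of $\sigma_\varpi$, and $\beta$ is left as a parameter because the size of the correction depends on the value adopted for it. It returns the location of the peak of equation (3.50), the correction $M - M' = 5\log(\varpi/\varpi')$ evaluated at that peak, and the logarithmic slope of equation (3.50), which should vanish at the peak.

```python
import math

def lk_peak(x_obs, beta=0.65):
    """Peak of equation (3.50), in units of the parallax error.

    x_obs is the measured parallax divided by its standard error.
    Returns None when the square root in equation (3.51) is undefined,
    i.e. when the distribution has no stationary point."""
    disc = x_obs ** 2 + 4.0 * (5.0 * beta - 4.0)
    if disc < 0.0:
        return None
    return 0.5 * (x_obs + math.sqrt(disc))

def lk_magnitude_correction(x_obs, beta=0.65):
    """M - M' = 5 log10(peak parallax / measured parallax)."""
    peak = lk_peak(x_obs, beta)
    if peak is None:
        return None
    return 5.0 * math.log10(peak / x_obs)

def log_posterior_slope(x, x_obs, beta=0.65):
    """d ln P / dx for P proportional to exp[-(x_obs - x)^2 / 2] * x^(5 beta - 4)."""
    return (x_obs - x) + (5.0 * beta - 4.0) / x

for x_obs in (3.0, 5.0, 10.0):
    peak = lk_peak(x_obs)
    print(x_obs, peak - x_obs, lk_magnitude_correction(x_obs))
```

The peak always lies below the measured value when $5\beta - 4 < 0$, so the correction makes the star intrinsically brighter than the raw parallax suggests.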

Notice that Lutz-Kelker bias leads to an essentially circular problem:
to correct for this bias we need to know the luminosity function of stars $\Phi$,
which is precisely what we are trying to determine when we first encounter
Lutz-Kelker bias. In such circumstances we proceed iteratively: we use a
guess at the form of $\Phi$ to correct for bias, use the corrected data to measure $\Phi$
and then repeat the analysis with refined corrections. The important thing is
to be aware of the existence of the bias and to understand that its magnitude
depends on the criteria upon which a given star was included in a parallax
program.


3.6.3 The general luminosity function 


Cluster luminosity functions Conceptually the simplest method of determining
$\Phi$ is to count stars within a cluster of known distance. If it were
possible to resolve every star more luminous than some limiting absolute
magnitude $M_{\max}$, then, for $M < M_{\max}$, $\Phi(M)$ would be proportional to the
number of stars actually counted in $(M + dM, M)$. Actually, at faint magnitudes
only some fraction $\phi(M)$ of the stars actually present will be detected
because faint stars tend to be missed through ‘confusion’ (see §6.1.1). We
defer discussion of how one corrects for confusion and the details of cluster
luminosity functions to Chapter 6, but note here that observations with
HST suggest that at magnitudes $M_V \gtrsim 6$, which correspond to stars with
MS-lifetimes longer than the age of the Galaxy, $\Phi(M)$ has the same general
form as it does in the field near the Sun. Specifically, for $M_I \lesssim 9$, $\Phi(M_I)$
increases with $M_I$, but at some characteristic magnitude near $M_I \simeq 9$, which
depends on the cluster’s metallicity, $\Phi(M_I)$ flattens off and then turns down
as $M_I$ becomes fainter than this characteristic magnitude (Figure 6.16).


Photometrically complete surveys Consider now the problem of de- 
termining ®(M) from the mix of stars near the Sun. Clearly, this problem 
involves counting stars within a well defined volume for each of a series of 
intervals of absolute magnitude. The volume within which stars are counted 
will generally be different for every magnitude interval since, on the one hand, 
we want our samples to be as large as possible to minimize statistical errors, 
and, on the other hand, luminous stars are comparatively rare but can be 
seen to large distances. Hence it is helpful for the volume of our census to 
increase with the luminosity of the stars being counted. One convenient way 
of sampling volumes that increase with luminosity is to form a magnitude-limited
sample. With this strategy, our sampling volume increases as $L^{3/2}$,
where $L$ is the luminosity of the sampled stars.
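
The $L^{3/2}$ scaling follows directly from the distance modulus. As a minimal sketch (the function names and the all-sky default solid angle are assumptions made for illustration, and interstellar absorption is ignored), the limiting distance of a survey complete to apparent magnitude `m_lim` for stars of absolute magnitude `M` is $10^{0.2(m_{\rm lim} - M) + 1}$ pc, and the surveyed volume scales as its cube:

```python
import math

def limiting_distance_pc(m_lim, M):
    """Distance (pc) at which a star of absolute magnitude M has apparent
    magnitude m_lim, from m - M = 5 log10(d / 10 pc)."""
    return 10.0 ** (0.2 * (m_lim - M) + 1.0)

def survey_volume_pc3(m_lim, M, solid_angle_sr=4.0 * math.pi):
    """Volume (pc^3) within which stars of absolute magnitude M are brighter
    than m_lim, for a survey covering solid_angle_sr steradians."""
    d = limiting_distance_pc(m_lim, M)
    return solid_angle_sr * d ** 3 / 3.0

# Stars 5 mag brighter are 100 times more luminous, so the volume sampled
# grows by a factor 100^(3/2) = 1000.
ratio = survey_volume_pc3(10.0, 0.0) / survey_volume_pc3(10.0, 5.0)
print(ratio)
```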

To determine the star’s absolute magnitude, we need an estimate of its
distance $d$. Ideally this would be a trigonometric distance. Unfortunately,
trigonometric parallaxes are available only for a tiny fraction of all stars:
Hipparcos determined parallaxes for all stars brighter than $m = 8$ mag and
for only a fraction of the fainter stars. The lowest-luminosity MS stars have
$M_V \simeq 18$ mag, so by equation (2.36) with $m - M = -10$ such a star is
guaranteed to have a Hipparcos parallax only if it lies closer than $d = 0.1$ pc!
Although such very low-luminosity
stars are amongst the most numerous stars within the Galaxy, there is a
negligible chance that one of them will lie so close to the Sun. Hence we have
no chance of obtaining a useful estimate of the faint end of the luminosity
function simply by relying on the complete Hipparcos parallax catalog. Yet
long before the Hipparcos catalog became available, reasonable estimates of
$\Phi(M)$ down to $M_V \simeq 17$ had been made using parallax data that were orders
of magnitude worse than those in the Hipparcos catalog. How was this task
accomplished?

One important strategy involves the use of photometric distances (§2.1.3). 
From the colors, and perhaps a low-resolution spectrum, of each star in our 
magnitude-limited survey, we decide what kind of star it is, and hence what 
its luminosity is. Its distance then follows from equation (2.36). Notice 
that this procedure does not eliminate the need for trigonometric parallaxes, 
since the calibration of the relation between colors and luminosities will be 
based on trigonometric parallaxes. The key point is that for this calibrating 
step one requires parallaxes for only a few faint stars rather than a complete 
sample of faint stars — see §10.4.3 for details. 

Since color data are required for the determination of each star’s dis- 
tance, color generally features in the original magnitude-limited sample. For 
example, one might define the sample to be all stars brighter than some 
limiting magnitude in the V band and redder than some particular value of
$V - I$. In the absence of interstellar reddening (which will be unimportant if
the volume sampled is not large), whether a star satisfies the $V - I$ criterion
does not depend on its distance, so the selection criteria will turn up all suf- 
ficiently red stars that lie within a luminosity-dependent volume. The great 
advantage of imposing a color criterion is that it enables us to focus on one 
particular type of object, for example low-luminosity MS stars or cool white 
dwarfs, which can only be studied at faint apparent magnitudes. Without 
the color criterion, any complete sample of faint stars would include large 
numbers of distant luminous objects, whose number density has already been 
determined at brighter apparent magnitudes. 


Proper-motion selected surveys Since a powerful strategy for identifying
stars that lie close to the Sun is to look for stars with large proper
motions, a classical procedure for determining $\Phi(M)$ relies upon a catalog
of high-proper-motion stars. This procedure is particularly important for
the determination of the space density of halo stars (see §10.5.2) which are
scarce near the Sun but are the dominant population more than ~ 1 kpc from 
the plane. As these stars pass through the solar neighborhood, they have 
large space velocities (§10.5.2), so they have high probabilities of entering a 
proper-motion catalog. 

The determination of $\Phi(M)$ from a proper-motion catalog involves handling
difficulties that arise from the incompleteness of the observations and
selection effects. Since similar difficulties arise in many branches of astronomy,
such as the analysis of pulsars, X-ray sources, radio galaxies, and
quasars, we now describe the derivation of $\Phi(M)$ from a proper-motion catalog
in some detail.

Figure 3.9 The observed sample of stars is complete below the full curve. At the very bright magnitudes that lie below the dotted curve, there are too few stars to

Consider the stars within a sphere of radius 100 pc centered on the Sun. 
This volume is large enough to contain a statistically useful sample of stars 
but still small enough that interstellar absorption can be neglected within 
it. Inside this sphere, stellar parallaxes can be measured accurately because 
they are > 10 mas. 

We begin by counting the number of stars for a specified set of values of
apparent magnitudes $m$ and proper motions $\mu$ in a proper-motion catalog.
We restrict the region in the $(m, \mu)$ plane over which counts are made (see
Figure 3.9) by fixing, for each $m$, a minimum proper motion $\mu_0$ chosen to
be large enough to ensure that, at the apparent magnitude specified, most
stars with $\mu > \mu_0$ do appear in the proper-motion catalogs. Representative
choices for various ranges of $m$ are as follows: $(m, \mu_0) = (5, 0)$; $(5$-$9, 0''.2)$;
$(9$-$15, 0''.5)$. Now we estimate from the catalogs the probability $P_\varpi(m, \mu)$
that a star with magnitude $m$ and proper motion $\mu$ has a measured parallax
$\varpi'$. We have

$$P_\varpi(m, \mu) \approx \frac{\text{number of stars with chosen } (m, \mu) \text{ and measured } \varpi'}{\text{number of stars with chosen } (m, \mu)}. \qquad (3.52)$$

Those stars referred to in the numerator of equation (3.52) will have some
distribution over parallax. If we can assume that all stars in the $(m, \mu)$ group
have the same distribution, then $P_\varpi^{-1}(m, \mu)$ is a valid estimate of the factor
by which any star with some measured parallax within some $(m, \mu)$ group
should be multiplied to give the total number that really have that value of
the parallax in that $(m, \mu)$ group. Thus to correct for incompleteness of the
parallax data, we count each star that has a measured parallax as $P_\varpi^{-1}(m, \mu)$
stars. By summing over all values of $\mu$ at a set of fixed values of $(m, \varpi')$, we
can then construct numerically the distribution function $G'(m, \varpi'|\mu > \mu_0)$,
which is such that


$$dN = G'(m, \varpi'|\mu > \mu_0)\,dm\,d\varpi' \qquad (3.53)$$




Box 3.1: Inverse problems 


Equation (3.54) is an integral equation of the type

$$f(x) = \int d\xi\, K(x, \xi)\, g(\xi), \qquad (1)$$


where f and g are observed and theoretical distributions, respectively, 
and K is a smooth, non-negative kernel. An equation of this type is said 
to pose an inverse problem. The key to dealing with such an equation 
is to avoid solving it, as we now explain. 

Since we expect g to be a smooth function, f should be extra smooth. 
In reality, f won’t be so smooth because noise — from small-number 
statistics and observational errors — will add jaggedness. In fact, in many 
cases f will be more jagged than any non-negative function g would be 
after convolution by the smooth kernel K. 

If you actually solve an equation such as (3.54), you find that with f 
estimated from high-quality data, g comes out implausibly bumpy, and, 
as the quality of the data deteriorates, g soon has unphysical negative 
stretches. So solving (3.54) will produce garbage unless the quality of the 
input data is high. But no matter how good the data, it is stupid to solve 
the equation, because in doing so one is failing to exploit one’s ability to 
distinguish signal from noise. It is actually an advantage that noise in f 
quickly produces an unphysical distribution g, since this implies that the 
effects of noise can be limited by exploiting one’s prior knowledge that g 
is smooth and non-negative. A powerful general method for doing this 
is described in Appendix C. 


gives the number of stars with proper motion in excess of $\mu_0$ that have $m$ in
$(m + dm, m)$ and $\varpi'$ in $(\varpi', \varpi' + d\varpi')$.
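
The incompleteness correction of equation (3.52) amounts to a per-bin weight. The sketch below assumes a hypothetical catalog layout (a list of tuples giving each star's magnitude bin, proper-motion bin and a flag for whether a parallax was measured) that is not specified in the text. It estimates $P_\varpi(m, \mu)$ in each bin and returns its inverse, the number of stars that each star with a measured parallax should be counted as.

```python
from collections import defaultdict

def parallax_weights(stars):
    """Return {(m_bin, mu_bin): 1 / P_parallax} for every bin that contains
    at least one star with a measured parallax.

    stars is an iterable of (m_bin, mu_bin, has_parallax) tuples."""
    total = defaultdict(int)
    measured = defaultdict(int)
    for m_bin, mu_bin, has_parallax in stars:
        total[(m_bin, mu_bin)] += 1
        if has_parallax:
            measured[(m_bin, mu_bin)] += 1
    # Equation (3.52): P = measured / total, so the weight is total / measured.
    return {key: total[key] / measured[key] for key in measured}

# Toy catalog: four stars in one bin (two with parallaxes), one in another.
catalog = [(9, 0.5, True), (9, 0.5, False), (9, 0.5, True),
           (9, 0.5, False), (12, 0.5, True)]
weights = parallax_weights(catalog)
corrected_total = sum(weights[(m, mu)] for m, mu, has in catalog if has)
print(weights, corrected_total)
```

Bins with no measured parallax at all receive no weight and cannot be corrected this way, which is one reason the bins must not be made too fine.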

We now consider the effects of errors in the measured parallaxes. If
the errors of measurement are random, then it is reasonable to model the
probability $P(\varpi'|\varpi)$ of measuring a parallax $\varpi'$ for a star of true parallax $\varpi$
by equation (3.47). In this model, the true distribution of stars in the $(m, \varpi)$
plane, $G(m, \varpi|\mu > \mu_0)$, is related to the observed distribution, $G'(m, \varpi'|\mu > \mu_0)$, by

$$G'(m, \varpi'|\mu > \mu_0) = \frac{1}{\sqrt{2\pi}\,\sigma_\varpi}\int d\varpi\, G(m, \varpi|\mu > \mu_0)\exp\left[-\frac{(\varpi' - \varpi)^2}{2\sigma_\varpi^2}\right]. \qquad (3.54)$$
Mathematically, this is an integral equation for G given G’, which is equal 
to G convolved with the Gaussian kernel. Equations of this type frequently 
occur in astronomy. Appendix C discusses how they should be handled. Here 
we simply assume that a method such as that described in Appendix C has
been used to estimate $G$ from $G'$.

Figure 3.10 For values of $\varpi$ and

Next, we allow for the fact that some stars within the 100 pc sphere have
proper motions below our proper-motion completeness limit $\mu_0$; these stars
are missing from the proper-motion catalogs, but they must be included in
the determination of $\Phi(M)$. Given a kinematic model for the distribution
of stellar random velocities, both in direction and in speed (see §10.5.2), we
can estimate the probability $P(\mu > \mu_0|\varpi)$ that a star of parallax $\varpi$ will have
a proper motion in excess of $\mu_0$. The quantity

$$F(m, \varpi)\,dm\,d\varpi = \frac{G(m, \varpi|\mu > \mu_0)}{P(\mu > \mu_0|\varpi)}\,dm\,d\varpi \qquad (3.55)$$


is then the total number of stars of apparent magnitude $m$ and parallax $\varpi$
within the 100 pc sphere, irrespective of proper motion. For more detail see
Trumpler & Weaver (1953), 375-377.
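
To make equation (3.55) concrete, the sketch below evaluates $P(\mu > \mu_0|\varpi)$ for a deliberately simple kinematic model. The model is an assumption introduced here for illustration and is not the one used in the text: the two tangential velocity components are taken to be independent Gaussians of dispersion `sigma_v`, the solar motion is ignored, and the default dispersion is arbitrary. The only standard ingredient is the relation $v_t = 4.74\,\mu/\varpi$ between tangential velocity (km/s), proper motion (arcsec/yr) and parallax (arcsec).

```python
import math

KMS_PER_ARCSEC_YR_PC = 4.74  # tangential velocity of 1 arcsec/yr at 1 pc, in km/s

def prob_mu_exceeds(mu0, parallax, sigma_v=30.0):
    """P(mu > mu0 | parallax) for an isotropic Gaussian tangential velocity
    field with one-dimensional dispersion sigma_v (km/s).

    mu0 is in arcsec/yr and parallax in arcsec. The tangential speed then
    follows a Rayleigh distribution, whose tail is exp(-v^2 / (2 sigma_v^2))."""
    v_min = KMS_PER_ARCSEC_YR_PC * mu0 / parallax
    return math.exp(-0.5 * (v_min / sigma_v) ** 2)

def correct_for_proper_motion_cut(G, mu0, parallax, sigma_v=30.0):
    """Equation (3.55): divide the counts above the proper-motion limit by
    the probability that a star at this parallax exceeds the limit."""
    return G / prob_mu_exceeds(mu0, parallax, sigma_v)

# The correction grows rapidly towards small parallaxes (large distances).
for parallax in (0.1, 0.05, 0.02):
    print(parallax, prob_mu_exceeds(0.2, parallax))
```

When the probability becomes very small the correction factor is large and uncertain, so in practice only the parallax range where it stays moderate is useful.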

The final step in deriving $\Phi(M)$ is to convert $F(m, \varpi)$, the distribution
over apparent magnitude $m$, into a distribution $H(M, \varpi)$ over absolute
magnitude $M$ via the relation

$$M = m + 5 + 5\log\varpi. \qquad (3.56)$$


This conversion can be done reliably between the two curves in Figure 3.10.
For each value of $M$, we then sum the tabulated values of $H(M, \varpi)$ over all
accessible values of $\varpi$ and divide the sum by the volume corresponding to
that range of $\varpi$. We then have our estimate of $\Phi(M)$, the number of stars
per unit absolute magnitude per cubic parsec in the solar neighborhood.
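
This last step can be sketched in a few lines. The code below is a simplified illustration under stated assumptions: the function names are hypothetical, the survey is taken to cover the whole sky, and the accessible parallaxes for a given $M$ are taken to be all those above a single lower limit, so that the corresponding volume is a sphere of radius $1/\varpi_{\min}$ pc.

```python
import math

def absolute_magnitude(m, parallax_arcsec):
    """Equation (3.56): M = m + 5 + 5 log10(parallax in arcsec)."""
    return m + 5.0 + 5.0 * math.log10(parallax_arcsec)

def phi_estimate(counts_by_parallax, parallax_min_arcsec):
    """Stars per unit absolute magnitude per cubic parsec.

    counts_by_parallax holds the values of H(M, parallax) for one
    unit-magnitude bin in M, for all parallax cells above
    parallax_min_arcsec; the volume is that of an all-sky sphere of radius
    1 / parallax_min_arcsec parsecs."""
    s_max = 1.0 / parallax_min_arcsec
    volume = 4.0 / 3.0 * math.pi * s_max ** 3
    return sum(counts_by_parallax) / volume

# A star of apparent magnitude 10 at 10 pc (parallax 0.1 arcsec) has M = 10.
print(absolute_magnitude(10.0, 0.1))
print(phi_estimate([2.0, 3.0], 0.01))
```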
Table 3.16 presents the general luminosity function. For each absolute
magnitude $M_V$ three numbers are given: (i) the number $\Phi(M_V)$ of stars with
magnitudes in the range $(M_V + \frac{1}{2}, M_V - \frac{1}{2})$ per $10^4\,\mathrm{pc}^3$; (ii) the luminosity
$\delta L$ emitted by these stars; (iii) the mass $\delta\mathcal{M}$ contained in these stars. Figure 3.11
plots as a function of $M_V$: (i) the cumulative luminosity density
$j_V$ from stars more luminous than $M_V$ (full curve); (ii) the cumulative mass
density $\rho$ contributed by these stars (dotted curve); and (iii) the mass-to-light
ratio $\Upsilon_V = \rho/j_V$ of the stars more luminous than $M_V$. Inspection of
Table 3.16 reveals several noteworthy features.

Table 3.16 The general luminosity function (per 10⁴ pc³)

M_V   Φ(M_V)   δL/L⊙   δℳ/ℳ⊙   |   M_V   Φ(M_V)   δL/L⊙    δℳ/ℳ⊙
 −6   0.0001     2.6    0.005   |     7      29     4.0      21.3
 −5   0.0006     5.1    0.020   |     8      33     1.8      21.8
 −4   0.0029     9.4    0.060   |     9      42     0.90     24.2
 −3   0.013     17.1    0.17    |    10      70     0.60     35.0
 −2   0.05      28.2    0.05    |    11      90     0.30     36.0
 −1   0.25      53.9    1.6     |    12     127     0.17     36.3
  0   1         95.8    4.0     |    13     102     0.055    20.8
  1   3        111      7.4     |    14     102     0.022    16.3
  2   5         64      8.7     |    15     127     0.011    16.3
  3   12        66     17.3     |    16     102     0.0035   10.5
  4   17        36     19.4     |    17      51     0.0007    4.3
  5   29        25     28.1     |    18      22     0.0001    1.6
  6   30        10     24.7     |    19      13     0.0000    0.7
                                |  Total   1008   532       356

SOURCE: For M_V < 0 from data published in Allen (1973); for M_V > 0
from data published in Jahreiss & Wielen (1983) and Kroupa, Tout &
Gilmore (1990)


1. Most stars in the solar neighborhood are intrinsically faint. The maximum
in $\Phi(M)$ occurs near $M_V = 14$. These stars are typically K and
M dwarfs.

2. Essentially all the light emitted is contributed by intrinsically luminous
stars despite the fact that they are extremely rare. The peak emission
comes from stars with $M \simeq 1$. These stars are typically A dwarfs and
K and M giants. It is worth stressing just how strong this bias toward
high luminosities really is. For example, stars at $M_V \simeq -4$, typically B
stars, emit as much light per unit volume in the V band as do the stars
at $M_V \simeq +6$, mostly G and K dwarfs, even though the latter are 10000
times as numerous and contain 400 times as much mass! The average
V-band light output per unit volume is about $0.053\,L_\odot\,\mathrm{pc}^{-3}$.

3. Most of the stellar mass density in the solar neighborhood is contributed
by the vast number of low-luminosity stars. Figure 3.11 shows clearly
that whereas nearly all the luminosity comes from stars brighter than
the Sun ($M_V = 4.83$), the mass is fairly uniformly distributed in the
magnitude range $3 < M_V < 15$. The large mass in faint stars has the
unpleasant implication that the dynamics of the Galaxy are dominated
by stars that are at best inconspicuous, while the very luminous objects
that can be readily observed in our own galaxy and in other galaxies
contribute little to the gravitational forces in the system. Therefore,
except insofar as the few high-luminosity stars can be used as tracers
of the entire population, it is dangerous to assume that the luminosity
density in galaxies is proportional to the mass density. The only safe
procedure is to infer the mass distribution by studying the gravitational
field of the whole system through the dynamics of the population of
bright stars. The total mass density from the stars in Table 3.16 is about
$0.036\,\mathcal{M}_\odot\,\mathrm{pc}^{-3}$. However, these data omit most of the white dwarfs,
which are numerous, are intrinsically very faint ($M_V \simeq +13$ to $+16$),
and yet have masses of the order of 0.5 to $1\,\mathcal{M}_\odot$. One finds (Weidemann
1990) that they contribute approximately an additional $0.003\,\mathcal{M}_\odot\,\mathrm{pc}^{-3}$
(see Tables 3.19 and 3.20). Hence the total stellar mass density is about
$0.039\,\mathcal{M}_\odot\,\mathrm{pc}^{-3}$. This number gives a feeling for the relative emptiness
of space in the Galactic disk in the solar neighborhood.

4. The average mass-to-light ratio, that is, the mass per unit volume divided
by the light emission per unit volume (in solar units), is $\Upsilon_V \simeq
0.67\,\mathcal{M}_\odot/L_\odot$. This number is useful, for, insofar as it is typical for
a galactic disk, it can be used to convert observed light distributions in
other galaxies into first estimates of stellar mass distributions, thereby
providing some input to dynamical analyses of these objects. On the
other hand, note that even for the solar neighborhood the value of $\Upsilon_V$
just quoted is surely a lower limit, because many kinds of objects (for
example, faint companions in multiple stars, dead white dwarfs and neutron
stars, stars lost in dense interstellar clouds, and so on) would evade
discovery. Cumulatively, such objects could add appreciably to the mass
density while contributing little or nothing to the light.

Figure 3.11 The full curve shows the V-band luminosity density (in $10^{-1}\,L_\odot\,\mathrm{pc}^{-3}$) in stars brighter than $M_V$. The dotted curve shows the mass density (in $10^{-1}\,\mathcal{M}_\odot\,\mathrm{pc}^{-3}$) in these stars. The dashed curve shows the corresponding values of the mass-to-light ratio $\Upsilon_V$. [From data published in Jahreiss & Wielen (1983) and after Kroupa, Tout & Gilmore (1990) from data kindly
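
The numbers quoted in the list above follow from the totals and individual rows of Table 3.16 by simple arithmetic, which the short sketch below reproduces. The variable names are illustrative; the input values are those of the table and of the white-dwarf estimate quoted in the text.

```python
# Totals from Table 3.16, per 10^4 cubic parsecs.
VOLUME_PC3 = 1.0e4
TOTAL_LIGHT_LSUN = 532.0
TOTAL_MASS_MSUN = 356.0
WHITE_DWARF_DENSITY = 0.003  # Msun per pc^3, as quoted from Weidemann (1990)

j_V = TOTAL_LIGHT_LSUN / VOLUME_PC3      # luminosity density, Lsun per pc^3
rho = TOTAL_MASS_MSUN / VOLUME_PC3       # mass density, Msun per pc^3
upsilon_V = rho / j_V                    # mass-to-light ratio, solar units
rho_total = rho + WHITE_DWARF_DENSITY    # including white dwarfs

# Rows of Table 3.16 for M_V = -4 and M_V = +6: (number, light, mass).
phi_b, light_b, mass_b = 0.0029, 9.4, 0.060
phi_gk, light_gk, mass_gk = 30.0, 10.0, 24.7
number_ratio = phi_gk / phi_b            # about 10^4
mass_ratio = mass_gk / mass_b            # about 400

print(round(j_V, 3), round(rho, 3), round(upsilon_V, 2), round(rho_total, 3))
print(round(number_ratio), round(mass_ratio), light_b, light_gk)
```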

We have mentioned that Malmquist bias causes a sample that is complete
down to some limiting apparent magnitude $V_0$ to contain a higher proportion
of luminous stars than a volume-limited sample. Table 3.17 quantifies
this phenomenon by giving the expected absolute-magnitude distributions of
100 stars in a volume-limited sample, and in two magnitude-limited samples
(see Problem 3.6 for details).

Table 3.17 Absolute-magnitude distributions of volume- and magnitude-limited samples of 100 stars

M_V          −6    −5    −4    −3    −2    −1     0     1     2     3     4     5
N_vol       0.0   0.0   0.0   0.0   0.1   0.4   1.5   4.8     7    18    25    48
N(V_0=2.5)    4     7    10    12    12     1    16    14     5     3     1     0
N(V_0=6)      2     4     6    10    12    17    19    17     6     4     2     1

Figure 3.12 The open squares show the luminosity function obtained by Reid & Gilmore (1982) from a color-selected survey. The filled triangles show the luminosity function obtained by Wielen, Jahreiss & Krüger (1983) from a proper-motion selected sample.

If any one point emerges from the preceding discussion, it is that the
behavior of $\Phi(M)$ at faint magnitudes is of great importance to Galactic
research. For this reason, tremendous effort has been lavished on its determination.
Figure 3.12 shows results from two important studies of the
faint-end luminosity function: the open points are from Reid & Gilmore
(1982), who carried out a color-selected survey, while the filled points are
from Wielen, Jahreiss & Krüger (1983), whose data were based on proper-motion
surveys. Both studies show a peak in $\Phi(M)$ at $M_V \simeq 12.5$, rather
than at $M_V = 14$ as Table 3.16 implies, and recent HST observations of faint
red stars are consistent with $\Phi(M)$ peaking at $M_V \simeq 12.5$ (Gould, Bahcall &
Flynn 1996, Santiago, Gilmore & Elson 1996). In §5.1.9 we shall see that this
peak reflects the rapid decline in the luminosity of a MS star as its mass $\mathcal{M}$
approaches the limit for hydrogen burning at $\mathcal{M} \simeq 0.08\,\mathcal{M}_\odot$. On the faint
side of this peak, the data from the color-selected survey fall off more steeply
than do those from the proper-motion surveys. Kroupa, Tout & Gilmore
(1993) argue that a combination of photometric errors and the existence of
unresolved binaries within a color-selected survey cause the resulting values
of $\Phi(M)$ to fall below the true values. Hence the full symbols in Figure 3.12
probably more accurately reflect the truth. Nevertheless, the structure of
$\Phi(M)$ for $M_V \gtrsim 12$ must be regarded as rather ill determined despite the
tremendous amount of labor that has been lavished on it.§


3.6.4 The luminosity function of a given MK spectral class 


The general luminosity function lumps together all spectral classes, and, at
a given $M$, several radically different kinds of stars may contribute to $\Phi(M)$.
It is therefore also interesting to break down the luminosity function by the
stars’ spectral classes, $S$.

To determine $\Phi(M, S)$, we proceed essentially as we did in the determination
of $\Phi(M)$, but for one spectral class at a time. The same problems
arise as before, one of the more troublesome being the scarcity of luminous
stars. Consequently, $\Phi(M, S)$ is even less well determined than the general
luminosity function $\Phi(M)$.

For each MK class $S$, $\Phi(M, S)$ peaks at some absolute magnitude $M_0(S)$
that is characteristic of the class. For some MK classes, for example those
with luminosity classes I and II, the speed with which $\Phi(M, S)$ falls as one
moves away from $M_0(S)$ is simply a measure of the sensitivity of the MK system
to luminosity. That is, $\Phi(M, S)$ falls as one moves away from $M_0(S)$ simply
because as $|M - M_0(S)|$ increases at fixed spectral type, stars are more
likely to be classified as belonging to a different luminosity class. Figure 3.6
implies that for classes $S$ with luminosity classes I or II, the full width at half
maximum of the peak in $\Phi(M, S)$ is 2-3 mag and we cannot make a more
precise statement about the structure of $\Phi(M, S)$ at this time.

The case of a MK class with luminosity class V (MS stars) is different,
for now as one moves vertically in the CM diagram from $M_0(S)$, the density
of stars falls. Thus the width of the peak in $\Phi(M, S)$ does not merely reflect
the luminosity-sensitivity of the MK system, but a real, physical paucity of
stars. This fact makes the peak in $\Phi(M, S)$ narrowest for luminosity class
V. Moreover, MS stars are numerous, so Murray et al. (1997) and Houk
et al. (1997) were able to determine the form of $\Phi(M, S)$ rather accurately
for MK classes from late AV to early KV. Figure 3.13 shows four of their
histograms that are, in effect, plots of $\Phi(M, S)$ as a function of $M$ at fixed $S$.
In each panel a dotted vertical line shows the absolute magnitude brighter
than which the sample should be volume-limited. Fainter than this line,
both dotted and full histograms can be distinguished. The dotted histogram
shows the stars actually counted, while the full histogram shows the result
of correcting these counts for Malmquist bias to the counts expected from a
volume-limited sample.

§ For an independent determination of $\Phi(M)$ at faint magnitudes, see Kirkpatrick et
al. (1994).

Figure 3.13 Distributions over luminosity of MS stars with given spectral types. The
dotted histograms show the stars actually counted, while the full histograms show these
data after correction for Malmquist bias; the dotted vertical lines show the magnitude
brighter than which the sample should be volume-limited. The curves are Gaussian fits to
the full histograms. The ordinate, $\Delta M_V$, is centered on the median absolute magnitude of
each sample. [After Houk et al. (1997)]

The histograms in Figure 3.13 have a characteristically skew shape, 
which rises more sharply at faint magnitudes than it falls at bright magni- 
tudes. At least part of this asymmetry will be due to the unresolved binary 
stars in the sample — in such a case the MK classification is liable to be 
based on the brighter star, while both binary components contribute to the 
measured magnitude of the system. 

When Gaussian distributions

$$\Phi(M, S) = \frac{\Phi_0}{\sqrt{2\pi}\,\sigma}\exp\left[-\frac{(M - M_0)^2}{2\sigma^2}\right] \qquad (3.57)$$


are fitted to histograms like those shown in Figure 3.13, the resulting dispersions
$\sigma$ lie in the range 0.3 to 0.5 mag. The stars studied by Houk et
al. were chosen because they have trigonometric parallaxes with standard
errors smaller than 10%. Since a 10% distance error gives rise to an error
of 0.2 mag in absolute magnitude, when we correct for the effect of these
distance errors, a dispersion in measured absolute magnitude of 0.4 mag corresponds
to a dispersion in intrinsic absolute magnitude of order 0.35 mag.
Table 3.18 Luminosity function of MS stars


M_V             0     1     2     3     4     5     6     7     8     9
Spectral type   B8.5  A1.5  A5.66 F1.9  F7.8  G4.3  K0.3  K3.7  K7.1  M0.3


                1     3     5     12    17    29    30    29    33    42
                0.23  0.74  19    96    19    24    24    33    17    20
                0.07  0.12  0.2   0.4   0.6   0.9   1.7   4     6     12


SOURCE: From data published in Murray et al. (1997) 


Table 3.19 Number density of stars by spectral type


                                        Spectral type
Luminosity group           O      B      A      F      G      K      M     All

Giants and supergiants                        −0.3    0.2    0.6   −0.6    0.8
Main sequence            −3.6    0.0    0.7    1.4    1.8    2.0    2.8    2.9
White dwarfs                     1.8    2.0    1.7    1.7    1.4           2.4


NOTES: Each entry is the logarithm of the number of stars brighter than
M_V = 16 in 10^4 pc^3.
SOURCE: From data published in Allen (1973) 


Consequently, in favorable cases photometric distances to MS stars can have 
standard errors down to ~ 17%. The non-Gaussian nature of the histograms 
shown in Figure 3.13 requires, however, that some caution be exercised in 
the analysis of errors in photometric distances. In particular, the long tail to 
bright magnitudes has the effect of significantly increasing the upper limits 
on photometric distances. 
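
The arithmetic behind these figures can be sketched in a few lines. The sketch below assumes, as the argument above implies, that the distance errors and the intrinsic scatter are independent and add in quadrature; the function names are ours and purely illustrative.

```python
import math

def mag_error_from_distance_error(frac_err):
    """Error in absolute magnitude produced by a fractional distance error."""
    return 5.0 * math.log10(1.0 + frac_err)

def intrinsic_dispersion(sigma_measured, sigma_distance):
    """Remove the distance-error contribution in quadrature (assumed independent)."""
    return math.sqrt(sigma_measured**2 - sigma_distance**2)

def distance_error_from_mag_error(sigma_mag):
    """Fractional distance error implied by a dispersion in absolute magnitude."""
    return 10.0 ** (sigma_mag / 5.0) - 1.0

sigma_dist = mag_error_from_distance_error(0.10)      # about 0.2 mag
sigma_int = intrinsic_dispersion(0.4, sigma_dist)     # about 0.35 mag
frac_dist = distance_error_from_mag_error(sigma_int)  # about 17 per cent

print(f"{sigma_dist:.2f} mag, {sigma_int:.2f} mag, {100 * frac_dist:.0f}%")
```

Because the histograms are skew rather than Gaussian, these numbers should be read as indicative only.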

Whereas Houk et al. (1997) determined the dependence of Φ(M, S) on M at
fixed S, Murray et al. (1997) used the same sample of stars with accurate
parallaxes to determine the luminosity function, Φ_MS, of MS stars, irrespective
of spectral type. Table 3.18 gives these values alongside the corresponding
values of the general luminosity function from Table 3.16. One sees that
at spectral types later than S = F2 (M_V ≳ 3), stars of luminosity class V
account for most of the general luminosity function. This is in accordance
with what we would expect from inspection of Figure 3.5, which shows that
the SGB joins the MS near M_V = 3.5 with the result that at fainter magnitudes
nearly all stars are of luminosity class V. Table 3.18 suggests that at
M_V ≲ 2, the majority of stars near the Sun are giants of one sort or another.

Table 3.19 gives the number densities, down to M_V = 16, of giant
stars, MS stars and white dwarfs in the solar neighborhood, irrespective of
luminosity. The total space density of white dwarfs in the solar neighborhood
can be seen to be about 30% that of MS stars, while the total density of giants 
and supergiants is about 1% that of the MS. 
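
These ratios follow directly from the logarithmic entries of Table 3.19. The short sketch below assumes that the final entry in each row of the table (0.8, 2.9 and 2.4) is the total over all spectral types; that reading is our assumption, although it reproduces the percentages quoted above.

```python
# Log10 of the total number of stars per 10^4 pc^3, read from Table 3.19.
# Assumption: the last entry of each row is the sum over spectral types.
LOG_TOTAL = {"giants": 0.8, "main_sequence": 2.9, "white_dwarfs": 2.4}

def density_ratio(log_a, log_b):
    """Ratio of two number densities given their base-10 logarithms."""
    return 10.0 ** (log_a - log_b)

wd_to_ms = density_ratio(LOG_TOTAL["white_dwarfs"], LOG_TOTAL["main_sequence"])
giant_to_ms = density_ratio(LOG_TOTAL["giants"], LOG_TOTAL["main_sequence"])

print(f"white dwarfs / MS = {wd_to_ms:.2f}")   # roughly 0.3
print(f"giants / MS       = {giant_to_ms:.3f}")  # roughly 0.01
```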

The local mass density of these stellar types is given in Table 3.20, 
which again shows how the dynamical effects arising from stars in the disk 
Table 3.20 Mass densities (M_⊙/10^3 pc^3) contributed by
various objects to the solar neighborhood


Object   Mass density     Object                   Mass density

O-B      0.9              White dwarfs             20
A        1                Cepheids                 0.001
F        3                Long-period variables    0.001
dG       4                Planetary nebulae        5 × 10^-8
dK       9                Galactic clusters        0.04
dM       25
gG       0.8              Subdwarfs                0.15
gK       0.1              RR Lyrae variables       10^-6
gM       0.01             Globular clusters        0.001


SOURCE: From data published in Allen (1973), Oort & Plaut 
(1975) and Schmidt (1965) 


are dominated by the most inconspicuous stars — the K and M dwarfs and the 
white dwarfs. For completeness, the table also lists average mass densities 
from other distinctive disk objects. The last three entries give local densities 
of members of the spheroidal component — see §10.5.2 (that from globular 
clusters being averaged over a spherical shell of the appropriate radius). Of 
these objects, only the subdwarfs have an appreciable local mass density, 
about 3% of the total. As we shall see in §10.2, the situation is entirely 
different near the Galactic center, where the mass density of the spheroidal 
component dominates that of the disk. 


3.6.5 Catalogs of the nearby stars 


It is very interesting to draw up an inventory of all known stellar objects in 
the immediate vicinity of the Sun. One chooses a small volume so that even 
inherently faint objects can be detected. Early inventories of this type were 
drawn up by Hertzsprung, Luyten and Haas, who adopted limiting distances 
of 5 pc, 10 pc and 15 pc, respectively. In 1951 Gliese began assembling a list
of all objects thought to lie within 20 pc of the Sun. In 1957 he released a list 
of 915 systems, containing 1094 components, with distances of 20 pc or less; 
this first edition of the Catalogue of Nearby Stars is usually called the first 
Gliese Catalog. Since 1957 Gliese and his colleagues at the Astronomisches 
Rechen-Institut in Heidelberg have continuously updated and extended the 
Gliese catalog by searching the literature for objects believed to lie within 
first 22 pc and most recently 25 pc. By 1993 the list included 3264 systems, 
containing 3803 components (Gliese 1969, Gliese & Jahreiss 1979, Jahreiss 
& Wielen 1997). 

Since the Gliese catalogs are based on many different astrometric and
photometric programs that have no common selection criterion, it is extremely
hard to decide down to which magnitude the catalogs are reasonably
complete, or to estimate the degree of incompleteness at faint magnitudes. 
Figure 3.14 The distribution over
absolute magnitude of stars in the 
1993 version of the Gliese cata- 
log (full histogram). The dotted 
histogram shows the distribution 
of stars that one would obtain if 
one used only trigonometric paral- 
laxes rather than a combination of 
trigonometric parallaxes and pho- 
tometric distances. [From Jahreiss 
(1993) courtesy of H. Jahreiss] 


The full histogram in Figure 3.14 gives some insight into these questions, 
however, by showing the number of stars of each absolute magnitude in the 
1993 version of the catalog. Comparison of this histogram with the luminosity
functions of Figure 3.12 makes it clear that the decline in numbers
faintward of M_V ~ 12 is due to the catalog’s incompleteness; within 25 pc
of the Sun there must be great numbers of stars with M_V ~ 15 and V < 16
that have still to be discovered.

Within 10 pc of the Sun one finds a sample of 8 white dwarfs, 2 subgiants, 
and 245 MS stars down to M_V = 18. This sample includes no O-B stars and
only 4 A stars. It contains no giants or supergiants, and at least 50% of the 
stars are members of binaries or multiple systems. 


3.7 Interstellar dust 


The space between the stars is not empty. It is filled with rarefied but 
exceedingly filthy gas; if this gas were compressed to the density of ordinary 
air (that is, by a factor of ~ 10^21) the density of smoke in it would be such
that objects would disappear into the haze at a distance of much less than a 
meter. 

Interstellar gas is so filthy because many stars are furnaces of the least 
environment-friendly type: vast quantities of hot gas stream out of every red 
giant, and smoke particles condense out of this gas as it streams away from 
the glowing surface of the star in exactly the same way that smoke particles 
form in flue gases as they come off a furnace. Some of these smoke particles 
will later gather together to form comets, asteroids and planets, but the vast 
majority hang around in interstellar space obscuring the view. For some 
reason astronomers call smoke in space, ‘interstellar dust’. Whittet (1992) 
is a useful general reference on interstellar dust. 

As we saw in Chapter 1, the realization by Trumpler in the 1920s, that 
dust prevents us seeing the Galaxy clearly, was one of the great milestones 
in astronomy. The key to his realization was the fact that dust not only 
Figure 3.15 The reflection nebula around the star ρ Ophiuchi.
[Courtesy of D. Malin]


dims the light from distant stars, but it also reddens it. Dust accomplishes 
these things in two ways. First, it scatters some photons out of the line of 
sight from a star to the Earth. Second, it absorbs other photons, converting 
their energy into heat. In regions in which bright stars lie close to dense gas 
clouds, light scattered by dust can be directly observed; clouds from which 
scattered light can be detected are called reflection nebulae. Figure 3.15 
shows an example of such a reflection nebula. Evidence that the nebula’s 
light is indeed scattered starlight is provided by its spectrum, which shows 
the characteristic spectral features of the early-type stars that illuminate the 
nebula. 

The absorption of light by dust constitutes an important energy source 
for the interstellar medium and has the overall effect of making many galaxies 
emit most of their energy in the far infrared (at λ ≃ 200 μm). Thus dust
transforms blue light into infrared light. The essential physics is as follows. 
Most of the energy radiated by stars emerges at wavelengths that can be 
absorbed by dust grains, and, moreover, young, luminous stars generally live 
out their lives in regions rich in dust — they are born in dense clouds, and in 
their brief lifetimes cannot get far away from the cloud of their birth. Hence 
a significant fraction of the energy radiated by the stars in the Galaxy is 
absorbed by dust grains. 

Dust grains are effective absorbers of photons whose wavelength is 
comparable to, or smaller than, the characteristic size a of the dust grains, 
but poor absorbers of longer-wavelength photons. Since there are a great 
many dust grains larger than about 100 nm, interstellar dust absorbs UV
radiation most effectively. But very few dust grains are tens of microns in
size, so far-infrared radiation is able to pass through quite dense interstellar
clouds. The absorption of starlight warms dust grains to ≃ 10 K. At this
temperature they radiate significantly at λ ~ 200 μm, and photons of this
wavelength can escape the Galaxy.
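
As a rough consistency check on the wavelength just quoted, one can apply Wien's displacement law. This is only a sketch: it treats the grains as perfect blackbodies, an assumption the text does not make, and real grains emit less efficiently at long wavelengths, which moves the peak of their emission to shorter wavelengths than the blackbody value.

```python
# Wien displacement constant, in micron kelvin.
WIEN_B_UM_K = 2897.8

def wien_peak_um(temperature_k):
    """Wavelength (microns) at which a blackbody's B_lambda peaks."""
    return WIEN_B_UM_K / temperature_k

for temperature in (10.0, 15.0, 20.0):
    print(f"T = {temperature:4.0f} K  ->  peak near {wien_peak_um(temperature):5.0f} microns")
```

For temperatures between 10 K and 20 K the peak lies between about 290 μm and 145 μm, bracketing the far-infrared wavelength cited above.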


3.7.1 Extinction and reddening 


Both scattering and absorption dim the light from distant stars. Astronomers 
refer to this dimming as ‘extinction’.⁷ As we saw in §2.3.3, the extinction
A_X of a star in some waveband X is defined to be the difference between the
observed X-band magnitude m(X) and the magnitude m_0(X) that would
be observed in the absence of dust:

    A_X = (m - m_0)_X.    (3.58)

The reddening or color excess E(X − Y) in some color X − Y is defined to
be the difference between the observed color m(X) − m(Y) and the intrinsic
color m_0(X) − m_0(Y):

    E(X - Y) = [m(X) - m(Y)] - [m_0(X) - m_0(Y)] = A_X - A_Y,    (3.59)


where the second equality follows from (3.58). The most often cited extinction
is A_V, and the most often cited color excess is E(B − V). Since colors are
always defined such that the shorter waveband is on the left and the strength 
of interstellar extinction generally decreases from short to long wavelengths, 
color excesses are usually positive. 

We have seen that it is sometimes possible to predict the absolute magnitude
M_X of a star from a knowledge of its MK spectral class. Fortunately,
the latter may be determined irrespective of moderate extinction, since it
follows from details of the star’s spectrum which are not significantly modified
by the presence of dust along the line of sight. Combining equations
(2.36) and (3.58), we have

    m_X = M_X + A_X + 5 \log d - 5,    (3.60)

so, if the distance d to a star of known spectral type is known, A_X follows
immediately from the measured value of m_X. Similarly, measurements of stars
of known MK class and therefore of known intrinsic colors in two wavebands
X, Y yield E(X − Y) directly from equation (3.59).

In practice, it is usually necessary to determine A_V from observations
of a star of unknown distance. In such a case, we assume that A_X tends to


⁷ Frequently the words extinction and absorption are used interchangeably, but in view
of the clear distinction between absorption and scattering, this confusion of terms is to be 
avoided. 
zero at long wavelengths as A_X = A_0 f(λ_X), where A_0 is a constant to be
determined, f is a theoretically-inspired function, and λ_X is the effective
wavelength of our waveband.⁸ Typically f(λ) ∝ λ^{-1.8} at the longest wavelengths
for which observations are available. Once a form has been adopted
for f(λ), we seek the values of the two numbers A_0 and log d that minimize

    \sum_X \left\{ m_X - \left[ M_X + A_0 f(\lambda_X) + 5 \log d - 5 \right] \right\}^2,    (3.61)



where the sum extends over several different wave-bands X. 
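
Because the quantity being minimized is quadratic in the two unknowns A_0 and log d, the minimum can be found by ordinary linear least squares. The following is a minimal sketch under stated assumptions: f(λ) is taken to be a pure power law normalized at 1 μm, and the band wavelengths and absolute magnitudes are invented for illustration only.

```python
import numpy as np

def f_lambda(wavelength_um, index=-1.8):
    """Assumed shape of the extinction law, f = (lambda / 1 micron)**index."""
    return np.asarray(wavelength_um, dtype=float) ** index

def fit_extinction_and_distance(m_obs, M_abs, wavelength_um):
    """Return (A_0, log d) minimizing sum_X {m_X - [M_X + A_0 f + 5 log d - 5]}^2."""
    m_obs = np.asarray(m_obs, dtype=float)
    M_abs = np.asarray(M_abs, dtype=float)
    f = f_lambda(wavelength_um)
    # Model: m_X - M_X + 5 = A_0 * f(lambda_X) + 5 * log d  (linear in both unknowns).
    design = np.column_stack([f, np.full_like(f, 5.0)])
    rhs = m_obs - M_abs + 5.0
    solution, *_ = np.linalg.lstsq(design, rhs, rcond=None)
    A0, log_d = solution
    return A0, log_d

# Synthetic test case with hypothetical numbers: four near-infrared bands.
bands_um = np.array([1.25, 1.65, 2.2, 3.5])
M_abs = np.array([-3.0, -3.1, -3.2, -3.2])
true_A0, true_log_d = 2.0, 3.0            # d = 1 kpc, measured in parsecs
m_obs = M_abs + true_A0 * f_lambda(bands_um) + 5.0 * true_log_d - 5.0

A0_fit, log_d_fit = fit_extinction_and_distance(m_obs, M_abs, bands_um)
print(A0_fit, 10.0 ** log_d_fit)
```

With real photometry the residuals would be weighted by the measurement errors, and at least three bands are needed for the two parameters to be constrained with any redundancy.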

The classic studies in the UBV system are those by Hiltner & Johnson 
(1956) and Johnson & Morgan (1953). Many references to more recent work 
will be found in the reviews of Savage & Mathis (1979) and Whittet (1992). 
Most studies focus on O-B stars, which are ideal for determining extinction 
and reddening because (i) they are highly luminous and hence can be seen to 
large distances, over which the column density of dust can become significant; 
(ii) they are intrinsically blue and hence strongly susceptible to reddening; 
and (iii) they have distinctive spectra that can be classified with high preci- 
sion. Figure 3.16 shows how stars of identical MK class are distributed along 
a reddening line in the two-color diagram. The bluest stars on this line 
are taken to be unreddened and these define the intrinsic two-color sequence 
given in Tables 3.10 and 3.11 and in Figure 3.7. More reddened stars move 
down the line to larger values of (U — B) and (B — V). This procedure can 
be applied to a group of stars of any spectral type to determine both the 
intrinsic color and the reddening line for the type. 

The amount of extinction and reddening along a line of sight depends on 
the density of dust along it. But in §8.1.6 we shall see that if the type of dust 


⁸ In the notation of §2.3.4, the effective frequency of a filter X when observing an object
with absolute energy distribution f_ν is defined by ν_eff ≡ ∫dν ν S_X(ν) f_ν / ∫dν S_X(ν) f_ν.
The effective wavelength is λ_eff = c/ν_eff.
were everywhere the same, the ratio of the extinctions at any two frequencies
would be the same for all lines of sight. The full curve in Figure 3.17 shows,
for a representative line of sight, the ratio A_λ/A_J of the extinction at wavelength
λ to that in the Johnson J-band. This curve is called the extinction
curve or extinction law for that line of sight. A_λ peaks in the far-UV, near
73 nm. Shorter wavelength radiation (X-rays) passes right through grains,
while much longer wavelength radiation refracts around them.

At the longest wavelengths plotted (100 μm), A_λ is decreasing with increasing
λ as ~ λ^{-1}. In most of the optical and UV regions of the spectrum,
A_λ is falling slightly more steeply with increasing λ. This steady fall is broken
by three distinct bumps: one in the UV centered on 217 nm, and two
in the IR, at 9.7 μm and 18 μm. The origin of these bumps has not been
certainly established, but it is thought that the UV bump is due to tiny
pieces of graphite; these objects probably contain only ~ 50 atoms. The IR
features are probably associated with Si-O bonds in silicate grains.

In the visual band and at shorter wavelengths, the extinction curve 
is found to vary from one line of sight to another. The dashed curve in 
Figure 3.17 indicates the sense of this variation: along some lines of sight 
the extinction does not increase into the UV as strongly as along ‘standard’ 
lines of sight. The origin of this effect is thought to be variations in the size- 
distribution of grains in different regions: the extinction rises strongly into 
the UV along lines of sight rich in very small grains, which have negligible 
effect on red and IR radiation. Some lines of sight appear to be deficient in 
such small grains and have less steeply rising extinction curves.⁹

The slope of the extinction curve near the V band is A_V/(A_J R_V), where

    R_V \equiv \frac{A_V}{A_B - A_V} = \frac{A_V}{E(B - V)}.    (3.62)


⁹ We discuss this phenomenon further in §8.1.6.
Consequently, R_V quantifies whether the extinction curve is rising steeply
into the UV or not: R_V ~ 3 for a steeply-rising extinction curve, and R_V ~ 5
for a slowly rising one. Figure 3.18 shows the distribution of R_V for 29 lines
of sight studied by Cardelli, Clayton & Mathis (1989). Smaller values of
R_V generally apply for lines of sight that do not penetrate deeply into a
molecular cloud.

The classical studies of extinction (e.g., Schultz & Wiemer 1975, Sneden
et al. 1978) derived

    R_V \simeq 3.1    (3.63)


for lines of sight that do not pass through dense clouds. Equation (3.63) 
is widely used because it allows one to estimate A_V from the much more
readily measured quantity E(B − V). Unfortunately, these studies did not
derive R_V for individual lines of sight but, for each of a series of wavebands
X, found the slope of the best fitting line to the positions of the program
stars in the (E(V − X), E(B − V)) plane. These slopes were then plotted
against λ_X^{-1} and extrapolated to λ^{-1} = 0 to yield the mean value of R_V
(since A_X tends to zero at long wavelengths, the slope E(V − X)/E(B − V)
tends to A_V/E(B − V) = R_V in this limit). This
procedure does not allow one to determine the scatter in values of R_V for
individual lines of sight and thus determine the likely error associated with
equation (3.63). The scatter would certainly be smaller than that shown in 
Figure 3.18, which is for a sample of lines of sight chosen for their anomalous 
UV extinction curves, but it is probably not negligible. 

Cardelli, Clayton & Mathis (1989) show that along any line of sight
A_λ/A_V can be well approximated by

    \frac{A_\lambda}{A_V} = a(\lambda) + \frac{b(\lambda)}{R_V},    (3.64)

where a and b are polynomials in λ^{-1}.
As Figure 3.17 illustrates, the extinction curve is universal in the R band 
and redward from it, but variable at V and blueward of V. This fact implies 
Table 3.21 The standard interstellar
extinction law


Band X    E(X−V)/E(B−V)    A_X/A_V

U              1.64          1.531
B              1.00          1.324
V              0.00          1.000
R             −0.78          0.748
I             −1.60          0.482
J             −2.22          0.282
H             −2.55          0.175
K             −2.74          0.112
L             −2.91          0.058
M             −3.02          0.023
N             −2.93          0.052


SOURCE: From data published in Rieke 
& Lebofsky (1985) 


that extinctions and reddenings should be normalized to a waveband that 
lies to the red of V, which has traditionally been used as the normalizing 
band. 

In §8.1.6 we shall see that along any line of sight E(B − V) is approximately
proportional to the column density N_H of interstellar hydrogen
atoms, irrespective of whether the hydrogen atoms are in the form of HI or
are locked up in H_2 molecules. Specifically, one has (Bohlin et al. 1978, Kent,
Dame & Fazio 1991)

    E(B - V) = \frac{N_{\rm H}}{5.8 \times 10^{25}\,{\rm m}^{-2}}.    (3.65)

E(B − V)/N_H is called the dust/gas ratio. Its constancy down different
lines of sight suggests that a fixed number and size distribution of dust
grains is associated with a given mass of interstellar hydrogen. In §9.2 we
shall find that a typical value for the number density of H near the Sun is
n_H = 10^6 m^-3, so along a line of sight of length d, the column density of H
is N_H ≈ 3.1 × 10^25 (d/kpc) m^-2. With equations (3.62), (3.63) and (3.65) it
then follows that along a typical line of sight through the disk we have

    E(B - V) \simeq 0.53 \left(\frac{d}{\rm kpc}\right) \quad\text{and}\quad A_V \simeq 1.6 \left(\frac{d}{\rm kpc}\right).    (3.66)
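
The numbers in equation (3.66) can be checked with a few lines of arithmetic. This sketch simply chains the relations quoted above; the constant and function names are ours, and the uniform density n_H = 10^6 m^-3 is the typical value cited in the text rather than a property of any particular line of sight.

```python
PC_IN_M = 3.0857e16            # one parsec in metres
N_H_PER_EBV = 5.8e25           # m^-2 per magnitude of E(B-V), equation (3.65)
R_V = 3.1                      # equation (3.63)
N_H_DENSITY = 1.0e6            # typical hydrogen number density near the Sun, m^-3

def column_density(d_kpc, n_h=N_H_DENSITY):
    """Hydrogen column density (m^-2) along a uniform line of sight of length d."""
    return n_h * d_kpc * 1.0e3 * PC_IN_M

def color_excess(d_kpc):
    """E(B-V) in magnitudes from equation (3.65)."""
    return column_density(d_kpc) / N_H_PER_EBV

def visual_extinction(d_kpc):
    """A_V in magnitudes from R_V = A_V / E(B-V)."""
    return R_V * color_excess(d_kpc)

print(f"N_H    = {column_density(1.0):.2e} m^-2 per kpc")
print(f"E(B-V) = {color_excess(1.0):.2f} mag per kpc")
print(f"A_V    = {visual_extinction(1.0):.2f} mag per kpc")
```

The product evaluates to about 1.65 mag per kpc, which equation (3.66) quotes as 1.6.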


Table 3.21 expresses the extinction law for ‘standard’ lines of sight in
terms of the standard UBVRI wavebands. From the numbers in this table
it may not be evident just how much more transparent the ISM is in the near
infrared than in the blue. So consider the case of a star that is located at
the Galactic center. For such a star we might have A_M ≃ 0.6, and therefore,
according to Table 3.21, A_B ≃ 34.5. The probability that an M-band photon
reaches us from such a star is 10^{-0.24} = 0.57, while the probability of a
B-band photon reaching us is only 10^{-13.8} = 1.6 × 10^{-14}.
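
The scaling used in this example is easily reproduced from the A_X/A_V column of Table 3.21. In the sketch below only the function names are ours; the ratios are copied from the table, and an extinction of A magnitudes is converted to a transmitted fraction 10^{-0.4A}.

```python
# A_X / A_V for selected bands, copied from Table 3.21.
A_OVER_AV = {"B": 1.324, "V": 1.000, "K": 0.112, "M": 0.023}

def extinction_in_band(a_ref, ref_band, band):
    """Scale an extinction measured in ref_band to another band."""
    return a_ref * A_OVER_AV[band] / A_OVER_AV[ref_band]

def transmission(a_mag):
    """Fraction of photons that survive an extinction of a_mag magnitudes."""
    return 10.0 ** (-0.4 * a_mag)

A_M = 0.6
A_B = extinction_in_band(A_M, "M", "B")
p_M = transmission(A_M)
p_B = transmission(A_B)

print(f"A_B = {A_B:.1f} mag")
print(f"P(M-band photon arrives) = {p_M:.2f}")
print(f"P(B-band photon arrives) = {p_B:.1e}")
```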

Extinction is caused by a combination of absorption and scattering of
radiation by dust. The fraction of extinction which is caused by scattering
is called the albedo, γ_λ. For example, suppose a star that emits luminosity
L at wavelengths in the range (λ + dλ, λ) is observed through an optically
thin cloud which generates extinction A_λ, and that the cloud as seen from
the star subtends solid angle Ω < 4π. The luminosity scattered by the cloud
is

    \gamma_\lambda \left(1 - 10^{-A_\lambda/2.5}\right) \left(\frac{\Omega}{4\pi}\right) L,    (3.67)

while the luminosity absorbed by the cloud is

    (1 - \gamma_\lambda) \left(1 - 10^{-A_\lambda/2.5}\right) \left(\frac{\Omega}{4\pi}\right) L.    (3.68)
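
A short numerical illustration of equations (3.67) and (3.68) may help to fix ideas. The sketch assumes the forms of those equations in which the extinguished fraction 1 − 10^{-A_λ/2.5} of the light intercepted by the cloud is divided between scattering and absorption in the ratio γ_λ : (1 − γ_λ); the input values are arbitrary.

```python
import math

def extinguished_fraction(a_lambda):
    """Fraction of the light crossing the cloud that is removed from the beam."""
    return 1.0 - 10.0 ** (-a_lambda / 2.5)

def scattered_luminosity(lum, a_lambda, albedo, omega):
    """Luminosity scattered by a cloud subtending solid angle omega (steradians)."""
    return albedo * extinguished_fraction(a_lambda) * (omega / (4.0 * math.pi)) * lum

def absorbed_luminosity(lum, a_lambda, albedo, omega):
    """Luminosity absorbed (and later re-radiated as heat) by the same cloud."""
    return (1.0 - albedo) * extinguished_fraction(a_lambda) * (omega / (4.0 * math.pi)) * lum

# Arbitrary illustrative values: unit luminosity, A = 0.1 mag, albedo 0.61, Omega = pi.
L_star, A_cloud, gamma, Omega = 1.0, 0.1, 0.61, math.pi
L_scat = scattered_luminosity(L_star, A_cloud, gamma, Omega)
L_abs = absorbed_luminosity(L_star, A_cloud, gamma, Omega)
print(L_scat, L_abs)
```

By construction the two contributions sum to the total luminosity removed from the beam within Ω.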

Dust grains are not expected to scatter radiation isotropically; more ra- 
diation will be scattered through small angles than large angles, so forward 
scattering predominates, especially in the case of large dust particles. In 
principle, one would like to determine the full angular dependence of scat- 
tering. In practice, the most that can be hoped for is a reliable estimate of 
the mean cosine of the scattering angle, which is generally denoted g_λ.

γ_λ and g_λ have been determined from studies of (i) reflection nebulae,
and (ii) ‘diffuse Galactic light’. It is relatively straightforward to determine 
the amount of light scattered by a reflection nebula, but very hard to de- 
termine the angle through which it is scattered because this angle depends 
sensitively on the location of the scattering dust relative to the illuminating 
star. ‘Diffuse Galactic light’ is light scattered by diffuse interstellar dust from 
the general interstellar radiation field of the Milky Way. It is of low surface 
brightness, so its intensity can be determined only by carefully correcting the 
surface brightness of an apparently blank piece of sky for the other contributions
(see §4.2). The advantage of using diffuse Galactic light to measure γ_λ
and g_λ is that the relative positions of the scattering grains and illuminating
sources are then well determined. The night-sky brightness measured at 3 AU
by the Pioneer 10 spacecraft led to estimates γ(440 nm) = 0.61 ± 0.07 and
g(440 nm) = 0.6 ± 0.2 (Witt 1988). Scattering should become more isotropic
(g_λ → 0) as λ increases, while γ_λ is believed to decrease with λ.


Reddening-free indices Thus far, we have assumed that the spectral 
types of the stars under study are known, so that we can determine color 
excesses by comparing the observed colors of each star with those of an 
unreddened star of the same (known) type. However, in many cases, we do 
not know the spectral types of the stars that we wish to study. Nevertheless, 
Table 3.22 Q versus spectral type for
early-type stars


Spectral type     Q       Spectral type     Q

O5              −0.93     B3              −0.57
O6              −0.93     B5              −0.44
O8              −0.93     B6              −0.37
O9              −0.90     B7              −0.32
B0              −0.90     B8              −0.27
B0.5            −0.85     B9              −0.13
B1              −0.78     A0               0.00
B2              −0.70


SOURCE: From data published in Johnson & 
Morgan (1953). 


given the slope of the reddening line, it is possible to define a photometric 
parameter that depends only on the spectral type of a star and is independent 
of the amount of reddening. In the UBV system, this parameter is (see
Problem 3.8)

    Q \equiv (U - B) - \frac{E(U - B)}{E(B - V)} (B - V)
      = (U - B) - 0.72 (B - V).    (3.69)


Values of Q for stars of spectral types O through A0 are listed in Table 3.22.
The importance of this parameter is that, for early-type stars, it uniquely 
determines a star’s intrinsic color from photometric data alone without the 
need for spectra. Furthermore, Q can be measured easily for stars that are 
much too faint for spectral classification. From the data given in Table 3.10 
and Table 3.22, one finds that, to a good approximation, 


    (B - V)_0 = 0.332 Q.    (3.70)


To summarize, we can determine (U − B)_0 and (B − V)_0 for early-type stars
from Q. Once (B − V)_0 is known, we find E(B − V) from (B − V), and, as
we shall see later, the visual extinction A_V (or A_X at any other waveband
X) follows from E(B − V).
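
The chain of steps just summarized can be written out as a short routine. This is a sketch only: it applies equations (3.69) and (3.70) and the standard value R_V = 3.1 of equation (3.63), it is meaningful only for stars of types O through A0, and the example star is synthetic (a B2 star with Q = −0.70 from Table 3.22, artificially reddened by E(B − V) = 0.5).

```python
REDDENING_SLOPE = 0.72   # E(U-B)/E(B-V), equation (3.69)
R_V = 3.1                # equation (3.63)

def q_index(u_b, b_v):
    """Reddening-free parameter Q of equation (3.69)."""
    return u_b - REDDENING_SLOPE * b_v

def deredden_early_type(u_b, b_v):
    """Return (Q, (B-V)_0, (U-B)_0, E(B-V), A_V) for an O to A0 star."""
    q = q_index(u_b, b_v)
    b_v0 = 0.332 * q                       # equation (3.70)
    e_bv = b_v - b_v0                      # color excess
    u_b0 = u_b - REDDENING_SLOPE * e_bv    # slide back along the reddening line
    a_v = R_V * e_bv
    return q, b_v0, u_b0, e_bv, a_v

# Synthetic B2 star: intrinsic colors from Q = -0.70, then reddened by 0.5 mag.
b_v0_true = 0.332 * (-0.70)
u_b0_true = -0.70 + REDDENING_SLOPE * b_v0_true
observed_b_v = b_v0_true + 0.5
observed_u_b = u_b0_true + REDDENING_SLOPE * 0.5

q, b_v0, u_b0, e_bv, a_v = deredden_early_type(observed_u_b, observed_b_v)
print(f"Q = {q:.2f}, E(B-V) = {e_bv:.2f}, A_V = {a_v:.2f}")
```

The recovered Q equals the unreddened value, which is the property that makes the index useful.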

For spectral types later than A0, Q is not a unique function of spectral
type [see Strand (1963), Chapter 13, for the full dependence of Q on spectral 
type], and hence it is no longer useful for estimating the amount of reddening 
present. This failure occurs because the reddening line happens to have 
almost the same slope as the unreddened MS curve for late-type stars, shown 
in Figure 3.16, which means that it is nearly impossible to tell (from UBV 
data alone) whether a given star has been reddened or is unreddened but of 
a later type. 
Finally, it is worth noting that reddening-free indices can be defined
in other photometric systems as well. For example, in the Strömgren uvby
system, we can define

    [c_1] = c_1 - 0.2 (b - y)
    [m_1] = m_1 + 0.18 (b - y)
    [u - b] = (u - b) - 1.84 (b - y)                    (3.71)
            = [c_1] + 2 [m_1],



from which intrinsic stellar properties can be inferred in certain ranges of 
spectral types. 
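
The last equality in (3.71) is an algebraic identity, as the following sketch confirms numerically. It relies on the standard definitions c_1 = (u − v) − (v − b) and m_1 = (v − b) − (b − y), which are not restated in this section; the magnitudes used are arbitrary.

```python
def stromgren_indices(u, v, b, y):
    """Standard Stromgren indices (b - y), m_1 and c_1 from uvby magnitudes."""
    b_y = b - y
    m1 = (v - b) - (b - y)
    c1 = (u - v) - (v - b)
    return b_y, m1, c1

def reddening_free_indices(u, v, b, y):
    """Bracketed indices [c_1], [m_1] and [u - b] of equation (3.71)."""
    b_y, m1, c1 = stromgren_indices(u, v, b, y)
    c1_free = c1 - 0.2 * b_y
    m1_free = m1 + 0.18 * b_y
    ub_free = (u - b) - 1.84 * b_y
    return c1_free, m1_free, ub_free

# Arbitrary magnitudes, chosen only to exercise the identity.
c1_free, m1_free, ub_free = reddening_free_indices(9.31, 8.72, 8.15, 7.80)
print(ub_free, c1_free + 2.0 * m1_free)
```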


Polarization of starlight by dust Light from obscured stars tends to 
be slightly polarized, and the degree of this polarization tends to increase 
with Ay. The origin of this phenomenon is believed to be alignment of dust 
grains by a magnetic field B that permeates interstellar space (see §8.1.5 for 
details). The alignment of dust grains with B is believed to be caused by 
the Davis—Greenstein effect, which causes spinning dust grains to align 
their spin axes with B. When a grain’s spin axis is not parallel to B, the 
direction of B relative to axes fixed in the grain changes as the grain rotates. 
Dissipative effects, such as hysteresis and eddy currents, then torque the 
grain in such a way that the grain’s spin axis moves closer to B. Since grains 
tend to spin about the axis that has the greatest moment of inertia, the 
net result of the Davis—Greenstein effect is to preferentially align the grains 
perpendicular to B. When an electromagnetic wave passes a grain, the wave’s 
electric field E will be in some general direction. Since the grain is much more 
readily polarized parallel to its long axis, the grain responds most readily to 
the component of E in this direction. Consequently, radiation scattered or 
absorbed by the grain tends to be polarized parallel to the grain’s long axis 
and hence perpendicular to B. Conversely, transmitted radiation tends to 
be polarized parallel to B — for more details see §4.4 of Whittet (1992). 

The Davis—Greenstein effect enables us to map out the direction of the 
interstellar magnetic field by measuring the polarizations of large numbers 
of stars. Axon & Ellis (1976) gathered from the literature polarization di- 
rections for 5064 stars with reliable distances and produced plots like those 
shown in Figure 3.19 of the directions of B as a function of Galactic longitude 
and latitude for several distance ranges. The more distant stars show that, 
averaged over kpc-scales, B usually points within the plane, while the po- 
larization vectors of relatively nearby stars suggest that plumes of magnetic 
field rise tens of parsecs above the plane in a way reminiscent of magnetic 
loops observed in the chromosphere of the Sun. 


3.7.2 Extinction of sight-lines out of the Galaxy 


Burstein & Heiles (1982) have mapped E(B —V) over most of the sky that 
lies outside the zone of avoidance (|b| < 10°) — see Figure 3.21. Their values 
Figure 3.19 The direction of the Galactic magnetic field as traced by the polarization
of light from stars with distances between 100 pc and 200 pc (bottom) and between 1 and
1.5 kpc (top). The length of each line is proportional to the fractional polarization of the
light from a star according to the scale indicated in the top-left corner of each panel. [After 
Axon & Ellis (1976)] 


Figure 3.20 Path length through a homogeneous layer of dust in the
Galactic disk. Perpendicular to the plane the layer has half-thickness h.


Figure 3.21 Contours of equal E(B − V) on the sky away from the Galactic plane
(|b| > 10°). The contour levels are uniformly spaced in E(B − V) from 0.025 to 0.75.
Burstein & Heiles (1982) obtained these data by combining galaxy counts with column 
densities of neutral hydrogen measured from 21-cm line profiles. [After Burstein & Heiles 
(1982) from data kindly provided by D. Burstein] 


of E(B — V) were obtained by exploiting two phenomena: (i) extinction 
makes external galaxies fainter and therefore reduces the number of galaxies 
counted in any zone to some given limiting magnitude; (ii) extinction is often 
associated with neutral hydrogen that can be detected through its 21-cm line 
(see §8.1.4). Neither of these methods by itself provides a reliable estimate 
of the extinction along a given line of sight. One problem with the first 
method is that galaxies cluster, so an enhancement or deficit in the number 
of galaxies counted may be unconnected with extinction. The second method 
has the problem that, along a given line of sight, ionized and/or molecular gas 
may make significant contributions to the total column density of interstellar 
material, and these contributions are not detectable through the 21-cm line. 
However, Burstein & Heiles (1978) have shown that these weaknesses can 
be largely overcome by combining the two approaches. Comparison of their 
values of E(B − V) with values derived from Strömgren uvby photometry of
stars indicates that their values are accurate to 0.01 mag or 10%, whichever 
is the larger. The reddening at the Galactic poles is ~ 0.01 mag, comparable 
to the overall uncertainty. 

Figure 3.21 shows that, although the extinction along lines of sight out of 
Box 3.2: Galaxy Counts


Hubble (1936) modeled the distribution of interstellar dust by a slab of
half-thickness h (see Figure 3.20). In this model, a line of sight from
the plane out of the Milky Way along latitude b passes for a distance
h csc b through the slab. Hubble assumed that A_V is proportional to this
distance [see equation (8.48)], so

    A_V = h_1 \csc b,    (1)

where h_1 is some constant. Now imagine a population of galaxies of
absolute magnitude M. All those that lie within distance d will have
apparent visual magnitudes brighter than m, where

    m = M + 5 \log d - 5 + A_V.    (2)

If these galaxies are uniformly distributed in space with number density
n, there will be (1/3) d^3 n δΩ such galaxies in a cone that subtends solid
angle δΩ. The number N(m) counted per unit solid angle to apparent
magnitude m thus satisfies

    \log N = 3 \log d + {\rm constant} = \tfrac{3}{5} (m - A_V - M) + {\rm constant}
           = 0.6 (m - A_V) + {\rm constant}.    (3)

Combining this with equation (1), we have

    \log N = 0.6 (m - h_1 \csc b) + {\rm constant}.    (4)

Thus h_1 can be deduced from the slope of a plot of the observed number
counts log N(m) against csc b. From equation (1), h_1 is the V-band
extinction at the Galactic pole.


the Milky Way generally diminishes as |b| increases, it varies very irregularly 
over the sky. Away from the direction to the Galactic center, E(B—V) tends 
to be small in the North at b > 20°, while in the South there is a great spur 
of high E(B − V), which reaches to b = −40° at l = 170°. The asymmetries
between the maps for the northern and southern hemisphere arise, at least 
in part, because the Sun lies slightly above the Galactic plane (§10.1). 
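
For comparison with this irregular picture, the idealized slab model of Box 3.2 can be exercised numerically. The sketch below generates noise-free counts from equation (4) of the box and recovers h_1 from the slope of log N against csc b; the limiting magnitude, additive constant, latitudes and value of h_1 are all hypothetical.

```python
import numpy as np

def csc_latitude(b_deg):
    """Cosecant of the Galactic latitude (absolute value, degrees)."""
    return 1.0 / np.sin(np.radians(np.abs(b_deg)))

def log_counts(m_lim, b_deg, h1, constant=0.0):
    """Equation (4) of Box 3.2: log N = 0.6 (m - h1 csc b) + constant."""
    return 0.6 * (m_lim - h1 * csc_latitude(b_deg)) + constant

def fit_polar_extinction(b_deg, log_n):
    """Recover h1 from the slope of log N against csc b (slope = -0.6 h1)."""
    slope, _intercept = np.polyfit(csc_latitude(b_deg), log_n, 1)
    return -slope / 0.6

# Hypothetical survey: counts to m = 19 at several latitudes, with h1 = 0.25 mag.
latitudes = np.array([20.0, 30.0, 40.0, 50.0, 60.0, 75.0, 90.0])
true_h1 = 0.25
log_n = log_counts(19.0, latitudes, true_h1, constant=-9.0)

h1_fit = fit_polar_extinction(latitudes, log_n)
print(f"recovered h1 = {h1_fit:.3f} mag")
```

With real counts, galaxy clustering and the patchiness of the dust would scatter the points about the fitted line, which is why counts alone do not give reliable extinctions for individual lines of sight.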


Problems 


3.1 Prove that, if the orbital planes of binaries are oriented randomly with 
respect to the plane of the sky, then the average value of sin³ i for the binaries
is ⟨sin³ i⟩ = 0.59.


3.2 Show that, if the surface of a pulsar were at the same temperature as 
the photosphere of the Sun, the pulsar would have M_V ≃ 30.
3.3 Show that a star’s effective temperature, T_eff, can, in principle, be
determined without knowing its distance by expressing T_eff as a function of
the star’s bolometric magnitude, m_bol, and its angular diameter, θ. This
technique was used by Ridgway et al. (1980) to compile the data for giants 
listed in Table 3.6. 


3.4 Evaluate the difference between the mean luminosities of (i) a magnitude- 
limited sample and (ii) a volume-limited sample of objects with luminosity 
function given by equation (3.32). Assume that the population is homoge- 
neously distributed in space. 


3.5 Show that, if the distribution of stars in space is uniform, the distri- 
bution of stars over absolute magnitude in a magnitude-limited survey is 
independent of the survey’s limiting magnitude. 


3.6 The effective volume of a survey is the ratio of the number of objects 
reached by the survey to the local space-density of the objects. Consider 
the case in which the space density of stars is proportional to exp(—|z|/h), 
where |z| is distance from the Galactic plane and h is the ‘scale-height’ of the 
density distribution (cf. §4.4.3). Assuming that we are located in the Galactic 
plane, show that the effective volume of a magnitude-limited, all-sky survey 
is then 
$$V_{\rm eff} = 4\pi h^3\left[\tfrac{1}{2}x^2 + (x + 1)e^{-x} - 1\right], \tag{3.72}$$
where $x$ is related to the absolute magnitude $M$ of the surveyed objects and
the limiting magnitude of the survey, $m_{\rm lim}$, by
$$x\,(h/{\rm pc}) = 10^{0.2(m_{\rm lim} - M) + 1}. \tag{3.73}$$
By evaluating $V_{\rm eff}$ for $h = 300\,$pc, $M = -6, -5, \ldots$ and $m_{\rm lim} = 2.5, 6$, reproduce
the numbers in Table 3.17.
Explain why Malmquist bias is more pronounced for $m_{\rm lim} = 2.5$ than
for $m_{\rm lim} = 6$.


3.7 From a study of the red clump in Baade’s window, Stanek (1996) concludes
that $A_V/E(V - I) = 2.49 \pm 0.02$. Is this datum compatible with the
numbers cited in §3.7.1?


3.8 Verify that the quantity Q defined by equation (3.69) is reddening 
independent. 


3.9 Equation (1) of Box 3.1 defines the generic inverse problem. Appendix C
describes the Richardson-Lucy algorithm for the solution of such equations.
This algorithm requires that the functions $f$, $g$ and $K$ satisfy equation (C.1).
In case $f$, $g$ and $K$ do not satisfy this equation, show that the functions
$$\tilde f(x) \equiv \frac{f(x)}{f_0}, \quad \tilde g(\xi) \equiv \frac{1}{f_0}\, g(\xi)\, k_0(\xi) \quad\text{and}\quad \tilde K(x,\xi) \equiv \frac{K(x,\xi)}{k_0(\xi)} \tag{3.74}$$
do, where
$$f_0 \equiv \int {\rm d}x\, f(x) \quad\text{and}\quad k_0(\xi) \equiv \int {\rm d}x\, K(x,\xi). \tag{3.75}$$


4 
Morphology of Galaxies 


In this chapter, we begin our study of external galaxies by discussing their 
shapes and something of what can be learned of their composition from the 
spectrum of the light they emit. In §4.1 we discuss the morphological clas- 
sification of galaxies, that is, the classification of galaxies into types, either 
on the basis of their gross appearance, or on the basis of their integrated 
spectra. The morphological type of a galaxy turns out to be correlated with 
whether that galaxy resides in a high- or a low-density region of the Uni- 
verse. In §4.1.2 we review the connection between a galaxy’s environment 
and its morphology. Galaxies vary enormously in their sizes and luminosi- 
ties. The least luminous galaxies can only be observed if they happen to 
lie very close to us. The Milky Way is the second most luminous member 
of a group of about 35 galaxies that is called the “Local Group”. Most of 
the luminosity in the Universe is believed to come from galaxies which are 
members of similar galaxy groups. Consequently, studies of the Local Group 
are important and we review them in §4.1.4. It turns out that we can put 
much of the morphological classification of galaxies on a quantitative basis by 
measuring the surface-brightness profiles of galaxies. In §4.2, the techniques 
involved in such measurements are described. In §§4.3 and 4.4 we discuss the 
brightness distributions that seem to characterize the various morphological 
classes defined in the previous section. In particular, we present empirical 
laws which approximately describe the radial brightness distributions of the 
various components of galaxies, and we explore what information can be 
Figure 4.1 An illustration of how the appearance of a galaxy can depend on how its
image is displayed. Both panels depict a CCD frame of the peculiar galaxy NGC 5128, 
which was obtained in a 10 minute exposure on a 10-inch telescope. Both are shown as 
negative images, but the adopted grayscales are very different. [From data kindly provided 
by J. Torres (http://www.dogtech.com/ccdast/)]


gleaned about the three-dimensional forms of galaxies from their observed 
surface-brightness distributions. Most galaxies are attended by an entourage 
of globular clusters. In §4.5 we discuss these populations and seek correla- 
tions between the characteristics of a given galaxy’s morphological type and 
its globular cluster population. A small fraction of all galaxies do not fit 
comfortably into the standard classification scheme introduced in §4.1. In 
§4.6 we discuss such galaxies, many of which are believed to be objects that 
have been more or less violently disturbed by an encounter with another 
galaxy. 


4.1 Morphological classification of galaxies 


When classifying galaxies on the basis of their images, one has constantly 
to bear in mind that galaxies are not hard-edged objects of the type that 
surround us in our daily lives. Indeed, galaxies fall far outside the range 
of our ordinary experience as regards both their self-illumination and their 
ghost-like insubstantiality. Figure 4.1 illustrates just how misleading pictures 
in books and journals can be. Here one sees two images of the same galaxy 
on the same scale and in the same orientation. Yet, because of the different 
depths of the two images, the appearance of the object is so different that one 
may feel a degree of disbelief that both illustrations are of the same system. 
Similar differences arise between two images in different wavebands, and 
between spectra of very different resolutions: a feature that is conspicuous 
in blue light may totally vanish at longer wavelengths, and a narrow spectral 
line that is prominent in a high-resolution spectrum may be undetectable in 
a lower-resolution spectrum. Therefore, when classifying objects one has to
try to avoid being misled by superficial appearances that are little more than 
optical illusions, and one must seek quantitative confirmation of qualitative 
impressions wherever possible. 

Imagine that you have collected a library of galaxy images in some way. 
If your collection is a large one, it will contain objects with a wide range of 
appearances. When confronted with a set of complex objects, a natural first 
step is to identify classes of similar objects. So you look carefully at each 
image and notice features that some galaxies have in common and others do 
not. Eventually, after much consideration you arrange the images in piles, 
one for each of your classes. Finally, you may decide that some classes are 
closely related to one another, while others are not. For example, the galaxies 
of classes a, b and c might each show spiral structure, while those of class e 
might not. Moreover, the objects of class b might be intermediate between 
those of classes a and c in that their spiral arms are intermediate between 
the open arms of class c objects and the tightly-wound arms characteristic 
of class a objects. 

This example illustrates several fundamental considerations. First, dif- 
ferent libraries of galaxy images are liable to lead to the identification of 
different classes. For example, if none of the images shows spiral structure, 
the final classification system is likely to focus on features that are apparent 
in spiral-free images, such as the degree of central concentration of the light, 
or ellipticity of the image. Similarly, if the library contains images of each 
object in several wavebands, the colors of an object may play an important
role in the classification scheme, whereas they clearly can play no role if
the library contains only one image per object. The classification schemes
to be described below are based on a small number of catalogs¹ for which
galaxies were selected on the basis of (i) their apparent magnitudes and (ii) 
their typical surface brightness in the B band. It is now known that these 
catalogs exclude large numbers of objects, some of which may be important 
for the structure of the Universe (Impey et al. 1996). 

The second fundamental consideration is that it makes no sense to compare,
for example, a B-band image of one galaxy with an R-band image of
another. Nor does it make sense to compare a shallow image of one galaxy with a deep image of
another. In short, it is essential that the images in one’s library be homoge- 
neous in the sense that a similar set of images is available for every object. If 
this criterion is not met, objects that are actually nearly identical are liable 
to be assigned to very different classes because, for example, one object was 
imaged in a band that enhances spiral structure and the other was imaged 
in a band that obscures it. This problem is particularly acute if one wishes 
to compare the morphology of a galaxy observed at large redshifts with that 
of a nearby galaxy: for this comparison one requires images in wavebands 


¹ Principally the Shapley-Ames Catalogue of Bright Galaxies (Sandage & Tammann
1981).


Table 4.1 General galaxy classification schemes

System            Principal criteria                         Symbols                Examples
Hubble-Sandage¹   barrishness;                               E, S0, S, SB, Irr      M87 = E1
                  openness of arms/disk-bulge ratio;         a, b, c                M31 = Sb
                  degree of resolution of arms into stars                           M101 = Sc
                                                                                    LMC = Irr I
De Vaucouleurs²   barrishness;                               E, S0, S, SA, SB, I    M87 = E1P
                  openness of arms/disk-bulge ratio;         a, b, c, d, m          M31 = SA(s)b
                  rings or s shapes                          (r), (s)               M101 = SAB(rs)cd
                                                                                    LMC = SB(s)m
Yerkes³           central condensation of light;             k, g, f, a             M87 = kE1
                  barrishness/smoothness                     E, R, D, S, B, I       M31 = kS5
                                                                                    M101 = fS1
                                                                                    LMC = afI2
DDO⁴              young-star richness of disk;               E, S0, A, S, Ir        M87 = E1
                  barrishness;                               B                      M31 = Sb I-II
                  central condensation of light;             a, b, c                M101 = Sc I
                  quality and length of arms                 I, II, ..., V          LMC = Ir III-IV

NOTES: ¹Sandage (1961-1995), ²de Vaucouleurs (1959b), ³Morgan (1958-1970), ⁴van den Bergh (1960-1976)
that are similar in the rest frames of the two galaxies. For example, an R-band
image of a galaxy at redshift z = 1 should be matched by a U-band image
of the nearby galaxy. Few such images are currently available.
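
The band matching in this example follows from the fact that light observed at wavelength λ was emitted at λ/(1 + z). A minimal Python sketch of the arithmetic is given below; the effective wavelengths in `BAND_WAVELENGTH_NM` are rounded, approximate values adopted here as assumptions, and the function names are hypothetical.

```python
# Approximate effective wavelengths (nm) of the UBVRI bands.
# These rounded values are assumptions made for illustration only.
BAND_WAVELENGTH_NM = {"U": 365.0, "B": 445.0, "V": 551.0, "R": 658.0, "I": 806.0}

def rest_frame_wavelength(observed_nm, redshift):
    """Wavelength at which the observed light was emitted."""
    return observed_nm / (1.0 + redshift)

def matching_local_band(observed_band, redshift):
    """Band whose wavelength lies closest to the rest-frame wavelength
    sampled by `observed_band` for a galaxy at the given redshift."""
    target = rest_frame_wavelength(BAND_WAVELENGTH_NM[observed_band], redshift)
    return min(BAND_WAVELENGTH_NM,
               key=lambda band: abs(BAND_WAVELENGTH_NM[band] - target))

print(matching_local_band("R", 1.0))
```

Under these assumed wavelengths, R-band light from z = 1 was emitted near 330 nm, for which the U band is the closest match among the five bands listed.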

The third fundamental consideration is that the chosen classification 
scheme is likely to reflect one’s views as to what is ‘physically important’ and 
what is ‘merely superficial.’ Since morphological classification is an early step 
taken towards understanding the objects in an image library, these prejudices 
are likely to be mere hunches; it would not be surprising if it later emerged 
that some features that were originally dismissed as ‘superficial’ are in fact 
fundamental, and vice versa. 

The fourth fundamental consideration is that the criteria used to assign 
a galaxy to a class should permit a unique classification in each case. In 
particular, if more than one criterion is employed, it is important to be clear 
in what order the criteria are to be applied lest the different criteria lead one 
to place a given galaxy into more than one class. 

Table 4.1 lists four galaxy-classification schemes that have been devised 
and more or less widely employed. These schemes have been developed 
from libraries of B- or photographic-band images. Hence colors do not play 
a defining role. The defining libraries are designed to include all galaxies 
brighter than some given apparent magnitude, or having isophotal diameters 
in excess of some given angle. We shall see in §4.1.3 that these criteria 
effectively ensure that the schemes are defined in terms of rather luminous 
galaxies, that are actually not at all typical of the generality of galaxies in 
the Universe. Most of these schemes employ more than one classification 
criterion in violation of one of the principles discussed above, with the result 
that the classification of a given galaxy is occasionally ambiguous. 

Most published galaxy classifications are the result of a subjective as- 
sessment of a picture by a human being. Do different astronomers place a 
given object in the same class when they work from identical material? Naim 
et al. (1995) investigated this question by getting six astronomers to classify 
831 galaxies. Much of the time two astronomers placed the same galaxy in 
different classes. However, in such cases the two classes were invariably very
similar ones (adjacent ones when the classes are placed in an ordered linear
sequence).² This result implies that, while the classification scheme used is
fundamentally sound, one should not take a galaxy’s exact classification too 
seriously. 


4.1.1 The Hubble sequence 


The most widely used classification scheme is that introduced by Hubble 
(1936) in his book The Realm of the Nebulae, and subsequently modified as 
his collection of large-scale plates of giant galaxies grew. Reproductions of 
many of these plates were published posthumously in 1961 by Sandage as 


² The RMS dispersion in Hubble T type defined by Table 4.2 was typically 1.8.

Figure 4.2 The Hubble tuning-fork diagram. This figure differs
from Hubble’s original diagram in that it shows stages of lenticular 
galaxies interposed between the ellipticals and the spirals. 


The Hubble Atlas of Galaxies, alongside an introduction by Sandage which 
is generally regarded as the definitive exposition of Hubble’s scheme. In 
1994 the Hubble Atlas was superseded by the much more extensive Carnegie 
Atlas of Galaxies (Sandage & Bedke 1994). Hubble classifications for all 
the galaxies in the Shapley-Ames Catalogue of Bright Galaxies are given 
by Sandage & Tammann (1981). Figure 4.2 illustrates the scheme in its 
final form. Hubble suggested that galaxies evolved from the left-hand end of 
this sequence to the right. This now discredited speculation lives on in the 
convention that galaxies that lie towards the left-hand end of the sequence 
are referred to as early-type galaxies, while those towards the right-hand 
end are referred to as late-type galaxies. 

On the left of Hubble’s tuning-fork diagram are placed those galaxies 
which appear smooth and structureless. These elliptical galaxies vary in 
shape from round to fairly highly elongated in form. A galaxy of this type is 
designated En, where the number n describes the apparent axial ratio (b/a)
by the formula $n = 10[1 - (b/a)]$. Thus, a galaxy which appears round on
the sky is designated E0, and one whose major axis is twice as long as its
minor axis is an E5 galaxy. Figure 4.3 shows examples of each type of
elliptical galaxy, ranging from the round E0 galaxy NGC 3379 through the
highly elongated (E6) system NGC 3377; few, if any, ellipticals are more
elongated than E6.
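
The designation can be computed directly from a measured axial ratio. The short Python sketch below applies the formula n = 10[1 − (b/a)]; rounding n to the nearest integer is an assumption made here for convenience, and the function name is hypothetical.

```python
def elliptical_type(axis_ratio):
    """Return the designation En for an apparent axial ratio b/a.

    Rounding n to the nearest integer is an assumed convention.
    """
    if not 0.0 < axis_ratio <= 1.0:
        raise ValueError("b/a must satisfy 0 < b/a <= 1")
    n = 10.0 * (1.0 - axis_ratio)
    return f"E{round(n)}"

for ratio in (1.0, 0.5, 0.4):
    print(ratio, elliptical_type(ratio))
```

Because n depends only on the apparent axial ratio, the same galaxy would receive a different designation if viewed from another direction.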

The galaxies shown in Figure 4.3 are all highly luminous objects. Most 
elliptical galaxies are much less luminous, and are designated dE for dwarf 
elliptical galaxy. Three well-studied dE galaxies in the Local Group are 
NGC 147 (dE3), NGC 205 (dE5) and M32 (dE2). Figure 4.4 shows a photo- 
graph of NGC 147. Notice the low surface-brightness envelope of the system 
against which many foreground stars are conspicuous. The Local Group also 
contains a large number of dwarf spheroidal galaxies (dSph). In these 
systems the star density is everywhere so low that on images obtained with 
even the largest ground-based telescopes, they appear as mere clusterings of 
faint (but intrinsically moderately luminous) stars. No smooth background 
Figure 4.3 Four elliptical galaxies. From top left to bottom right NGC 3379 (E0), NGC
4261 (E3), NGC 4486 = M87 (E0), and NGC 3377 (E6). [Photographs from the Carnegie
Atlas courtesy of A. Sandage] 


of light produced by less luminous stars is visible. The Sculptor system,
found by Shapley (1938), was the first dSph galaxy to be discovered, but
more of these systems are discovered each decade. Ground-based telescopes
cannot detect dSph galaxies that lie significantly outside the Local Group. 
In §4.3.4 we shall see that these systems may differ fundamentally from clas- 
sical E galaxies but form the low-luminosity tail of a sequence whose brighter 
members include dE galaxies such as NGC 205 but not M32. 

Globular clusters are not normally accorded the dignity of being termed 
elliptical galaxies, being considered mere appendages of more massive sys- 
tems like our Galaxy. But from a structural point of view, it is sometimes 
useful to regard them as a species of extremely low-mass, elliptical galaxy. 
Indeed, one finds that the stellar content of globular clusters is frequently 
quite similar to what one might expect to find in a very low-mass elliptical 
galaxy, and their density profiles prove to be intermediate in type between 
those of giant ellipticals and of dwarf spheroidals. Many of the globular clus- 
ters that are located far out in the galactic halo, for example the Palomar 
Figure 4.4 The dwarf elliptical galaxy NGC 147. [Photograph from
the Carnegie Atlas courtesy of A. Sandage]


clusters (Abell 1955), probably provide a link between the classical clusters, 
which tend to be tightly bound to giant galaxies, and the more massive 
but less centrally concentrated dwarf spheroidals. If one extends Hubble’s 
original elliptical class in this way, one finds that elliptical galaxies span an 
enormous range (at least six decades) of intrinsic brightness (see §4.1.3). 

After the elliptical galaxies, Hubble’s diagram bifurcates into two branches:
the ‘normal’ and the ‘barred’ galaxies. The word ‘normal’ should not
mislead one into supposing these galaxies are more common than the barred 
species; the two types actually occur with similar frequencies. The reader 
should also beware of imagining that the division of galaxies into normal 
and barred type is clear and unambiguous. Most galaxies show some of the 
characteristics of barred galaxies, and it is only the more extreme examples 
that are classified as barred systems. Furthermore, barlike characteristics 
may be lost on small-scale plates, especially of nearly edge-on systems, so 
that misclassification in this respect is quite common. 

In the middle of Hubble’s tuning-fork diagram, at the junction of the el- 
liptical and the spiral galaxies, comes a class of galaxies known as lenticular 
galaxies. These galaxies are designated as type S0 or type SB0 according
to whether or not they are barred. The S0 galaxies are characterized by a
smooth central brightness condensation (the bulge or spheroidal component) 
similar to an elliptical galaxy, surrounded by a large region of less steeply 
declining brightness. This latter component, which is generally rather struc- 
tureless (although it may sometimes contain some dust), is believed to be 
intrinsically rather flat. Indeed, when systems of this type are seen edge-on, 
one sees clearly that they are very flat in their outermost parts. Between 
the inner, elliptical-like bulge region and the outer (envelope) parts, many
lenticular galaxies possess smooth subsidiary features called ‘lenses’ (see below).
The family of S0 galaxies is subdivided into three classes, S0₁, S0₂, and
S0₃ galaxies, according to the strength of dust absorption within their disks.
The S0₁ galaxies do not show any signs of absorption by dust, whereas the
S0₃ galaxies have a complete dark band of dust absorption running within
their disks around their elliptical-like components (see Figure 4.5). The
Figure 4.5 The normal lenticular galaxy NGC 4976 (S0₁) at top left and the barred
lenticular galaxy NGC 4371 (SB0) at bottom left. The top right panel shows a deep
image of the normal lenticular galaxy NGC 6861 (S0₃). In the lower right panel a
much shallower photograph shows the dust lane near the center of this same galaxy. 
[Photographs from the Carnegie Atlas courtesy of A. Sandage] 


strength of dust absorption in S0₂ galaxies is intermediate between these
two extremes. The barred lenticular galaxies are also divided into three
types, SB0₁, SB0₂, and SB0₃, but here the division is made according to
the prominence of the bar rather than the presence of dust-absorption lanes.
In SB0₁ galaxies, the bar shows only as two broad regions of slightly enhanced
brightness on either side of the central bulge. In SB0₃ galaxies, the
bar is narrow and well defined and extends completely across the lens. SB0₂
galaxies have bars of intermediate prominence.

In the Hubble sequence, the lenticulars are followed by the spiral galax- 
ies. A normal spiral galaxy comprises a central brightness condensation, 
which resembles an elliptical, located at the center of a thin disk containing 
more or less conspicuous spirals of enhanced luminosity, the spiral arms. 
A barred spiral has, interior to the spiral arms, a bar, often containing dark 
lanes believed to be produced by absorption of light by dust. The spiral arms 
of barred spirals generally emanate from the ends of the bar. 

Within each class of spiral, normal, or barred, a sequence of subtypes is 
identified by division according to a combination of three criteria (see Figures 
4.6 and 4.7): (1) the relative importance of the central luminous bulge and 
Figure 4.6 Four normal spiral galaxies: at top left NGC 2811 (Sa); at top right NGC
2841 (Sb); at lower left NGC 628 (Sc); at lower right NGC 7793 (Sd). [Photographs from 
the Carnegie Atlas courtesy of A. Sandage] 


the outlying disk in producing the overall light distribution of the galaxy; (2) 
the tightness with which the spiral arms are wound; and (3) the degree to 
which the spiral arms are resolved into stars and individual emission nebulae 
(HII regions). Properly speaking, this superposition of criteria is highly
unsatisfactory, but it turns out that there is sufficient correlation among these
three parameters (in the sense that galaxies with conspicuous central bulges
tend to have tightly wound spiral arms that are not highly resolved into
stars) that experienced observers independently classifying objects usually
agree rather closely in their class assignments. Early-type spirals (that is,
those more to the left in the Hubble diagram; see Figure 4.2) are those 
having conspicuous bulges and tightly wound, smooth arms. These spirals 
are designated Sa or SBa according to whether they are barred (SBa) or 
not (Sa).³ Late-type spirals, having small central brightness condensations
(bulges) and loosely wound, highly resolved arms, are designated Sc and 
SBc. In between are Sb and SBb galaxies. Intermediate stages of central 
condensation and tightness of winding of the arms are designated Sab, Sbc, 
and so on. 


³ Transition cases between lenticulars and spirals are designated S0/a.

Figure 4.7 Four barred spiral galaxies: top left NGC 1291 (SBa); top right NGC 1300
(SBb); lower left NGC 7741 (SBc); lower right the LMC (SBm). [Photographs from the 
Carnegie Atlas courtesy of A. Sandage] 


Table 4.2 Hubble stage T 


Hubble           E    E-S0  S0    S0/a  Sa   Sa-b  Sb   Sb-c  Sc    Sc-Irr  Irr I
de Vaucouleurs   E    S0⁻   S0°   S0⁺   Sa   Sab   Sb   Sbc   Scd   Sdm     Im
T                -5   -3    -2    0     1    2     3    4     6     8       10


Unsymmetrical galaxies were assigned by Hubble to two classes of ir- 
regular galaxies. Irr I galaxies are objects that lack symmetry or well 
defined spiral arms and display bright knots that contain O and B stars. 
Irr II galaxies are unsymmetrical objects that have rather smooth images. 
They frequently display dust lanes. 

Later workers have generally considered Hubble’s original scheme satis- 
factory in regard to the ellipticals, but many have said that Hubble’s classifi- 
cation of the spirals is incomplete and that his treatment of irregular galaxies 
— of which more will be said presently — was quite inadequate. In particular, 
de Vaucouleurs (1959b) argued that Hubble’s division of the spirals does not 
do justice to the great variety of morphologies of real galaxies, and proposed 
a more elaborate classification. Most importantly, de Vaucouleurs extended 
Figure 4.8 Images of galaxies containing: a) a nuclear ring (ESO 565-11); b) an inner
ring (IC 5240); c) an outer ring (NGC 1543); and d) all three ring types (NGC 6782). 
Note that the images are displayed at different contrast levels to show the various features. 
[After Buta & Combes (1996) from data kindly provided by R. Buta] 


the prongs of Hubble’s tuning fork with additional classes Sd, Sm and Im. 
The Sd class overlaps Hubble’s Sc class to some extent, but it also contains 
some more extreme objects which were classified as Irr I galaxies in Hubble’s 
scheme. The Sm and Im classes contain the remaining galaxies of Hubble’s 
Irr I class — the ‘m’ stands for ‘Magellanic’ because the Large Magellanic 
Cloud (LMC) is classified SBm. Only very irregular and loose objects like 
the Small Magellanic Cloud (SMC) and IC 1613 are placed in the Im stage 
by de Vaucouleurs. 

De Vaucouleurs introduced the notation SA for an unbarred galaxy to 
match Hubble’s designation SB for barred systems; an SAB galaxy is weakly 
barred. In this system a lenticular galaxy is designated S0 only if one cannot
tell whether it has a bar or not, for example because it is seen edge-on.
De Vaucouleurs and subsequently Kormendy (1979) have emphasized the 
importance for morphological studies of the rings and lenses that often occur 
in disk galaxies, especially barred ones. Figure 4.8 shows some examples of 
these phenomena. In a barred galaxy a ring is classified as a nuclear ring, 
an inner ring, or an outer ring according to the ratio of its radius r to the 
length a of the bar’s semimajor axis: a nuclear ring has r/a < 1, an inner 
ring has r/a ~ 1 and an outer ring has r/a > 1. Thus inner rings are touched 
by the ends of the bar, while nuclear and outer rings are, respectively, smaller 
and larger than the bar. A lens is an elliptical (but not bar-shaped) feature 
that has a very shallow brightness gradient interior to a sharp outer edge, 
beyond which the brightness gradient is steep. Thus a lens may be thought 
of as a filled ring. Lenses frequently contain a bar, each end of which touches 
the lens’ perimeter. De Vaucouleurs uses the symbols (r) and (s) to indicate 
systems with and without rings. 
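
The ring nomenclature amounts to a comparison of the ring radius r with the bar's semimajor axis a. The Python sketch below encodes that comparison. Since the text specifies only r/a ~ 1 for an inner ring, the parameter `tolerance`, which sets how close to unity the ratio must be, is a hypothetical choice and not part of the classification scheme.

```python
def classify_ring(ring_radius, bar_semimajor_axis, tolerance=0.2):
    """Classify a ring in a barred galaxy from the ratio r/a.

    `tolerance` is an assumed half-width for the condition r/a ~ 1;
    the classification scheme itself does not specify a value.
    """
    ratio = ring_radius / bar_semimajor_axis
    if abs(ratio - 1.0) <= tolerance:
        return "inner"
    return "nuclear" if ratio < 1.0 else "outer"

for r in (0.2, 1.0, 2.5):
    print(r, classify_ring(r, 1.0))
```

In practice the assignment is made by eye from images such as those in Figure 4.8, so a fixed numerical threshold is only a convenient stand-in.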

Table 4.2 indicates the correspondence between the Hubble and de Vau- 
couleurs schemes, and also defines the numerical Hubble stage T that is 
useful in some quantitative studies. 
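
As an example of such a quantitative use, the Python sketch below stores the correspondence of Table 4.2 as a lookup table and computes the RMS difference in T between two sets of classifications. The two lists of classifications are invented for illustration, and the plain-text keys `"S0-"`, `"S00"` and `"S0+"` are an assumed spelling of the three lenticular stages.

```python
import math

# Hubble stage T for the de Vaucouleurs classes of Table 4.2.
HUBBLE_STAGE_T = {
    "E": -5, "S0-": -3, "S00": -2, "S0+": 0, "Sa": 1, "Sab": 2,
    "Sb": 3, "Sbc": 4, "Scd": 6, "Sdm": 8, "Im": 10,
}

def rms_stage_difference(types_a, types_b):
    """RMS difference in T between two classifications of the same galaxies."""
    diffs = [HUBBLE_STAGE_T[a] - HUBBLE_STAGE_T[b] for a, b in zip(types_a, types_b)]
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))

# Invented classifications of four galaxies by two hypothetical observers.
classifier_1 = ["Sa", "Sb", "Sbc", "E"]
classifier_2 = ["Sab", "Sb", "Scd", "E"]
rms = rms_stage_difference(classifier_1, classifier_2)
print(round(rms, 2))
```

Note that T is not evenly spaced along the sequence (no class is assigned T = 5, 7 or 9 in Table 4.2), so differences in T are only a rough measure of morphological distance.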

Elmegreen & Elmegreen (1982) divided galaxies into 12 arm classes 
according to the morphology of their spiral arms. Galaxies in arm class 1 
have chaotic, fragmented, unsymmetrical arms, while galaxies in arm class 12 
have two long symmetric arms that dominate the optical disk. These classes 
correlate with luminosity and, in fact, a simpler predecessor of this scheme 
formed the basis for a now obsolete distance indicator (van den Bergh 1960). 


4.1.2 Effects of environment 


The wide range of galaxy morphologies raises the issue of what causes this 
diversity. One clue to finding an answer to this question has come from 
studies of the environments in which galaxies of different types are found. In 
particular, as soon as reliable observations of galaxy types were made, it was 
recognized that galaxies in clusters are much more likely to be ellipticals or 
S0s than are those in the field (e.g. Hubble & Humason 1931). This finding
suggests that environmental factors play an important role in determining 
the morphology of a galaxy. 

Not all clusters are the same, however: Oemler (1974) found that as 
many as 40% of the galaxies in some clusters are elliptical, while in others 
the proportion of ellipticals is more like 15%. This elliptical fraction, f(E),
was found to correlate with the morphology of the cluster, in that a cluster 
with a large value of f(E) tends to have a regular, symmetric appearance, 
often with a giant cD galaxy (see §4.3.1) at its center, while a cluster with a 
small value of f(E) generally has a ratty appearance. 

Oemler’s study also provided the first quantitative evidence that the 
balance of morphological types varies within individual clusters. The projected
number density of galaxies in a centrally-concentrated cluster decreases
monotonically with radius R, but Oemler discovered that the projected
density of spirals actually increases with R in the inner parts of such
a cluster. Thus, the fraction of spirals, f(Sp), in these systems increases
with R from very small values at the centers. This result is particularly 
striking because we are only measuring the projected density of galaxies, 
that is, the average density along lines of sight through the cluster. When 
proper allowance is made for this fact, the observations are consistent with 
there being essentially no spiral galaxies in the cores of regular clusters; all 
of those that are seen at small projected radii probably lie at large dis- 
tances from the cluster’s center. This discovery provided the first example 
of a morphology-radius relation in clusters. Similarly, for several clusters 
Melnick & Sargent (1977) compared the ratio of the density of spiral galaxies
to that of S0 galaxies, and found that the S0s become increasingly dominant
at small radii. Studies at other wavelengths show similar trends: radio stud- 
ies of HI in spiral galaxies (see §8.2.1) have found that those few spirals that 
are seen projected close to the center of a cluster contain significantly less HI 
than do spirals at larger R, and the average HI deficiency is found to vary 
monotonically with R (Magri et al. 1988, and references therein).

The spatial segregation of galaxy types in clusters should give rise to 
measurable differences between the kinematics of galaxies of different types. 
Indeed, the reason spiral galaxies lie further from the centers of clusters than 
do ellipticals must be that spirals follow more energetic orbits through the 
cluster. Hence, at a given distance from the center of a cluster, spiral galax- 
ies should have larger random velocities than elliptical galaxies. Figure 4.9 
suggests that this effect is evident in observed galaxy velocities. However, 
some caution is appropriate in the interpretation of these data since one 
would expect the velocity dispersion within cluster galaxies to decrease with 
projected radius R. Hence the mean random velocity of all the spirals in a 
cluster should be smaller than the mean random velocity of all the cluster’s
ellipticals. In observed samples this effect is likely to be masked by the in- 
clusion within the spiral sample of many galaxies that lie either in front of or 
behind the cluster and are not physically associated with it: the measured 
‘random velocities’ of these foreground and background galaxies will reflect 
the Hubble flow rather than the velocity dispersion of the cluster. 

The first large-scale study of morphological segregation was made by
Dressler (1980), who obtained the morphological types of ~6000 galaxies in
55 clusters. Dressler’s analysis confirmed that f(Sp) increases with R (see
the left panel of Figure 4.10). In this larger data set, subtler effects are also
evident. For example, Figure 4.10 indicates that the fraction of S0s, f(S0),
falls near the center, and detailed analysis of these data shows that f(S0)
decreases from 54 ± 7% at 0.25 Mpc to 30 ± 5% at 0.075 Mpc (Whitmore
& Gilmore 1991). However, Dressler concluded that correlations involving
R are not the fundamental ones, and that galaxy type is really dictated by 
the local density of galaxies. That is, Dressler argued that the fundamen- 
tal relations are morphology—density relations between the local galaxy 
density and f(Sp) and f(S0). The existence of such relations is illustrated 
in the right panel of Figure 4.10, which f(Sp) and f(SO) as a function of the 
projected number density of galaxies, N. 

It is extremely difficult, if not impossible, to ascertain which of the 
relations shown in Figure 4.10 is the more fundamental. The distribution of 
galaxies in clusters is such that N is a monotonic function of R: Beers & 
Tonry (1986) have shown that $N \propto R^{-1}$ over a large range of radii in many
clusters, while Merrifield & Kent (1989) found that at small radii the relation
is better described by an exponential, $N \propto \exp(-R/100\,{\rm kpc})$. This close
relation between N and R implies that if morphology is correlated with R, it 
is necessarily also correlated with N, and vice-versa. Without invoking some 
underlying physical mechanism, it is thus essentially impossible to decide 
which correlation is the more fundamental. 

In fact, the true picture is a little more complicated. Although the den- 
sity of galaxies does broadly drop with radius in clusters, these systems also 
contain significant substructure in the form of local enhancements in the 
projected density of galaxies (see Figure 4.11). A simple argument shows
that the existence of such substructure is a prerequisite for a physically
meaningful morphology–density relation. In the course of its orbit, any given
galaxy will travel a large distance through its cluster, and will normally pass 
through regions of very different galaxy density. It is thus somewhat surpris- 
ing that the morphology, which was presumably decided some time ago, is 
strongly correlated with the current surrounding density. The only way that 
the local density can dictate morphology is if each galaxy is accompanied 
by a large chunk of its local environment on its orbit. In other words, the 
cluster must contain a number of fairly coherent subgroups — there must be 
substructure. 

Several studies have sought to establish the role that substructure plays 
in dictating the morphologies of galaxies. Sanroma & Salvador-Solé (1990) 
have carried out a simple test which seems to show that substructure cannot
be the driving mechanism. They took the data from Dressler’s study,
and produced a new data set by randomizing the positions of each cluster’s
galaxies in azimuth whilst preserving their radial positions in the cluster. 
Thus, any small-scale substructure will be wiped out, while the large-scale 
radial variations in cluster properties are preserved. Repeating Dressler’s 
analysis on the “shuffled” clusters, they found that the morphology–density
relation was essentially unaltered, and hence that this correlation cannot be 
primarily caused by small-scale substructure. 
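
The logic of this shuffling test can be sketched in a few lines. What follows is only a minimal illustration, not the code used by the original authors: the function names are hypothetical, positions are assumed to be projected coordinates measured from the cluster center, and the local density is estimated from the circle enclosing the k nearest neighbors, with k = 10 adopted here as an assumption.

```python
import math
import random

def shuffle_azimuth(positions, rng):
    """Assign each galaxy a random azimuth about the centre (0, 0), keeping its radius."""
    shuffled = []
    for x, y in positions:
        r = math.hypot(x, y)                      # projected radius is preserved
        phi = rng.uniform(0.0, 2.0 * math.pi)     # azimuth is randomized
        shuffled.append((r * math.cos(phi), r * math.sin(phi)))
    return shuffled

def local_density(positions, k=10):
    """Projected density at each galaxy from the circle enclosing its k nearest neighbours."""
    densities = []
    for i, (xi, yi) in enumerate(positions):
        dists = sorted(
            math.hypot(xi - xj, yi - yj)
            for j, (xj, yj) in enumerate(positions)
            if j != i
        )
        densities.append(k / (math.pi * dists[k - 1] ** 2))
    return densities
```

Comparing the fraction of each morphological type as a function of `local_density` before and after `shuffle_azimuth` then indicates how much of the relation survives once small-scale clumping has been erased.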

Additional support for this view has come from a further analysis of 
Dressler’s data by Whitmore, Gilmore & Jones (1993). They considered 
only the data from the innermost parts of clusters, and looked to see how the 
fraction of ellipticals from these comparable radii varied with the density of 
their surroundings. This study found that ellipticals always make up ~ 50% 
of the central population, irrespective of the density of galaxies around them. 
It thus appears that a galaxy’s radius within a cluster is the primary factor 
that dictates its morphology. 

The issue is not entirely clear-cut, however. The fact that Oemler (1974)
found that poorer clusters contain lower global fractions of ellipticals implies
that some factor other than radius dictates the mix of galaxy types.
Postman & Geller (1984) took this analysis further by studying the
morphology–density relation for even poorer groups of galaxies. This study showed
that groups seem to obey the same relation as clusters: the inner parts of
groups have comparable densities to the outer parts of clusters, and the
distributions among the different morphological types in these two environments
are indistinguishable. In this case, therefore, the principal correlation
seems to be the morphology–density relation. Combining the results from
the groups and clusters, Postman & Geller were able to define a single,
apparently universal, morphology–density relation which is obeyed over six
orders of magnitude in density.

Even within rich clusters, where the morphology–radius relation appears
to be more fundamental, there are phenomena that can only be attributed 
to local causes. Whitmore, Gilmore & Jones (1993) have found that where 
a galaxy has a nearby companion, it has a disproportionate probability of 
being elliptical. In the outer parts of clusters the elliptical fraction is only 
~ 15%, but close pairs of galaxies, separated by projected distances of less 
than ~ 0.05 Mpc, are ~ 55% elliptical even in the outer parts of clusters. 
This is clearly a very localized phenomenon, which falls under the category
of a morphology–density relation.

It is apparent, and perhaps not surprising, that the morphology of a 
galaxy is dictated by a combination of factors, some of which relate to its 
immediate surroundings, while others reflect its broader environment. Try- 
ing to capture the complete physics of the issue in anything as crude as 
a morphology–radius or morphology–density relation is never going to do
justice to this complex process, although it clearly provides a partial picture.


4.1.3 The galaxy luminosity function


Just as the distribution of stellar luminosities reflects the physics of star 
formation and stellar structure, we might hope to learn about galactic evo- 
lutionary processes by studying the distribution of galaxy luminosities. This 
distribution is quantified in an analogous manner to that of stars (§3.6), via 
the galaxy luminosity function, $\Phi(M)$. The quantity $\Phi(M)\,dM$ is
proportional to the number of galaxies that have absolute magnitudes in the
range $(M, M + dM)$. Conventionally, this function is normalized by setting

$$\int_{-\infty}^{\infty} \Phi(M)\,dM = \nu, \tag{4.1}$$

where $\nu$ is the total number of galaxies per unit volume, so that $\Phi(M)\,dM$
specifies the number density of galaxies in the magnitude range $(M, M + dM)$.


The field galaxy luminosity function In order to determine the generic 
behavior of the galaxy luminosity function, we need to look at the luminosi- 
ties of galaxies in some unexceptional region of the Universe. The classical 
procedure for estimating $\Phi(M)$ for such “field” galaxies follows a similar
series of steps to that involved in calculating the stellar luminosity function
(§3.6). In its simplest form, it involves measuring the apparent magnitudes 
of all the galaxies in some representative sample. The individual brightnesses 
are converted to absolute magnitudes by estimating the galaxies’ distances, 
usually by applying the Hubble law to their observed redshifts (see §7.1). 
If the redshifts are substantial, then it will also be necessary to apply a K- 
correction to the magnitude, so that magnitudes calculated for galaxies at 
different redshifts are all converted to a common waveband (see §2.3.3). Fi- 
nally, one divides the number of galaxies in each absolute magnitude range 
(M,M + dM) by the volume of space that has been surveyed, V, to con- 
vert to the requisite galaxy number density. It should be recalled that V is 
likely to vary with M: since more luminous objects can be seen to greater 
distances, V is greater for intrinsically brighter galaxies in a survey that is 
complete down to some fixed limiting apparent magnitude. 
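
The steps of the classical procedure can be sketched as follows. This is a minimal illustration under stated assumptions rather than a recipe taken from any particular survey: the function names are hypothetical, distances come from the low-redshift Hubble law with H0 = 100h km/s/Mpc, space is treated as Euclidean, and the K-correction is simply passed in as a number.

```python
import math

C_KM_S = 299792.458  # speed of light in km/s

def absolute_magnitude(m, z, h=1.0, k_corr=0.0):
    """M = m - 5 log10(d / 10 pc) - K, with d = cz/H0 from the Hubble law."""
    d_mpc = C_KM_S * z / (100.0 * h)
    return m - 5.0 * math.log10(d_mpc * 1.0e6 / 10.0) - k_corr

def survey_volume(M, m_lim, solid_angle_sr):
    """Volume (Mpc^3) within which a galaxy of absolute magnitude M is brighter than m_lim."""
    d_max_mpc = 10.0 ** (0.2 * (m_lim - M) + 1.0) / 1.0e6
    return solid_angle_sr / 3.0 * d_max_mpc ** 3

def luminosity_function(abs_mags, m_lim, solid_angle_sr, bin_width=0.5):
    """Return {bin centre: galaxies per Mpc^3 per magnitude} for a magnitude-limited sample."""
    counts = {}
    for M in abs_mags:
        centre = (math.floor(M / bin_width) + 0.5) * bin_width
        counts[centre] = counts.get(centre, 0) + 1
    return {
        c: n / (survey_volume(c, m_lim, solid_angle_sr) * bin_width)
        for c, n in sorted(counts.items())
    }
```

Because `survey_volume` grows as $10^{0.6(m_{\rm lim} - M)}$, a handful of intrinsically faint galaxies in the sample can correspond to a much larger space density than a greater number of bright ones.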

In practice, this simple approach has a variety of shortcomings. First, 
corrections must be made for the Malmquist-type biases that hamper the 
analysis of magnitude limited surveys. As we saw in §3.6.1, basic properties 
of the luminosity function will be distorted whenever the function has a finite 
spread in luminosity. In fact, such distortions occur even if galaxies all have 
intrinsically identical luminosities, but a range of estimated absolute magni- 
tudes due to errors in their adopted distances. Estimating the distances to 
galaxies via the Hubble law is an intrinsically approximate process, so errors 
of this kind are unavoidable. The problem is particularly acute for nearby 
galaxies, since the random motions of such systems can exceed their Hubble 
velocities (see §7.4.2), so the errors in their distance determinations can be 
very large. This shortcoming is unfortunate, since low-luminosity galaxies
can only be observed if they are nearby, so the faint end of the field galaxy
luminosity function is rather poorly determined. 
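
To see how quickly this problem grows at small distances, the following sketch computes the error made when the Hubble law is applied to a galaxy that has a line-of-sight peculiar velocity. The function name is hypothetical, and the 300 km/s used in the usage note is an illustrative assumption rather than a measured value.

```python
import math

def hubble_distance_bias(d_true_mpc, v_pec_km_s, h=1.0):
    """Return (fractional distance error, M_est - M_true) for a Hubble-law distance."""
    H0 = 100.0 * h                               # km/s/Mpc
    v_obs = H0 * d_true_mpc + v_pec_km_s         # observed recession velocity
    if v_obs <= 0.0:
        raise ValueError("non-positive velocity: the Hubble law yields no distance")
    d_est = v_obs / H0
    frac_err = (d_est - d_true_mpc) / d_true_mpc
    mag_err = -5.0 * math.log10(d_est / d_true_mpc)
    return frac_err, mag_err
```

With an assumed peculiar velocity of 300 km/s and h = 1, a galaxy at 100 Mpc has its distance overestimated by 3%, whereas one at 3 Mpc is placed at twice its true distance and its absolute magnitude is in error by about 1.5 magnitudes.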

A further complication to this analysis arises from the assumption im- 
plicit in the classical method that the galaxies in any absolute magnitude 
range (M,M+dM) are uniformly distributed through the surveyed volume 
V(M). We now know that this assumption is unfounded. Nearby in the 
Universe, there are large enhancements in galaxy density in some regions, 
such as the Virgo cluster. Even on larger scales, galaxies are arranged in 
large wall-like structures, with intervening voids which are almost empty of 
galaxies. Thus, for a given absolute magnitude M, the number of galaxies 
found within the volume V(M) can depend as much on the spatial distribution
of galaxies in the Universe as it does on $\Phi(M)$. To obtain an unbiased
estimate for $\Phi(M)$, we must remove this spatial dependence. A variety of
techniques for circumventing this problem are described by Binggeli, Sandage 
& Tammann (1988) and Efstathiou, Ellis & Peterson (1988).4 

When these effects are allowed for in the analysis of galaxy surveys, the 
field galaxy luminosity function is found to take a reasonably simple form. 
The number of galaxies drops monotonically with increasing luminosity: at 
faint absolute magnitudes, $\Phi(M)$ decreases approximately exponentially with
$|M|$, but brighter than some characteristic absolute magnitude, $M^*$, $\Phi(M)$
cuts off very sharply. A convenient parameterization which reproduces these
properties is the Schechter function,

$$\Phi(M) = (0.4 \ln 10)\,\Phi^*\,10^{0.4(\alpha+1)(M^*-M)} \exp\left(-10^{0.4(M^*-M)}\right), \tag{4.2}$$

where $\Phi^*$, $M^*$ and $\alpha$ are chosen to fit the observations. This rather contrived-looking
formula takes a simpler form if one works in luminosities rather than
magnitudes: defining $\Phi(L)\,dL$ to be the number density of galaxies with
luminosities in the range $(L, L + dL)$, the corresponding function is

$$\Phi(L) = (\Phi^*/L^*)(L/L^*)^{\alpha} \exp(-L/L^*), \tag{4.3}$$

where $L^*$ is the luminosity corresponding to an absolute magnitude of $M^*$
(Schechter 1976). Thus, $\alpha$ sets the slope of the luminosity function at the
faint end, $L^*$ or $M^*$ gives the characteristic luminosity above which the
number of galaxies falls sharply, and $\Phi^*$ sets the over-all normalization of
galaxy density. This formula was initially motivated by a simple model of 
galaxy formation (Press & Schechter 1974), but it has proved to have a wider 
range of application than originally envisaged. 
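
As a check that equations (4.2) and (4.3) describe the same distribution, the two forms can be evaluated side by side. The sketch below uses hypothetical function names; it relies only on $L/L^* = 10^{0.4(M^*-M)}$, so that $\Phi(M) = (0.4 \ln 10)\,L\,\Phi(L)$.

```python
import math

def schechter_mag(M, phi_star, M_star, alpha):
    """Equation (4.2): galaxies per unit volume per magnitude."""
    x = 10.0 ** (0.4 * (M_star - M))             # x = L / L*
    return 0.4 * math.log(10.0) * phi_star * x ** (alpha + 1.0) * math.exp(-x)

def schechter_lum(L, phi_star, L_star, alpha):
    """Equation (4.3): galaxies per unit volume per unit luminosity."""
    x = L / L_star
    return (phi_star / L_star) * x ** alpha * math.exp(-x)
```

For any magnitude M, multiplying `schechter_lum` at the corresponding luminosity by $(0.4 \ln 10)\,L$ should reproduce `schechter_mag`, whatever values are adopted for the three parameters.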

The Schechter function provides a reasonable fit to the luminosity func- 
tions derived from a number of galaxy redshift surveys. Efstathiou, El- 
lis & Peterson (1988) analyzed the data from several such studies, and 


4 With a spatially-varying density of galaxies, v in the normalization of equation (4.1) 
is taken to be the mean density of the volume surveyed.

found that almost all could be fitted by a Schechter function with
$\Phi^* = (1.6 \pm 0.3) \times 10^{-2}\,h^3\,{\rm Mpc}^{-3}$, $M_B^* = -19.7 \pm 0.1 + 5\log h$ and $\alpha = -1.07 \pm 0.07$
as the parameters of the model.5 The photometric data in these various surveys
were obtained through different optical bandpasses, so assumptions had
to be made about the colors of the galaxies in order to infer the common
B-band luminosity function whose parameters are listed above. The value
of $M_B^*$ corresponds to a luminosity of $L_B^* = (1.2 \pm 0.1)\,h^{-2} \times 10^{10}\,L_\odot$.
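
The conversion from the characteristic magnitude to the characteristic luminosity can be verified directly. The sketch below assumes a solar absolute magnitude in the B band of 5.48; that value and the function name are assumptions introduced here, not quantities quoted in this section.

```python
import math

M_SUN_B = 5.48  # assumed absolute B magnitude of the Sun

def luminosity_in_solar_units(M_at_h1, h=1.0, M_sun=M_SUN_B):
    """Luminosity (solar units) for absolute magnitude M_at_h1 + 5 log10(h)."""
    M = M_at_h1 + 5.0 * math.log10(h)
    return 10.0 ** (0.4 * (M_sun - M))
```

Evaluating `luminosity_in_solar_units(-19.7)` gives a value close to $1.2 \times 10^{10}$ solar luminosities, and halving h increases the result by a factor of four, as the $h^{-2}$ scaling requires.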

One drawback to studying the luminosity function using blue light is 
that, as we have seen in §3.6, the young stars that produce most of the 
blue light are a small minority of the stellar population of the Milky Way, 
and hence, presumably, other galaxies. Thus, the luminosity function of 
galaxies in blue light may not be very representative of the average stellar 
properties of these systems. More recent studies have therefore been made 
of the galaxy luminosity function at infrared wavelengths, where the light 
is dominated by more “typical” stars. The K-band luminosity function is 
also found to be well-represented by the Schechter function, with best-fit 
parameters of $\Phi^* = (1.6 \pm 0.2) \times 10^{-2}\,h^3\,{\rm Mpc}^{-3}$, $M_K^* = -23.1 \pm 0.2 + 5\log h$
and $\alpha = -0.9 \pm 0.2$ (Gardner et al. 1997). Apart from the difference between
$M_B^*$ and $M_K^*$ (which can largely be attributed to the intrinsic $B - K$ colors
of typical galaxies), the parameters of the best-fit Schechter functions in the 
optical and infrared are indistinguishable. It would therefore appear that 
the optical data are reasonably representative of the galaxies’ over-all stellar 
populations. 

Since $\alpha$ seems to lie close to $-1$, there is a serious danger that integration
of equation (4.3) over luminosity will imply that there are infinite numbers 
of low-luminosity galaxies. This divergence is not as implausible as it might 
seem: the total luminosity density contributed by the Schechter function is 


$$l_{\rm tot} = \int_0^\infty L\,\Phi(L)\,dL = \Phi^* L^*\,\Gamma(2 + \alpha), \tag{4.4}$$


where $\Gamma$ is the gamma function. For $\alpha = -1$, we obtain $l_{\rm tot} = \Phi^* L^*$. Thus,
although the Schechter function predicts huge numbers of low luminosity 
galaxies, the integrated contribution from all galaxies to the total luminosity 
density of the Universe remains finite, as the observed darkness of the night 
sky requires. In reality, the divergent form of the galaxy luminosity function 
probably breaks down at some point. However, as we shall see in §4.1.4, in 
the one place where we can readily detect low-luminosity galaxies, we do, 
indeed, find that they exist in large numbers. 
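
The contrast between the divergent number of galaxies and the finite luminosity density is easy to demonstrate numerically. The following is a sketch with hypothetical function names: the luminosity density uses the closed form of equation (4.4), while the number density brighter than some minimum luminosity is obtained by a simple midpoint-rule integration of equation (4.3) in $\ln(L/L^*)$, truncated at $50L^*$, where the integrand is negligible.

```python
import math

def luminosity_density(phi_star, L_star, alpha):
    """Equation (4.4); finite provided alpha > -2."""
    return phi_star * L_star * math.gamma(2.0 + alpha)

def number_density_brighter(phi_star, alpha, x_min, x_max=50.0, steps=20000):
    """Number density of galaxies with L > x_min * L*, from equation (4.3)."""
    lo, hi = math.log(x_min), math.log(x_max)
    du = (hi - lo) / steps
    total = 0.0
    for i in range(steps):
        x = math.exp(lo + (i + 0.5) * du)         # x = L / L*
        total += x ** (alpha + 1.0) * math.exp(-x) * du
    return phi_star * total
```

For $\alpha = -1$, each factor of 1000 by which the lower luminosity limit is reduced adds roughly $\Phi^* \ln 1000 \approx 6.9\,\Phi^*$ to the number density, so the count grows without bound, yet `luminosity_density` returns exactly $\Phi^* L^*$.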

With larger, deeper surveys, the limitations of the simple Schechter 
function start to become apparent. Figure 4.12 shows the luminosity function 


5 The quantity h parameterizes our ignorance as to the exact conversion from redshift 
to distance [see equation (7.3)]. It almost certainly has a value somewhere between 0.5
and 1.

Figure 4.12 Luminosity function as derived from the Las Campanas
Redshift Survey, together with the Schechter function that
best fits the data at points brighter than $M = -17.5$ (with parameters
$\alpha = -0.70 \pm 0.05$ and $M_R^* = -20.29 \pm 0.02 + 5\log h$). The
component luminosity functions of emission-line and non-emission- 
line galaxies in the survey are also shown. [After Lin et al. (1996) 
from data kindly provided by H. Lin] 


derived from the Las Campanas Redshift Survey of some 19000 galaxies. 
Although fitting a Schechter function to the bright end of this luminosity 
function reproduces the observations very well, it does not correctly predict 
the number of faint galaxies. 


The cluster galaxy luminosity function Clusters of galaxies provide 
a very different environment for galaxy formation and evolution. It is there- 
fore interesting to see whether the same luminosity function is found in rich 
clusters as is found in the field. In some ways, the luminosity function of a 
cluster is easier to obtain than that of the field: cluster members lie within a 
small region of the sky, so their photometry can be obtained efficiently with 
few images. In addition, members are all at essentially the same distance, 
so one only needs to obtain the single distance to the cluster in order to cal- 
culate the galaxies’ absolute magnitudes. The only real problem in studying 
cluster luminosity functions is that rich clusters are quite rare, so they are 
typically at large distances, making it hard to obtain photometry for their 
fainter members.

Figure 4.13 Number of galaxies as a function of absolute magnitude [$\propto \Phi(M)$] found in
the central regions of the Virgo, Centaurus and Fornax clusters [From data published in 
Jerjen & Tammann (1997)] 


Studies of some of the nearer clusters have revealed that their luminos- 
ity functions are fairly similar to that of the field, and are reasonably well 
represented by Schechter functions. However, the parameters of the best 
fit do seem to differ from those of the field. Trivially, $\Phi^*$ will be larger
due to the higher density of galaxies in a cluster, but other differences are 
also apparent. In a study of nine moderately-rich clusters, Lugger (1986) 
found that on average these systems have Schechter function parameters of 
$M_B^* = -19.5 \pm 0.1 + 5\log h$ and $\alpha = -1.27 \pm 0.04$. Thus, although the cutoff
magnitude is indistinguishable from that of the field galaxy luminosity
function, the slope of the faint end of the luminosity function is significantly 
steeper. 

The galaxy luminosity function does not seem to vary greatly from clus- 
ter to cluster: Lugger (1986) found all the clusters in her sample to have a lu- 
minosity function that is reasonably well reproduced by the average Schechter 
function described above. However, she also noted that there is structure ap- 
parent in the luminosity functions of some clusters that is inconsistent with 
the smooth Schechter function, and that the form of this structure varies 
between clusters. Some departures from the Schechter function do occur in 
more than one cluster, though: quite frequently, there seems to be a dip 
in the luminosity function at $M_B \sim -16 + 5\log h$, and an excess of galaxies
at the bright end of the function. In fact, the existence of an excess of
bright galaxies in clusters was recognized in the original paper by Schechter
(1976), where he pointed out that cD galaxies (see §4.3.1), with typical luminosities
of $10L^*$, cannot fit satisfactorily within the simple form of equation
(4.3). These localized features complicate attempts to resolve the question 
of whether all clusters share similar over-all luminosity functions, since the 
derived values of the Schechter function parameters can be grossly distorted 
if the magnitude range over which the fit is made happens to include such a 
small-scale distortion.

The same story also seems to hold when we look at poorer clusters.
Figure 4.13 shows the luminosity functions of three closer, poorer systems. 
Since these clusters are too close for reliable distances to be obtained from 
the Hubble law (see §7.4.2), the calibration on to an absolute magnitude 
scale is not directly comparable to the above analyses. However, it is clear 
from Figure 4.13 that the over-all picture remains the same: the luminosity 
function declines steadily with luminosity before cutting off rather sharply 
at a maximum luminosity. There are also clear signs in these data of sub- 
structure in the luminosity function, which varies from cluster to cluster, and 
which cannot be described using the Schechter function. 


The luminosity function divided by morphological type. A clue 
to understanding the complexity of the luminosity function is offered by 
Figure 4.12. If field galaxies are divided into two sub-groups, depending on 
whether or not their spectra contain significant emission lines, one finds that 
the luminosity functions of the two sub-groups differ significantly. Since 
emission lines are associated with cool gas, which is only found in large 
quantities in spiral and irregular galaxies, these sub-groups essentially divide 
the late-type (Sa–Irr) from the early-type (E–S0) galaxies. Given the very
different appearances of these sub-classes of galaxies (see §4.1.1), one might 
reasonably suppose that their members have followed disparate evolutionary 
paths, and so the difference in their luminosity functions is not unexpected. 
If anything, it is surprising that their luminosity functions are as similar as 
they appear: fitting Schechter functions to the individual subgroups, one 
finds values of $M^*$ that differ by only a few tenths of a magnitude. Whether
this similarity tells us something about the common ancestry of these objects, 
or whether it is simply a coincidence, remains to be seen. 

As Figure 4.14 shows, if one sub-divides the morphological types of 
galaxy more finely, one finds that different classes of galaxy have very differ- 
ent luminosity functions. In their decomposition of the luminosity function 
into its constituent parts, Jerjen & Tammann (1997) have found that spiral
galaxies (Sa–Sc) scatter around a well-defined mean luminosity, and
their luminosity function can be adequately described by a Gaussian with
a mean magnitude of $M_B^{\rm Sp} = -16.8 + 5\log h$ and a dispersion of $\sigma_B^{\rm Sp} = 1.4$
magnitudes. S0 galaxies are found to follow a similar distribution, but are
marginally brighter, with $M_B^{\rm S0} = -17.5 + 5\log h$ and $\sigma_B^{\rm S0} = 1.1$ magnitudes.
Ellipticals show a slightly more complex distribution: although they
are also distributed about a fairly well-defined characteristic luminosity of
$M_B^{\rm E} = -16.9 + 5\log h$, their distribution is skewed toward an excess of bright
galaxies. This distribution can be characterized by a modified Gaussian distribution,

$$\Phi^{\rm E}(M_B) \propto \exp\left(-[M_B - M_B^{\rm E}]^2 / [2\{\sigma_B^{\rm E}(M_B)\}^2]\right), \tag{4.5}$$

where $\sigma_B^{\rm E}(M_B) = 2.2$ if $M_B < M_B^{\rm E}$, or 1.3 if $M_B > M_B^{\rm E}$. Irregular galaxies
are, on average, much fainter. In fact, as one looks to fainter magnitudes,

Figure 4.14 Luminosity functions for galaxies of various morphological
types. The top panel shows the separate functions at arbitrary
normalization, while the lower panels show approximately
how these components combine to produce the total luminosity 
function in the field and in clusters. 


their numbers continue to grow, with no clear signs of a lower limit to their lu- 
minosity function. The faint nature of these systems makes their luminosity 
function hard to characterize, but it can be crudely described by a Schechter 
function with parameters $M_B^*({\rm Irr}) = -15 + 5\log h$ and $\alpha({\rm Irr}) = -0.3$.
The dwarf elliptical galaxies show a similar open-ended luminosity function,
which can be fitted by a Schechter function with $M_B^*({\rm dE}) = -16 + 5\log h$
and $\alpha({\rm dE}) = -1.3$.
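
The way in which these type-specific functions combine into a total luminosity function can be sketched as follows. The shape parameters are those quoted above, with magnitudes understood as $M_B - 5\log h$; the function names and, in particular, the relative weights passed to `total_lf` are hypothetical, since the amplitudes depend on environment and are not given numerically here.

```python
import math

def gaussian_shape(M, mean, sigma):
    """Gaussian luminosity function in magnitudes, with unit peak height."""
    return math.exp(-((M - mean) ** 2) / (2.0 * sigma ** 2))

def elliptical_shape(M, mean=-16.9, sigma_bright=2.2, sigma_faint=1.3):
    """Equation (4.5): broader on the bright (more negative M) side of the mean."""
    sigma = sigma_bright if M < mean else sigma_faint
    return gaussian_shape(M, mean, sigma)

def schechter_shape(M, M_star, alpha):
    """Schechter function in magnitudes, without its normalization."""
    x = 10.0 ** (0.4 * (M_star - M))
    return x ** (alpha + 1.0) * math.exp(-x)

SHAPES = {
    "Sp": lambda M: gaussian_shape(M, -16.8, 1.4),
    "S0": lambda M: gaussian_shape(M, -17.5, 1.1),
    "E": elliptical_shape,
    "Irr": lambda M: schechter_shape(M, -15.0, -0.3),
    "dE": lambda M: schechter_shape(M, -16.0, -1.3),
}

def total_lf(M, weights):
    """Weighted sum of the component shapes; weights are relative amplitudes."""
    return sum(w * SHAPES[kind](M) for kind, w in weights.items())
```

Raising the weight given to "dE" relative to "Sp" steepens the faint end of `total_lf`, which is the qualitative difference between cluster and field populations.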

The disparity in the luminosity functions of galaxies of different types 
goes a long way toward explaining the variations in the total luminosity 
function described above. As we saw in §4.1.2, the mix of galaxy types
varies with the density of the surroundings. As Figure 4.14 shows, by adding 
together the mixtures of galaxies appropriate for field and cluster popula- 
tions, the differences in their luminosity functions become self-explanatory. 
The high-luminosity cutoffs in these different environments occur at approx- 
imately the same magnitude because spiral galaxies and elliptical galaxies 
have comparable characteristic luminosities. The steeper slope at the faint 
end of the cluster luminosity function arises from the substantial population
of dwarf ellipticals in these systems. We can even explain much of the
small-scale structure, such as the dip in the cluster luminosity function at
$M_B \sim -16$, which arises at the point where domination of the luminosity
function transfers from dwarf to giant galaxies. 

Although it is now clear that there is no “universal” total luminosity 
function, it has been argued that the shapes of the luminosity functions of 
the individual morphological types may be invariant, with just their am- 
plitudes changing with environment (Binggeli, Sandage & Tammann 1988). 
To some extent, the truth of this argument must depend on the definitions 
of morphological types adopted. For example, cD galaxies are almost al- 
ways found in regions of high galaxy density. If these galaxies are lumped 
in with the ellipticals, then the elliptical galaxy luminosity function varies 
with galaxy density and is not universal. If there do exist distinct cate- 
gories of galaxies whose luminosity functions have universal shapes, then the 
forms of these functions may contain clues as to the histories of these classes
of galaxy. Further, a characteristic feature in such an invariant luminosity 
function might provide a suitable “standard candle” for estimating the dis- 
tances to quite remote clusters (see §7.3.2). The existence (or non-existence) 
of universal luminosity functions for particular morphological types therefore 
remains an important issue in extragalactic astronomy. 


4.1.4 The Local Group 


As we have seen above, the number density of galaxies increases as one con- 
siders fainter systems. Thus, the large spiral and elliptical galaxies, which 
have traditionally caught the eye of astronomers, do not by any means rep- 
resent a fair sample of the galaxies that populate the Universe. Further, 
we have seen in §4.1.2 that the mix of galaxies of different types depends 
quite strongly on environment. Therefore, if we are to ask questions about 
the “typical” properties of a population of galaxies, we must avoid looking 
in atypical regions such as rich clusters. Fortunately, the Milky Way is lo- 
cated in a remarkably dull corner of the Universe, so that we can assess a 
typical galaxy population simply by looking at our neighbors. By looking so 
nearby, we can also readily observe the large numbers of faint galaxies that 
are all but undetectable at greater distances.

The loose collection of galaxies to which the Milky Way belongs is known 
as the Local Group. The criteria for membership of the Local Group are 
somewhat nebulous. One could simply consider all the galaxies within a dis- 
tance of, say, 1.5 Mpc to be members, but such a choice affords the Milky 
Way the unwarranted distinction of being at the group’s center. Alterna- 
tively, one could examine the radial velocities of nearby galaxies to determine 
which objects are gravitationally bound to the same system of galaxies as 
the Milky Way. Unfortunately, this criterion for membership is not entirely 
clear-cut either, since the uncertainties in the total mass of the Local Group 
and our ignorance of candidates’ transverse velocities mean that we cannot
be sure which galaxies are free to depart. We also run into the problem that
the Local Group is almost certainly gravitationally bound to other nearby 
groups, so we have to consider whether these other systems are themselves 
simply sub-units of the Local Group. Finally, it should be borne in mind that 
gravitational interactions between galaxies will occasionally cause a galaxy 
to be flung out of the Local Group, or to be captured from a neighboring 
group, so membership is not an entirely static property. 

Despite the absence of a clear-cut criterion, there is general agreement 
that the 35 galaxies listed in Table 4.3 belong to the Local Group. The 
brightest member of the group is the Andromeda Galaxy, M31, with the 
Milky Way and M33 making up the complement of spirals. As befits its 
poor environment, there are no particularly bright ellipticals in the group, 
but dwarf ellipticals and dwarf spheroidals make up more than half the total 
membership. The remaining galaxies, including our near neighbors the Large 
and Small Magellanic Clouds, are irregulars. 

Figure 4.15 shows the untidy appearance of the Local Group. The sys- 
tem lacks any real symmetry, but rather divides into two main groupings 
centered on the dominant spiral galaxies, M31 and the Milky Way. There 
also seems to be a third sub-group loosely clustered around NGC 3109. Since 
the galaxies in each of these sub-groups tend to have similar line-of-sight ve- 
locities, it would appear that they represent physical associations rather than 
transient conjunctions. Although a lot of Local Group galaxies are associ- 
ated with these three sub-groups, some members seem to follow more solitary 
orbits. 

It is very unlikely that Table 4.3 provides a complete list of members 
of the Local Group. Figure 4.15 shows that very few faint members have 
been identified close to the Galactic plane, particularly in the direction of 
the Galactic Center. This deficit arises because dust obscures our view in 
these directions, coupled with the difficulty in identifying diffuse low surface- 
brightness objects against the bright foreground starlight of the Galactic 
plane. It was on account of this latter difficulty that the Milky Way’s clos- 
est neighbor, the Sagittarius Dwarf, was only discovered in 1994 (Ibata, 
Gilmore & Irwin 1995). Although this system is relatively bright for a dwarf 
spheroidal, it eluded previous detection because it happens to lie almost di- 
rectly behind the Galactic center. Its discovery was actually hampered by 
its proximity, since the large area of the sky over which its emission is spread 
(~ 8° x 2°) made it extremely difficult to pick out against the bright Galac- 
tic foreground light. Even away from the Galactic plane, new faint members 
of the Local Group continue to be identified: Whiting, Irwin & Hau (1997) 
obtained CCD observations of a faint smudge on sky survey plates in the con- 
stellation of Antlia, which confirmed that it was another dwarf spheroidal 
member of the Local Group. Thus, we still do not know the full complement
of fainter Local Group galaxies; all we can say with reasonable confidence
is that more than 90% of the galaxies in the Local Group have absolute
magnitudes fainter than $M_V \sim -18$.



Table 4.3 Local Group members

Name         Alternate RA (1950)  Dec (1950)  Type      Distance   M_V
             name      (h m)      (° ′)                    (kpc)
M31          NGC 224   00 40.0    +40 59      Sb             725   −21.1
Milky Way    Galaxy    17 42.4    −28 55      Sbc              8   −20.6
M33          NGC 598   01 31.1    +30 24      Sc             795   −18.9
LMC                    05 24.0    −69 48      Irr             49   −18.1
IC 10                  00 17.7    +59 01      Irr           1250   −17.6
NGC 6822     DDO 209   19 42.1    −14 56      Irr            540   −16.4
M32          NGC 221   00 40.0    +40 36      dE2            725   −16.4
NGC 205                00 37.6    +41 25      dE5            725   −16.3
SMC                    00 51.0    −73 06      Irr             58   −16.2
NGC 3109     DDO 236   10 00.8    −25 55      Irr           1260   −15.8
NGC 185                00 36.2    +48 04      dE3            620   −15.3
IC 1613      DDO 8     01 02.2    +01 51      Irr            765   −14.9
NGC 147      DDO 3     00 30.5    +48 14      dE4            589   −14.8
Sextans A    DDO 75    10 08.6    −04 28      Irr           1450   −14.4
Sextans B    DDO 70    09 57.4    +05 34      Irr           1300   −14.3
WLM          DDO 221   23 59.4    −15 45      Irr            940   −14.0
Sagittarius            18 51.9    −30 30      dSph/E7         24   −14.0
Fornax                 02 37.8    −34 44      dSph/E3        131   −13.0
Pegasus      DDO 216   23 26.1    +14 28      Irr            759   −12.7
Leo I        DDO 74    10 05.8    +12 33      dSph/E3        270   −12.0
Leo A        DDO 69    09 56.5    +30 59      Irr            692   −11.7
And II                 01 13.5    +33 09      dSph/E3        587   −11.7
And I                  00 43.0    +37 44      dSph/E0        790   −11.7
SagDIG                 19 27.9    −17 47      Irr           1150   −11.0
Antlia                 10 01.8    −27 05      dSph/E3       1150   −10.7
Sculptor               00 57.6    −33 58      dSph/E3         78   −10.7
And III                00 32.6    +36 12      dSph/E6        790   −10.2
Leo II       DDO 93    11 10.8    +22 26      dSph/E0        230   −10.2
Sextans                10 10.6    −01 24      dSph/E4         90   −10.0
Phoenix                01 49.0    −44 42      Irr            390    −9.9
LGS 3                  01 01.2    +21 37      Irr            760    −9.7
Tucana                 22 38.5    −64 41      dSph/E5        900    −9.6
Carina                 06 40.4    −50 55      dSph/E4         87    −9.2
Ursa Minor   DDO 199   15 08.2    +67 23      dSph/E5         69    −8.9
Draco        DDO 208   17 19.2    +57 58      dSph/E3         76    −8.6

SOURCE: From data kindly provided by M. Irwin.

Figure 4.15 Galactocentric coordinates of Local Group galaxies.
The Milky Way lies at the origin, with the Galactic plane at z = 0,
and the vector from the Sun to the Galactic center parallel to the
x-axis. Galaxies at positive Galactic latitudes are shown as filled
points, while those at negative latitudes are shown as open symbols. 
The projection on to the z = 0 plane is also indicated. The size of 
the symbol increases with the luminosity of the galaxy. 


4.2 Surface Photometry of Galaxies 


An essential step towards putting the morphology of galaxies on a quantitative
basis is the measurement of galaxies’ surface-brightness distributions.⁶
This is a surprisingly difficult undertaking for two reasons. First, the sky
is never completely dark. Second, the atmosphere is not completely transparent.
In the next section we discuss the way in which these phenomena
limit our knowledge of the brightness distributions of galaxies. Then we describe
the characteristic brightness distributions of galaxies of each Hubble
type. More details of the techniques of surface photometry may be found
in de Vaucouleurs (1984), Djorgovski & Dickinson (1989) or Morrison et al.
(1997).


6 For a quantitative classification of the Revised Shapley-Ames galaxies (Sandage &
Tammann 1981) see Michard & Marchal (1994).
4.2.1 The night sky


In §2.3.6 we saw that the surface brightness of a given point in a galaxy
is independent of the galaxy’s distance from us. We saw also that optical
astronomers measure surface brightnesses in units of magnitudes per square
arcsecond, here abbreviated $\mu_B$, $\mu_R$, etc., depending on the waveband of the
measurement. We shall find that half of the light in a typical giant elliptical
galaxy comes from points at which the surface brightness is fainter than $\sim 22\,\mu_B$,
while essentially all the luminosity of a typical spiral galaxy comes from
points at which the surface brightness is fainter than $21.5\,\mu_B$. Unfortunately, the
surface brightness of blank sky at a good site on a moonless night typically
lies near $23\,\mu_B$ (see Table 4.4). To obtain a reasonable understanding of the
light distribution of a galaxy, its brightness distribution must be followed
down to below $26\,\mu_B$, that is, to less than 6% of the night-sky brightness,
and sometimes it is necessary to follow a galaxy down to less than 0.5% of the
sky. Clearly, such measurements will be meaningless unless the brightness of
the night sky can be determined extremely accurately.
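The percentages quoted above follow directly from the definition of the magnitude scale. The short sketch below shows the conversion; the function names are illustrative only and do not come from the text.

```python
import math

def fraction_of_sky(mu_galaxy, mu_sky):
    """Galaxy surface brightness as a fraction of the sky surface brightness.

    Both arguments are in magnitudes per square arcsecond, so a difference
    of dm magnitudes corresponds to a flux ratio of 10**(-0.4 * dm).
    """
    return 10.0 ** (-0.4 * (mu_galaxy - mu_sky))

def magnitudes_below_sky(fraction):
    """Inverse relation: how many magnitudes fainter than the sky a given
    fraction of the sky brightness is."""
    return -2.5 * math.log10(fraction)

# A galaxy followed to 26 mag/arcsec^2 under a 23 mag/arcsec^2 sky
print(round(fraction_of_sky(26.0, 23.0), 3))    # about 0.063, i.e. ~6% of sky
# 0.5% of the sky corresponds to roughly 5.75 mag below the sky level
print(round(magnitudes_below_sky(0.005), 2))
```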

The brightness of the moonless night sky is made up of four contributions 
(Allen 1973, Dube, Wickes & Wilkinson 1972): 


1. Air glow produced by photochemical processes in the upper at- 
mosphere. This component has a very irregular spectrum, but it is 
most prominent at long wavelengths. Its intensity varies from point 
to point on the sky in an irregular way, while, on the average, in- 
creasing from latitude 20° to latitude 70° by about a factor of two. 
It fluctuates in magnitude by about 20% on timescales that can be 
as short as ten minutes. At many observatories, this component is 
augmented by mercury and sodium line radiation from the street 
lamps of nearby cities. 

2. Zodiacal light, which is sunlight scattered off particulate matter in 
the solar system. 

3. Faint and unresolved stars in our Galaxy. 

4. Diffuse extragalactic light, coming from distant, faint, unresolved 
galaxies. 


The relative proportions of these various components, and the total 
intensity they produce when added together, vary from one observing site 
to another. They also vary with galactic and ecliptic longitude and latitude. 
Generally, zodiacal light is the greatest source of brightness, followed by air 
glow and diffuse galactic light. Diffuse extragalactic light usually makes the 
smallest contribution to the night sky. Together, these components shed 
more light onto the surface of the Earth than all the resolved stars, nebulae, 
and galaxies put together. 

Table 4.4 gives typical values for the brightness (in magnitudes per
square arcsec) of the night sky in five wavebands. Notice that the night
sky is rather red; it has a color index near $(B - V) = 0.9$, similar to that of
a fairly red galaxy. Seen from a space vehicle, the sky is darker by a factor
that varies with waveband and distance from the ecliptic. In the V band
this factor can be as large as 1.5 mag, and still larger factors can apply at
longer wavelengths. Moreover, space-based photometry is relatively easy to
correct for the night-sky brightness, because airglow, which varies rapidly in
time and space, does not contribute to space-based data.

Table 4.4 Typical values of the brightness of the night sky

$\mu_U$    $\mu_B$    $\mu_V$    $\mu_R$    $\mu_I$
22.0       22.7       21.8       20.9       19.9

SOURCE: NOAO newsletter 37, 1994 March 1

The first steps in the reduction of photometric observations of a galaxy 
are: (i) to determine the sensitivity of each pixel of one’s imaging device; and 
(ii) to measure the dimming of images towards the edge of the telescope’s field 
of view — that is, the vignetting of the field. The process of correcting for 
these errors is called flat-fielding, and errors arising from this step usually 
dominate the final error budget, especially in the case of HST observations. 

Once the imaging device has been flat-fielded, one has to subtract the
contribution of the sky background from the flux measured in each region of
the sky. It is apparent from the numbers given above that this contribution
will, in general, be much greater than the contribution of the galaxy itself.
Moreover, the sky brightness at the location of the galaxy cannot be directly
measured. One must infer the sky brightness at the galaxy’s location by
extrapolation from neighboring portions of ‘blank’ sky. This last requirement
makes it desirable to use a detector that covers the largest possible area of
the sky, especially if the galaxy being observed has a large angular diameter.⁷
Suppose that the errors in this combined extrapolation are comparable to
3% of the measured value. Then the uncertainty in the galaxy brightness
becomes of order unity where the surface brightness is of order $26\,\mu_B$.
Remarkably, surface-brightness data are commonly reported to be reliable to
even below this value.
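The estimate in this paragraph can be reproduced with a few lines of arithmetic. The sketch below assumes that the 3% error is a fraction of the sky level (the sky dominates the measured value at these radii) and takes the B-band sky brightness of 22.7 from Table 4.4; the function names are illustrative.

```python
import math

def relative_galaxy_error(mu_galaxy, mu_sky, sky_error=0.03):
    """Fractional error in the galaxy's surface brightness produced by a
    fractional error `sky_error` in the subtracted sky level."""
    galaxy_over_sky = 10.0 ** (-0.4 * (mu_galaxy - mu_sky))
    return sky_error / galaxy_over_sky

def limiting_surface_brightness(mu_sky, sky_error=0.03):
    """Surface brightness at which the sky error equals the galaxy signal,
    so that the fractional uncertainty is of order unity."""
    return mu_sky - 2.5 * math.log10(sky_error)

mu_limit = limiting_surface_brightness(22.7)
print(round(mu_limit, 1))                                # about 26.5
print(round(relative_galaxy_error(mu_limit, 22.7), 2))   # 1.0 by construction
```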

Figure 4.16 indicates how the derived brightness profile will depend upon
the chosen sky brightness by showing the profile of a galaxy, whose true
profile follows the $R^{1/4}$ law, as one would observe it for three different
sky-brightness levels. The left-hand panel shows the profile one obtains with too
low a background level. The profile declines almost as a power law [a straight
line in the $(\mu, \log R)$ plane] before flattening off at some brightness level $\mu_0$,
which gives the error in the assumed background level. The center panel in
Figure 4.16 shows the profile one obtains with an overestimated sky level.
The brightness profile now steepens steadily and plunges towards zero at
some finite ‘tidal radius’ $R_t$. Finally, the right-hand panel shows the shape
of the profile one obtains with the correct background. This figure shows
that an overestimated background level, in contrast to an underestimated
one, gives rise to an entirely plausible surface-brightness profile for the
galaxy. Consequently, a profile like that of the center panel in Figure 4.16
cannot by itself be taken as evidence that the galaxy in question has no
luminous material beyond $R_t$.

7 Inadequate sky coverage was for many years a problem with CCD observations of
reasonably nearby galaxies and remains a serious problem with HST data.

Figure 4.16 The effects of errors in the subtracted sky background on the profile of a
galaxy with a truncated brightness distribution. Left: the background has been underestimated
so that the night sky contributes to the “galaxy” brightness at large radii. Center:
the background has been overestimated with the consequence that the surface brightness
appears to go to zero at the ‘tidal radius’ $R_t$. Right: with correctly estimated background
level the galaxy is seen to follow the $R^{1/4}$ law [equation (4.18) below].

This discussion shows that small errors in the sky-brightness can re- 
sult in large errors in the derived surface-brightness profile at large radii. 
Unfortunately, these errors will lead to even more dramatic errors in the 
derived colors of galaxies at large radii, since these have to be obtained by 
differencing two derived surface-brightness profiles. 
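The behaviour shown in Figure 4.16 is easy to reproduce numerically. The following sketch takes a galaxy that obeys the $R^{1/4}$ law of equation (4.18) exactly and subtracts a sky level that is wrong by a stated fraction of $I_e$. The effective surface brightness `mu_e = 22.0` and the function names are assumptions made purely for illustration.

```python
import math

B = 7.67  # exponent constant of the R^(1/4) law, equation (4.18)

def true_profile(R):
    """I(R)/I_e for the R^(1/4) law, with R in units of R_e."""
    return math.exp(-B * (R ** 0.25 - 1.0))

def measured_mu(R, sky_error, mu_e=22.0):
    """Surface brightness (mag/arcsec^2) recovered after subtracting a sky
    level that is too high by `sky_error` (in units of I_e); a negative
    value means the sky was underestimated. Returns None where the net
    flux is not positive, i.e. beyond the spurious cutoff radius."""
    net = true_profile(R) - sky_error
    if net <= 0.0:
        return None
    return mu_e - 2.5 * math.log10(net)

def apparent_cutoff_radius(sky_error):
    """Radius (in R_e) at which an over-subtracted profile reaches zero."""
    return (1.0 + math.log(1.0 / sky_error) / B) ** 4

# Sky overestimated by 1% of I_e: the profile vanishes near 6.6 R_e
print(round(apparent_cutoff_radius(0.01), 2))
# Sky underestimated by the same amount: the profile flattens at mu_e + 5
print(round(measured_mu(1.0e4, -0.01), 2))
```

An error of only 1% of $I_e$ thus produces either a false plateau five magnitudes below $\mu_e$ or a false edge at a few effective radii.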

Once one has decided how much to subtract from the measured surface
brightness to obtain the galaxy’s surface brightness, one can integrate
the latter over the area of the image to obtain the galaxy’s integrated light.
Next one wishes to express this as an apparent magnitude, which is essentially
the ratio of the galaxy’s light to the light from a standard star. Ideally
this is determined by using the same equipment to measure the brightness of a
standard star. Sometimes, however, the CCD photometry is calibrated by
reference to a published apparent magnitude for the galaxy. This will usually
have been determined by observing both the galaxy and a standard star
with a non-imaging photoelectric detector. The weakness of this procedure
is that the spectral responses of the CCD and the photoelectric detector will
be different, as will the colors of the galaxy and the standard star. These
differences oblige one to use necessarily approximate color transformations.
One not infrequently encounters zero-point offsets of order 0.2 mag between
the results of different observers, in part because determination of the
photometric zero-point involves comparing the brightnesses of objects that are
widely separated on the sky. Since the properties of the sky vary very little
on the angular scales of typical galaxies, the relative brightnesses of different
points within a single image should be subject to errors that are very much
smaller than 0.2 mag.

Figure 4.17 The dashed curves show the true central surface-brightness
profiles of circularly-symmetric galaxies in which p ~
r-1 and p ~ r~° at small radii. The full curves show the profiles
that would be measured for these galaxies if the PSF were a
Gaussian of dispersion $\sigma$.


4.2.2 Effect of seeing 


In §3.2.3 we encountered the phenomenon of ‘seeing’: the blurring of images 
by the atmosphere. We now investigate how seeing degrades measurements 
of the surface-brightnesses of galaxies. 

The seeing is characterized by the point-spread function (PSF) $P(\mathbf{d})$,
which gives the probability density that a photon will hit the imaging device
at a point that is offset by the vector $\mathbf{d}$ from where it would have hit in
the absence of seeing. If, in the absence of seeing, the surface brightness
at position $\mathbf{R}'$ in the detector is $I_t(\mathbf{R}')$, then the measured brightness at
location $\mathbf{R}$ will be

$$ I_{\rm app}(\mathbf{R}) = \int d^2\mathbf{R}'\, P(\mathbf{R} - \mathbf{R}')\, I_t(\mathbf{R}'). \qquad (4.6) $$


Figure 4.18 Isophotes for a model E5 galaxy observed with a Gaussian
PSF of dispersion $\sigma$. The galaxy’s intrinsic surface brightness is
proportional to $[x^2 + (y/2)^2]^{-1/2}$. [Horizontal axis: $x/\sigma$.]

The simplest case to consider is that of a circularly-symmetric
surface-brightness distribution $I_t(R')$ and a Gaussian PSF. If the dispersion of the
PSF is denoted $\sigma$ then we have

$$ P(d) = \frac{1}{2\pi\sigma^2} \exp\left(-\frac{d^2}{2\sigma^2}\right) \qquad (4.7) $$

and it is not hard to show⁸ that equation (4.6) can be reduced to

$$ I_{\rm app}(R) = \int_0^\infty dR'\, R'\, I_t(R')\, I_0\!\left(\frac{R R'}{\sigma^2}\right) \frac{\exp\left[-(R^2 + R'^2)/2\sigma^2\right]}{\sigma^2}, \qquad (4.8) $$

where $I_0$ is a modified Bessel function of order zero. We shall find that the
true surface brightnesses of galaxies tend to diverge as $R \to 0$ as negative
powers of $R$; that is, for small $R$ we have $I_t \sim R^{-\gamma}$, with $0 < \gamma < 1$. If
we plug this form of $I_t$ into equation (4.8), we obtain the apparent
surface-brightness profiles shown in Figure 4.17. For $R \lesssim \sigma$ the measured surface
brightness is smaller than the true surface brightness, while for $R \gtrsim \sigma$
the measured surface brightness is larger than the true surface brightness
since the light removed from the smallest radii has to emerge at slightly larger
radii. The overall effect of seeing is to introduce into a featureless
power-law profile an apparent core, that is, a central region of nearly constant
apparent surface brightness. Since some intuitively attractive theoretical
galaxy models possess similar cores, it was until recently widely believed
that the cores in apparent surface-brightness profiles reflected cores in the
true brightness profiles rather than artifacts introduced by seeing.
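The formation of an apparent core can be checked with a direct numerical evaluation of equation (4.6) for a Gaussian PSF. The sketch below integrates in polar coordinates about the galaxy center; carrying out the angular integral analytically is what produces the Bessel function in equation (4.8). The power-law profile $I_t = R^{-\gamma}$, the quadrature settings and the function name are illustrative assumptions rather than anything prescribed by the text.

```python
import math

def apparent_brightness(R, gamma, sigma=1.0, n_r=1500, n_phi=300):
    """Seeing-convolved brightness at radius R for a true profile
    I_t(R') = R'**(-gamma) and a Gaussian PSF of dispersion sigma.

    Midpoint quadrature of equation (4.6) in polar coordinates (R', phi).
    """
    r_max = R + 8.0 * sigma            # the PSF is negligible beyond 8 sigma
    dr = r_max / n_r
    dphi = math.pi / n_phi
    total = 0.0
    for i in range(n_r):
        rp = (i + 0.5) * dr
        # Integrate the PSF over angle; by symmetry use 0..pi and double.
        angular = 0.0
        for k in range(n_phi):
            phi = (k + 0.5) * dphi
            d2 = R * R + rp * rp - 2.0 * R * rp * math.cos(phi)
            angular += math.exp(-d2 / (2.0 * sigma ** 2))
        angular *= 2.0 * dphi
        total += rp ** (1.0 - gamma) * angular * dr
    return total / (2.0 * math.pi * sigma ** 2)

# True profile 1/R diverges at the center; the convolved profile stays finite.
print(round(apparent_brightness(0.0, 1.0), 3))
# At R = 2 sigma the convolved value exceeds the true value of 0.5.
print(round(apparent_brightness(2.0, 1.0), 3))
```

For $\gamma = 1$ the central value can be found in closed form, $\sqrt{\pi/2}\,/\sigma$, which provides a convenient check on the quadrature.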

Now consider the effects of seeing on a galaxy image that is not circularly
symmetric. Figure 4.18 shows isophotes of a model galaxy in which the
intrinsic surface brightness is proportional to $[x^2 + (y/2)^2]^{-1/2}$ when the
galaxy is observed with a Gaussian PSF of dispersion $\sigma$. In the absence of
seeing all isophotes would have axis ratio 2:1, but it is clear that the central
isophotes in Figure 4.18 are rounder than those further out. The full curve
in Figure 4.19 quantifies this impression by plotting the apparent ellipticity

$$ \epsilon = 1 - b/a \qquad (4.9) $$

of the isophotes as a function of the length $a$ of their semimajor axes. Even
the isophote with semimajor axis length $a = 4\sigma$ is made appreciably rounder
by seeing. The dashed curve in Figure 4.19 shows that the effects of seeing
on the measured ellipticity are less pronounced when the surface brightness
increases less steeply towards the center: the dashed curve shows the same
quantities as the full curve except for an intrinsic surface brightness
proportional to $[x^2 + (y/2)^2]^{-1/4}$.

8 See Problem 4.1

Figure 4.20 Left: an outer boxy isophote fitted by an ellipse with geometry of $a_4$
determination shown. Right: a disky isophote and its fitting ellipse.

Figure 4.21 The diskiness [equation (4.10)] of NGC 4697 is large from the center out
to R ~ 40 arcsec, and from there out is effectively zero (left panel). The right panel
demonstrates that this result reflects the existence of a disk at the center of NGC 4697
by showing an image of the galaxy from which a best-fitting elliptical model has been
subtracted. [After Peletier et al. (1990) from data kindly supplied by R. Peletier]

Isophotes are generally not exactly elliptical. In particular, some isophotes
are boxy like the outer isophote in Figure 4.20, while others, like
the inner isophote in Figure 4.20, resemble a leaf; such isophotes are called
disky. The boxiness or diskiness of isophotes is often quantified by measuring
a quantity denoted $a_4$. Figure 4.20 illustrates this process. First the
ellipse $R_e(\phi)$ is fitted to the isophote. Then for each angle $\phi$ one determines
the distance $\delta(\phi) = R_i(\phi) - R_e(\phi)$ between the radii of the corresponding
points on the ellipse and on the isophote. Finally one expresses the function
$\delta(\phi)$ as a Fourier series:

$$ \delta(\phi) = \bar\delta + \sum_{n=1}^{\infty} a_n \cos n\phi + \sum_{n=1}^{\infty} b_n \sin n\phi. \qquad (4.10) $$

If the isophote is four-fold symmetric and the ellipse has been correctly fitted,
$\bar\delta$, $a_1$, $a_2$, $a_3$ and all the $b_n$ should be small. If the isophote is disky, $a_4$ will
be greater than zero, while $a_4$ is less than zero for a boxy isophote. The
quantity $a_4/a$, where $a$ is the length of the isophote’s semimajor axis, forms
a dimensionless measure of the diskiness of the isophote.
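The Fourier analysis of equation (4.10) can be sketched as follows. The example assumes that the residuals $\delta(\phi)$ have already been measured at equally spaced angles, and it uses a synthetic disky isophote; the function name and the numerical values are illustrative.

```python
import math

def fourier_coefficients(delta, n_max=4):
    """Mean value and Fourier coefficients a_n, b_n of equation (4.10).

    `delta` holds the radial residuals delta(phi_k) between isophote and
    fitted ellipse at equally spaced angles phi_k = 2*pi*k/N.
    """
    n_samples = len(delta)
    mean = sum(delta) / n_samples
    a, b = {}, {}
    for n in range(1, n_max + 1):
        a[n] = 2.0 / n_samples * sum(
            d * math.cos(n * 2.0 * math.pi * k / n_samples)
            for k, d in enumerate(delta))
        b[n] = 2.0 / n_samples * sum(
            d * math.sin(n * 2.0 * math.pi * k / n_samples)
            for k, d in enumerate(delta))
    return mean, a, b

# Synthetic residuals for an isophote with semimajor axis 10 (arbitrary
# units) that bulges outward along both principal axes, i.e. is disky.
n_samples = 360
semi_major = 10.0
delta = [0.2 * math.cos(4.0 * 2.0 * math.pi * k / n_samples)
         for k in range(n_samples)]

mean, a, b = fourier_coefficients(delta)
print(round(a[4] / semi_major, 3))   # a_4/a = 0.02 > 0, a disky isophote
```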

The diskiness $a_4/a$ is even more strongly diminished by poor seeing than
is $\epsilon$.

Figure 4.21 illustrates the power of $a_4/a$ to detect a buried disk by
showing in the left panel the diskiness of NGC 4697 and in the right panel a
picture of the galaxy after subtraction of the best-fitting elliptical model of
the system.⁹ The diskiness is large out to $R \sim 30''$ and falls to zero beyond
$R \sim 50''$. The subtracted galaxy image clearly shows a band of luminosity
along the major axis, which is almost certainly a disk.


4.2.3 Deprojecting galaxy images 


What can we infer about the three-dimensional luminosity density $j(\mathbf{r})$ in a
transparent galaxy from its projected surface-brightness distribution $I(\mathbf{R})$?
If $I(\mathbf{R})$ is circularly symmetric, it is possible that $j(\mathbf{r})$ is spherically
symmetric. Figure 4.22 shows that in this case, $j$ is related to $I$ by

$$ I(R) = \int_{-\infty}^{\infty} dz\, j(r) = 2 \int_R^\infty \frac{j(r)\, r\, dr}{\sqrt{r^2 - R^2}}. \qquad (4.11) $$


9 The technique of showing the difference between an image and a smooth model of 
the image is called unsharp masking. In early work the smooth model was often just an 
out-of-focus copy of the image itself. Now it is more likely based on either ellipse-fitting 
or digital smoothing of the original image. 


This is an Abel integral equation for $j$ as a functional of $I$, and its solution
is [see §4.2 of BT for details]

$$ j(r) = -\frac{1}{\pi} \int_r^\infty \frac{dI}{dR} \frac{dR}{\sqrt{R^2 - r^2}}. \qquad (4.12) $$


While equation (4.12) is useful when $I(R)$ is available as a smooth function,
it should not be used when $I(R)$ is contaminated by noise, as it will be if it
has been inferred from surface-brightness measurements. The Richardson-Lucy
technique described in Appendix C provides an appropriate means of
recovering $j(r)$ from a noisy surface-brightness profile; see Binney & Mamon
(1982) for details.

Often it is desirable to fit to an observed surface-brightness profile a
formula that corresponds to a simple analytical form of $j(r)$. Perhaps the
simplest pair of formulae that are related by projection is

$$ I(R) = \frac{I_0}{1 + (R/r_0)^2}, \qquad j(r) = \frac{j_0}{[1 + (r/r_0)^2]^{3/2}}, \qquad (4.13) $$

where $I_0 = 2 r_0 j_0$. The surface-brightness profile given by (4.13) is known
as the modified Hubble law because of its similarity to Hubble’s law
introduced in Box 4.1 below. Notice from equations (4.13) that $I \sim R^{-2}$
for $R \gg r_0$, while $j \sim r^{-3}$ for $r \gg r_0$. It is, in fact, a general property of
spherical models that if the luminosity density falls as a power law at large $r$,
the surface brightness falls as a power law with an exponent that is smaller
by unity than the exponent of the luminosity density.
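The projection relation between the two formulae in (4.13) can be verified numerically. The sketch below evaluates the line-of-sight form of equation (4.11), $I(R) = 2\int_0^\infty j(\sqrt{R^2 + z^2})\,dz$, with a simple midpoint rule; the mapping $z = t/(1-t)$ used to reach infinity and the function names are choices made here for illustration.

```python
import math

def j_modified_hubble(r, j0=1.0, r0=1.0):
    """Luminosity density of equation (4.13)."""
    return j0 * (1.0 + (r / r0) ** 2) ** -1.5

def project(j, R, n=20000):
    """Surface brightness I(R) = 2 * integral_0^inf j(sqrt(R^2 + z^2)) dz.

    The substitution z = t/(1 - t) maps the infinite range onto 0 <= t < 1,
    which is then integrated with the midpoint rule.
    """
    total = 0.0
    for k in range(n):
        t = (k + 0.5) / n
        z = t / (1.0 - t)
        total += j(math.hypot(R, z)) / (1.0 - t) ** 2
    return 2.0 * total / n

# With j0 = r0 = 1 equation (4.13) predicts I(R) = 2 / (1 + R^2).
for R in (0.0, 1.0, 3.0):
    print(R, round(project(j_modified_hubble, R), 4), round(2.0 / (1.0 + R * R), 4))
```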

A significant problem with the modified Hubble law is that it predicts
that the total luminosity interior to projected radius $R$ diverges as $\log R$:

$$ L(R) = 2\pi \int_0^R I(R')\, R'\, dR' = \pi r_0^2 I_0 \ln\left[1 + (R/r_0)^2\right]. \qquad (4.14) $$
Table 4.5 Surface brightness profiles associated with equation (4.15)


y I(R)/(L/na?) R./a 
_9 _ 1362 2 2\) 2 _ 4/-1/2 
r 2 — 13s? + 3s*(4 + s*)|s* — 1| c(s) eae 
4(s? — 1)8 
Oe ies | eh ees ae 
; (2 + s?)|s? — 1|~*/%c(s) —3 hie 
2(s? — 1)? 
p88 DFE) | ve - EES) |g 
2 2(s?-1)? = /s(s +1)8/2(s—1)? (8 +: 1)3/2(s—1)?_ 
w 1 1 (2-87)c(s) 
a 4s 52-1 (: |s2 — 1|1/2 ) atee 


a oo = [2 _ Jcosh “(s~t) fors <1, 
NOTES: s = R/a; so= S41? c(s) = ere en > 1. 


E and F are incomplete elliptic integrals. 


Clearly, a real galaxy could not follow this law all the way to $R = \infty$.
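The logarithmic divergence of equation (4.14) is easily seen in numbers. The sketch below compares a direct numerical integration of the modified Hubble profile with the closed form, in units where $I_0 = r_0 = 1$; the function names are illustrative.

```python
import math

def luminosity_numeric(R, I0=1.0, r0=1.0, n=100000):
    """2*pi * integral_0^R I(R') R' dR' for the modified Hubble law,
    evaluated with the midpoint rule."""
    dR = R / n
    total = 0.0
    for k in range(n):
        Rp = (k + 0.5) * dR
        total += I0 / (1.0 + (Rp / r0) ** 2) * Rp
    return 2.0 * math.pi * total * dR

def luminosity_analytic(R, I0=1.0, r0=1.0):
    """Closed form of equation (4.14)."""
    return math.pi * r0 ** 2 * I0 * math.log(1.0 + (R / r0) ** 2)

print(round(luminosity_numeric(10.0), 3), round(luminosity_analytic(10.0), 3))
# Each further decade in R adds nearly the same amount, pi * ln(100):
print(round(luminosity_analytic(1000.0) - luminosity_analytic(100.0), 3))
print(round(luminosity_analytic(10000.0) - luminosity_analytic(1000.0), 3))
```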
A family of simple density profiles that does not suffer from this defect
is given by (Dehnen 1993)

$$ j(r) = \frac{(3 - \gamma) L}{4\pi} \frac{a}{r^{\gamma} (r + a)^{4 - \gamma}}. \qquad (4.15) $$

Here $L$ is the total luminosity of the model and $a$ is a characteristic scale
length. When $\gamma$ is either an integer or a half-integer, this form of $j$ yields
an analytically tractable projected density distribution; see Table 4.5 for
details. Three special cases are noteworthy within this one-parameter
family of Dehnen models. The model with $\gamma = 1$ is known
as the Hernquist model (Hernquist 1990), the model with $\gamma = 3/2$ yields
the closest approximation to the ‘$R^{1/4}$ model’ that is defined below [equation
(4.18)], and the model with $\gamma = 2$ is known as the Jaffe model (Jaffe 1983).
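That $L$ in equation (4.15) really is the total luminosity can be confirmed by integrating $4\pi r^2 j(r)$ numerically. The sketch below does this on a logarithmic grid for $\gamma \le 2$ and compares the result with the closed form $L\,[r/(r+a)]^{3-\gamma}$, which follows from (4.15) on substituting $x = r/(r+a)$. The function names, grid settings and inner cutoff are assumptions made for this illustration.

```python
import math

def dehnen_density(r, gamma, L=1.0, a=1.0):
    """Luminosity density of equation (4.15)."""
    return ((3.0 - gamma) * L / (4.0 * math.pi)
            * a / (r ** gamma * (r + a) ** (4.0 - gamma)))

def enclosed_luminosity(r_max, gamma, L=1.0, a=1.0, n=20000, r_min=1.0e-8):
    """4*pi * integral j r^2 dr from r_min*a to r_max, using the midpoint
    rule in u = ln r (so that dr = r du)."""
    u_lo = math.log(r_min * a)
    u_hi = math.log(r_max)
    du = (u_hi - u_lo) / n
    total = 0.0
    for k in range(n):
        r = math.exp(u_lo + (k + 0.5) * du)
        total += dehnen_density(r, gamma, L, a) * r ** 3
    return 4.0 * math.pi * total * du

def enclosed_closed_form(r, gamma, L=1.0, a=1.0):
    """Closed-form luminosity inside radius r."""
    return L * (r / (r + a)) ** (3.0 - gamma)

for gamma in (1.0, 1.5, 2.0):
    print(gamma,
          round(enclosed_luminosity(1.0, gamma), 4),      # inside r = a
          round(enclosed_luminosity(1.0e4, gamma), 4))    # nearly all of L
```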

It is instructive to consider the surface brightness of a general Dehnen
model. Inserting equation (4.15) into (4.11) and making the substitution
$r = R\cosh\psi$, for which $dr/\sqrt{r^2 - R^2} = d\psi$ and a factor $r = R\cosh\psi$
remains in the numerator, we find, in units with $a = 1$ and dropping constant
factors,

$$ I(R) \propto R^{1-\gamma} \int_0^\infty \frac{\cosh^{1-\gamma}\psi\; d\psi}{(1 + R\cosh\psi)^{4-\gamma}}. \qquad (4.16) $$

When $\gamma > 1$, $\cosh\psi$ is raised to a negative power on the top of the
integrand in (4.16). On account of this the dominant contributions to the
integral come from small values of $\psi$, at which the bottom of the integrand
is approximately equal to unity. Hence, for $\gamma > 1$ the integral in (4.16) is
approximately independent of $R$ for $R \ll 1$ and $I$ diverges as $R^{1-\gamma}$ at small
$R$. If, by contrast, $\gamma < 1$, then $\cosh\psi$ is raised to a positive power in (4.16)
and the dominant contributions to the integral come from those values of $\psi$
for which $R\cosh\psi \sim 1$. In these circumstances the integral is sensitive to
the value of $R$ and we no longer have $I \sim R^{1-\gamma}$.¹⁰ In fact, if $R \ll 1$, we can
argue that for the dominating values of $\psi$, $\cosh\psi \sim R^{-1}$, and when
we use this estimate to evaluate the integrand, we find that $I$ is approximately
independent of $R$ for very small $R$. These approximate results are
confirmed by the exact results listed in Table 4.5. Thus the case $\gamma = 1$ is a
sort of watershed: luminosity densities that increase inwards more strongly
than $r^{-1}$ give rise to cuspy surface-brightness distributions, while less steeply
rising luminosity densities give rise to surface-brightness profiles that tend
to a constant at small $R$.
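The watershed at $\gamma = 1$ can be illustrated by projecting Dehnen models numerically and measuring the logarithmic slope of $I(R)$ at small radii. The sketch below applies the substitution $r = R\cosh\psi$ directly to equation (4.11), which gives $I(R) = 2\int_0^\infty j(R\cosh\psi)\, R\cosh\psi\, d\psi$. The outer cutoff, the two radii used to estimate the slope and the function names are illustrative assumptions.

```python
import math

def dehnen_density(r, gamma, L=1.0, a=1.0):
    """Luminosity density of equation (4.15)."""
    return ((3.0 - gamma) * L / (4.0 * math.pi)
            * a / (r ** gamma * (r + a) ** (4.0 - gamma)))

def surface_brightness(R, gamma, n=4000, r_cut=1.0e4):
    """Project j(r) along the line of sight with r = R cosh(psi).

    The integral is truncated at r = r_cut (in units of a), beyond which
    the density contributes negligibly.
    """
    psi_max = math.acosh(r_cut / R)
    dpsi = psi_max / n
    total = 0.0
    for k in range(n):
        r = R * math.cosh((k + 0.5) * dpsi)
        total += dehnen_density(r, gamma) * r
    return 2.0 * total * dpsi

def inner_slope(gamma, R1=1.0e-4, R2=2.0e-4):
    """Logarithmic slope d ln I / d ln R estimated between R1 and R2."""
    ratio = surface_brightness(R2, gamma) / surface_brightness(R1, gamma)
    return math.log(ratio) / math.log(R2 / R1)

print(round(inner_slope(1.5), 2))   # close to 1 - gamma = -0.5: a cusp
print(round(inner_slope(0.5), 2))   # close to zero: a flat core
```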

If $I(\mathbf{R})$ is not circularly symmetric, the galaxy cannot be spherically
symmetric, but it might be axisymmetric. If we assume that this is the case,
and that the line of sight to the galaxy lies within the galaxy’s equatorial
plane, then $I(\mathbf{R})$ and $j(\mathbf{r})$ are again related by a soluble integral equation.
Binney, Davies & Illingworth (1990) give a Richardson-Lucy algorithm suitable
for solving this equation for noisy data $I(\mathbf{R})$.

In general the line of sight will be inclined at an angle to an axisymmetric
galaxy’s equatorial plane. In this case Rybicki (1987) showed that the
image contains insufficient information to reconstruct the three-dimensional
luminosity density $j(\mathbf{r})$. Let the galaxy’s inclination $i$ be the angle between
the normal to the equatorial plane and the line of sight. Then the amount
of missing information increases steadily from the edge-on case, $i = 90^\circ$,
in which $j$ can be uniquely inferred from $I$, to the face-on case, $i = 0^\circ$. In
the face-on case it is easy to see that many different luminosity densities
project to the same (circularly-symmetric) surface-brightness distribution:
given one possible axisymmetric luminosity distribution, others can be generated
by moving stars parallel to the line of sight. In the case $0 < i < 90^\circ$ it
is not so easy to see how to change $j(\mathbf{r})$ without changing $I(\mathbf{R})$. However, for
any angle $i < 90^\circ$ Gerhard & Binney (1996) display konus densities. These
densities contain positive and negative regions arranged such that they are
completely invisible at any inclination smaller than $i$. Consequently, a konus
density can be added to any possible luminosity density $j(\mathbf{r})$ to generate
another that projects to exactly the same surface-brightness distribution. It
follows that whenever $i < 90^\circ$ it is in principle not possible to determine a
galaxy’s true shape from photometry alone.

Notwithstanding the existence of konus densities, the Richardson-Lucy
algorithm of Binney, Davies & Illingworth (1990) works for inclinations less
than $90^\circ$ in the sense that it recovers one of the infinitely many luminosity
densities that are compatible with given $I(\mathbf{R})$. However, for $i < 60^\circ$ the
recovered luminosity density is liable to be sensitive to noise in the data and
physically implausible.

10 Thus the simple relationship between power-law luminosity densities and surface
brightnesses that holds at large radii and was noted above does not necessarily hold at
small radii.

Figure 4.23 Contours of equal apparent axial ratio on the sphere of viewing directions.
At left for an oblate body with $\eta = 0.9$, $\zeta = 0.5$ and at right for a prolate body with
$\eta = 0.6$, $\zeta = 0.5$. The dashed lines show the directions of the axes of the ellipsoid, with
the shortest axis upward and the longest axis to the left. The small circular contours at
the upper left of each figure surround the directions at which the body appears circularly
symmetric and $q = 1$. The contours run from $q = 0.975$ to $q = \zeta + 0.025$ in intervals of
$\delta q = 0.05$.

We shall see below that many galaxies cannot be axisymmetric. So
let us suppose that a given galaxy is triaxial in the sense that it has three
orthogonal symmetry planes, and investigate the relationship between its
luminosity density $j(\mathbf{r})$ and its surface-brightness distribution $I(\mathbf{R})$. In the
interests of simplicity we additionally assume that $j$ is constant on ellipsoidal
surfaces, that is, that $j$ is a function of the variable

$$ a_s \equiv \sqrt{x^2 + \frac{y^2}{\eta^2} + \frac{z^2}{\zeta^2}}. \qquad (4.17) $$

Here $\eta$ and $\zeta$ are the axis ratios of the ellipsoids of constant $a_s$. Stark (1977)
showed that when such a galaxy is projected onto the sky, its isophotes are
similar, coaxial ellipses. Both the axis ratio $q$ and the orientation of these
ellipses depend on the direction of the line of sight. Figure 4.23 shows $q$
on the sphere of possible viewing directions for two triaxial bodies. There
are always lines of sight along which the body appears circularly symmetric.
Figure 4.36 below gives the probability of observing the body to have a given
value of $q$.

Figure 4.24 Isophotal twist as a consequence of triaxiality. Two
concentric, coaxial ellipsoids are shown. The dashed lines mark the
intersections of the ellipsoids with the coordinate planes, while the
solid lines show their outlines to the observer. The arrows mark
the directions of their apparent principal axes.

The orientation on the sky of the elliptical isophotes of a triaxial body
depends not only on the orientation of the body, but also on the body’s axis
ratio. Figure 4.24 shows a prolate ellipsoid inside an oblate ellipsoid. Now,
although the coordinate axes in the figure are the principal axes of both the
inner and the outer ellipsoids, it is evident that the apparent major axis of
the outer ellipsoid makes an angle of 50° with the apparent major axis of the
inner body. This skew alignment of the apparent major axes arises from the
fact that the axial ratios of the two bodies are different. The inner body has
a:b:c = 1:2:1, whereas the outer body has a:b:c = 1:1:0.5. Imagine that these
two bodies are two of the isodensity surfaces of an elliptical galaxy, and that
the isodensity surfaces between these two have axial ratios such as 1:1.8:0.9,
1:1.1:0.6, and so on, which form a continuous sequence between the ratios of
the two surfaces drawn in Figure 4.24. Then, if the isodensity surfaces do
have axial ratios that vary with increasing mean radius, the isophotes will
be seen (from the given observing position) to twist continuously through
a substantial angle. Thus if galaxies are triaxial, we expect variation in
intrinsic axial ratios to be associated with isophote twist. By contrast,
the isophotes of an axisymmetric system must always be aligned with one
another.
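The geometry behind Figures 4.23 and 4.24 can be explored with a small calculation. The sketch below computes the outline that an ellipsoid with semi-axes $(a, b, c)$ along $(x, y, z)$ presents to an observer in the direction with polar angles $(\theta, \phi)$, and returns its axis ratio $q$ and the position angle of its major axis. It relies on two assumptions that are not spelled out in the text: that the outline of an ellipsoid is the ellipse whose squared semi-axes are the eigenvalues of the matrix $B_{ij} = \sum_k e_{ik} e_{jk} s_k^2$ (with $\mathbf{e}_1, \mathbf{e}_2$ an orthonormal basis of the sky plane and $s_k$ the semi-axes), and that the isophotes of a body stratified on similar ellipsoids have the same shape as this outline. The viewing direction chosen is arbitrary, so the twist found differs from the 50° of Figure 4.24.

```python
import math

def projected_outline(semi_axes, theta_deg, phi_deg):
    """Axis ratio q and major-axis position angle (degrees, measured from
    the sky-plane vector e1) of an ellipsoid's projected outline."""
    theta = math.radians(theta_deg)
    phi = math.radians(phi_deg)
    # Orthonormal basis of the plane perpendicular to the line of sight
    e1 = (-math.sin(phi), math.cos(phi), 0.0)
    e2 = (-math.cos(theta) * math.cos(phi),
          -math.cos(theta) * math.sin(phi),
          math.sin(theta))
    s2 = [s * s for s in semi_axes]
    b11 = sum(e1[k] * e1[k] * s2[k] for k in range(3))
    b22 = sum(e2[k] * e2[k] * s2[k] for k in range(3))
    b12 = sum(e1[k] * e2[k] * s2[k] for k in range(3))
    # Eigenvalues of the symmetric 2x2 matrix B
    mean = 0.5 * (b11 + b22)
    half_gap = math.hypot(0.5 * (b11 - b22), b12)
    q = math.sqrt((mean - half_gap) / (mean + half_gap))
    angle = 0.5 * math.degrees(math.atan2(2.0 * b12, b11 - b22))
    return q, angle

def isophote_twist(inner_axes, outer_axes, theta_deg, phi_deg):
    """Angle (0 to 90 degrees) between the apparent major axes."""
    _, pa_inner = projected_outline(inner_axes, theta_deg, phi_deg)
    _, pa_outer = projected_outline(outer_axes, theta_deg, phi_deg)
    diff = abs(pa_inner - pa_outer) % 180.0
    return min(diff, 180.0 - diff)

inner = (1.0, 2.0, 1.0)   # prolate body of Figure 4.24
outer = (1.0, 1.0, 0.5)   # oblate body of Figure 4.24
print(round(isophote_twist(inner, outer, 60.0, 50.0), 1))  # a substantial twist
print(round(isophote_twist(inner, outer, 90.0, 0.0), 1))   # none along the x-axis
```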

Two points are worth noting in connection with isophote twists. (1) If
a galaxy happens to be aligned in space such that the line of sight is one of
the principal axes, then its isophotes will not twist.¹¹ (2) Although changes
with radius in the true axial ratios generate twists, it does not follow that, in
any particular galaxy, an isophotal twist must be accompanied by a change
in the apparent axial ratio. To specify a triaxial model, one has to determine
the two true axial ratios, but fixing the apparent axial ratio at some definite
value still leaves one degree of freedom, which allows the position angle of
the observed isophotes to vary.

11 There is another much more remarkable case in which no twist will be observed;
see Franx (1988).

Figure 4.25 The major-axis brightness profile of NGC 1700 plotted
against (a) $\log R$ and (b) $R^{1/4}$. [From data published in Capaccioli,
Piotto & Rampazzo (1988)]


4.3 Photometry of Elliptical Galaxies 


To a good approximation elliptical galaxies are transparent. Hence the lu- 
minosity of these systems may be considered to be the unattenuated light 
from myriads of individual stars. 


4.3.1 Radial surface-brightness profiles of elliptical galaxies 


Figure 4.25 shows the surface brightness as a function of distance $R$ along the
major axis of the giant elliptical galaxy NGC 1700. In the left-hand panel
$\mu_B$ is plotted against $\log R$, while in the right-hand panel the independent
variable is $R^{1/4}$. If the surface brightness $I$ varied as a power of $R$, $\mu_B$ would
be linear in $\log R$ and the points in panel (a) would
lie on a straight line. At the smallest and largest radii the data can be crudely
fitted by straight lines and from the slopes of these lines one infers $I \sim R^{-0.8}$
at small $R$ and $I \sim R^{-1.7}$ at large $R$. To a remarkably good approximation,
the points in the right-hand panel lie on a single straight line. This indicates
that $\mu_B = x + yR^{1/4}$, where $x$ and $y$ are positive constants, and therefore that
$I \propto 10^{-0.4\mu_B} \propto 10^{-0.4yR^{1/4}}$. Since this relationship between $I$ and $R^{1/4}$ was
discovered by de Vaucouleurs (1948), it is known as the de Vaucouleurs
$R^{1/4}$ law. It is conventionally written

$$ I(R) = I_e\, 10^{-3.33[(R/R_e)^{1/4} - 1]} = I_e \exp\{-7.67[(R/R_e)^{1/4} - 1]\}. \qquad (4.18) $$

The length scale $R_e$ is known as the effective radius, and the numerical
factor 3.33 in (4.18) is chosen such that, if the galaxy image were circularly
symmetric and if the formula were valid at all radii, both large and small (and
it must be remembered that its validity has been established only for a
restricted range of radii), then one-half of the total light of the system would
be emitted interior to $R_e$. Thus

$$ 2\int_0^{R_e} dR\, I(R)\, 2\pi R = \int_0^\infty dR\, I(R)\, 2\pi R = \frac{8!\,\exp(7.67)}{(7.67)^8}\,(\pi R_e^2 I_e) = 7.22\,\pi R_e^2 I_e. \qquad (4.19) $$

The parameter $I_e$ is clearly the surface brightness at $R = R_e$, and, according
to the $R^{1/4}$ law, the central brightness of the galaxy is $10^{3.33} I_e \approx 2000 I_e$.
From equation (4.19) it follows that the mean surface brightness interior to
$R_e$ is $\langle I \rangle_e = 3.61 I_e$.

Unless an elliptical galaxy is exactly circularly symmetric, different values
of the effective radius will be obtained by fitting (4.18) to the galaxy’s
brightness profiles along its major and minor axes. Quoted effective radii
are usually the geometric mean of these two values, say $a_e$ and $b_e$. Thus
$R_e = (a_e b_e)^{1/2}$. With this definition, half the total light is predicted to be
contained within the isophotal ellipse that has area $\pi R_e^2$.

Given that it contains only two free parameters, $I_e$ and $R_e$, the $R^{1/4}$
law provides a remarkably good fit to the surface-brightness profiles of many
elliptical galaxies. However, as Figure 4.27 illustrates, the brightness profiles
of elliptical galaxies show genuine individuality and some deviate significantly
from the best-fitting $R^{1/4}$ profile. Box 4.1 describes some three-parameter
fitting functions that have been employed to parameterize the brightness
profiles of elliptical galaxies more completely than the $R^{1/4}$ law can, but
none of these formulae is as important as the $R^{1/4}$ law.

To some extent the deviations of galaxies from the $R^{1/4}$ law depend
systematically on the galaxies’ luminosities. Specifically, when $\mu$ is plotted
versus $R^{1/4}$, the surface-brightness profiles of low-luminosity galaxies tend to
curve downwards at all values of $R^{1/4}$, whereas the surface-brightness profiles
of the most luminous galaxies curve upwards over significant intervals
in $R^{1/4}$. This phenomenon is illustrated by Figure 4.26, which shows
characteristic luminosity profiles for galaxies whose luminosities inside 16 kpc lie
within six ranges.
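The numerical constants attached to the $R^{1/4}$ law in equations (4.18) and (4.19) can be checked by direct integration. The sketch below works in units where $I_e = R_e = 1$ and uses the substitution $x = (R/R_e)^{1/4}$ to make the integrand smooth; the function names and the outer limit of $10^4 R_e$ are choices made here for illustration.

```python
import math

B = 7.67  # exponent constant in equation (4.18)

def de_vaucouleurs(R, I_e=1.0, R_e=1.0):
    """Surface brightness of the R^(1/4) law, equation (4.18)."""
    return I_e * math.exp(-B * ((R / R_e) ** 0.25 - 1.0))

def luminosity_within(R, I_e=1.0, R_e=1.0, n=20000):
    """Integral of I(R') 2 pi R' dR' from 0 to R, by the midpoint rule in
    x = (R'/R_e)^(1/4), for which dR' = 4 R_e x^3 dx."""
    x_max = (R / R_e) ** 0.25
    dx = x_max / n
    total = 0.0
    for k in range(n):
        x = (k + 0.5) * dx
        radius = R_e * x ** 4
        total += (de_vaucouleurs(radius, I_e, R_e)
                  * 2.0 * math.pi * radius * 4.0 * R_e * x ** 3)
    return total * dx

total = luminosity_within(1.0e4)   # effectively the total luminosity
half = luminosity_within(1.0)      # luminosity inside R_e

print(round(total / math.pi, 2))                    # about 7.2, cf. (4.19)
print(round(half / total, 3))                       # close to one half
print(round(half / math.pi, 2))                     # mean brightness inside R_e
print(round(math.log10(de_vaucouleurs(0.0)), 2))    # central value: 10^3.33 I_e
```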


cD galaxies. Galaxies whose surface-brightness profiles turn strongly upwards
in a $(\mu, R^{1/4})$ plot are called cD galaxies. Figure 4.28 shows the
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Box 4.1: Three-Parameter Fitting Functions 


Fitting functions that contain more than two free parameters would be
expected to yield good fits to a larger range of galaxies than can be
obtained with the $R^{1/4}$ law. One possibility is to replace the $1/4$ in the
$R^{1/4}$ law with $1/n$, where $n$ is fitted to the data. That is, the data are
fitted by the generalized de Vaucouleurs law

$$ I(R) = I_e\,10^{-b_n[(R/R_e)^{1/n} - 1]}. \qquad (1) $$

The constant $b_n$ is again chosen such that half the total luminosity predicted
by the law comes from $R < R_e$ (Sersic 1968, Ciotti 1991). When
$n < 4$, equation (1) predicts a brightness profile that curves downwards
in a plot of $\mu$ versus $R^{1/4}$, whereas when $n > 4$ the model profile curves
upwards in this plot. Hence when equation (1) is fitted to data for a
low-luminosity elliptical galaxy, the best-fitting value of $n$ is likely to be
smaller than 4. Conversely, when equation (1) is fitted to data for very
luminous elliptical galaxies, particularly cD systems, values of $n$ in excess
of 4 are usually found.
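
The half-light condition that fixes $b_n$ can be checked numerically. The following is a minimal sketch, not the method of any of the cited papers: the function names (`sersic_b`, `sersic_profile`, `enclosed_fraction`), the Simpson integration and the bisection bracket are all illustrative assumptions, and the routine is only intended for roughly $0.5 \le n \le 10$.

```python
import math

def _simpson(f, a, b, steps=4000):
    """Composite Simpson rule on [a, b]; `steps` must be even."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

def enclosed_fraction(k, n):
    """Fraction of the total light inside R_e when I(R) is proportional to
    exp(-k (R/R_e)**(1/n)). With t = (R/R_e)**(1/n), the light inside R is
    proportional to the integral of t**(2n-1) * exp(-k t)."""
    f = lambda t: t ** (2 * n - 1) * math.exp(-k * t)
    inner = _simpson(f, 0.0, 1.0)             # light inside R_e
    outer = _simpson(f, 1.0, 1.0 + 60.0 / k)  # light outside R_e (tail truncated)
    return inner / (inner + outer)

def sersic_b(n):
    """Base-10 constant b_n such that half the light falls inside R_e.
    Solved by bisection on the natural-log constant k = b_n * ln(10)."""
    lo, hi = 0.05, 50.0                       # assumed bracket for k
    for _ in range(50):
        mid = 0.5 * (lo + hi)
        if enclosed_fraction(mid, n) < 0.5:
            lo = mid                          # profile too extended: raise k
        else:
            hi = mid
    return 0.5 * (lo + hi) / math.log(10.0)

def sersic_profile(R, I_e, R_e, n, b=None):
    """Generalized de Vaucouleurs law, equation (1) of the box."""
    if b is None:
        b = sersic_b(n)
    return I_e * 10.0 ** (-b * ((R / R_e) ** (1.0 / n) - 1.0))
```

For $n = 4$ this sketch should recover the constant of the $R^{1/4}$ law, and hence a central brightness of roughly $2000 I_e$.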

Another three-parameter fitting function that is sometimes used is
the Hubble-Oemler law

$$ I(R) = \frac{I_0\,e^{-R^2/R_t^2}}{(1 + R/r_0)^2}. \qquad (2) $$

Here $I_0$ is the central surface brightness, and $r_0$ is the radius interior
to which the surface brightness is approximately constant. For $R$ in
the range $r_0 < R < R_t$, the surface brightness varies approximately as
$I \sim R^{-2}$. Beyond $R_t$ the surface brightness falls off rapidly, with the
consequence that the law predicts finite total luminosity. In the limit
$R_t \to \infty$, equation (2) reduces to the Hubble law

$$ I(R) = \frac{I_0}{(1 + R/r_0)^2}. \qquad (3) $$

This functional form was introduced by Reynolds (1913) and popularized
by Hubble (1930). Note that in Hubble's law, the quantity $R/r_0$
is squared after it has been added to unity, rather than before, as in
the modified Hubble law of equation (4.13). In consequence of this
difference, the central gradient of the surface brightness, $(dI/dR)_{R=0}$, is
non-zero according to the Hubble law but zero according to the modified
Hubble law. Both laws predict $I \sim R^{-2}$ for $R \gg r_0$ and therefore result
in an unphysically infinite value for the total luminosity of the galaxy
being modeled.
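
As a worked check of these two statements (a sketch only, writing $x \equiv R/r_0$ and taking the modified Hubble law in the form described above, in which $R/r_0$ is squared before being added to unity), the central gradients and the luminosities interior to $R$ follow by direct differentiation and integration:

```latex
% Central gradients: non-zero for the Hubble law, zero for the modified law
\frac{dI}{dR}\bigg|_{\rm Hubble} = -\frac{2I_0}{r_0}\,\frac{1}{(1+x)^{3}}
   \;\longrightarrow\; -\frac{2I_0}{r_0} \quad (R \to 0),
\qquad
\frac{dI}{dR}\bigg|_{\rm modified} = -\frac{2I_0}{r_0}\,\frac{x}{(1+x^{2})^{2}}
   \;\longrightarrow\; 0 \quad (R \to 0).

% Luminosity interior to R, L(<R) = 2\pi \int_0^R I(R')\,R'\,dR'
L_{\rm Hubble}(<R) = 2\pi I_0 r_0^{2}\left[\ln(1+x) - \frac{x}{1+x}\right],
\qquad
L_{\rm modified}(<R) = \pi I_0 r_0^{2}\,\ln(1+x^{2}).
```

For $x \gg 1$ both expressions grow as $2\pi I_0 r_0^2 \ln x$, which is the logarithmic divergence of the total luminosity noted above.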


Figure 4.26 Characteristic surface-brightness profiles for elliptical
galaxies of different luminosities. The luminosities are expressed in terms
of absolute V-band magnitudes and correspond to the portion of the galaxy
that lies in projection interior to a circle of radius 16 kpc. These profiles
are based on photometry of 261 elliptical galaxies. [Curve labels:
$M_V = -17$, $-19$, $-21$, $-22$, $-22.5$] [After Schombert (1986) from data
kindly provided by J. Schombert]

Figure 4.27 Surface-brightness profiles of NGC 720, NGC 1199, NGC 1209,
NGC 1395 and NGC 1426. [Horizontal axis: $(R/\mathrm{arcsec})^{1/4}$] [From data
published in Capaccioli, Piotto & Rampazzo (1988)]


surface brightness profile of the cD galaxy at the center of the cluster Abell 
1413. These galaxies are always luminous and tend to lie in regions of excep- 
tionally high galaxy density, such as the center of a cluster of galaxies (Beers 
& Geller 1983) or of a compact group (Morgan, Kayser & White 1975). The 
picture of the Coma cluster shown in Figure 4.29 illustrates this phenomenon. 
Schombert (1986) finds that $I \propto R^{-1.6}$ in the halos of cD galaxies.

The displacement at large R of the surface-brightness profiles of these 
galaxies above the $R^{1/4}$ law is interpreted as being due to the galaxy being
embedded in an extensive luminous halo. It may be profitable to think of this 
halo as belonging to the surrounding cluster of galaxies rather than to the 
galaxy itself (Merritt 1985). In particular, in normal elliptical galaxies the 
velocity dispersion is constant or falling as one moves outwards (see §11.2.1), 
whilst in cD galaxies the velocity dispersion rises outwards into the halo 
(Dressler 1979, Carter et al. 1985). The increase implies that the velocity 
dispersion in the halo is comparable to that of the inner galaxy cluster as
Figure 4.29 The cD NGC 4881 is located near the center of the
Coma cluster and is surrounded by a swarm of much less luminous
galaxies. [Figure courtesy of STScI]


a whole. The ellipticity of cD galaxies generally increases outwards and the 
principal axes of the outer isophotes are always aligned with the principal 
axes of the isopleths (curves of constant number-density of galaxies) of the 
surrounding cluster (Carter & Metcalfe 1980, Rhee & Katgert 1987). This




Figure 4.30 The full curve shows the surface-brightness profile
of a typical dE galaxy in the Virgo cluster from the photographic
photometry of Ichikawa, Wakamatsu & Okamura (1986). Note that
the profile is nearly straight in this log-linear plot because it is well
fitted by the exponential law (4.20). The dashed curve shows the
surface-brightness profile of another galaxy from the same study.
This galaxy approximately obeys the $R^{1/4}$ law and is thought to be a
background giant elliptical galaxy. [Horizontal axis: $R$/arcsec]


phenomenon again points to cD halos belonging to the cluster rather than 
the galaxy. Finally, and most crucially, Struble (1988) discovered in the 
galaxy cluster Abell 545 what appears to be a cD halo that lacks a central 
galaxy. 

Dwarf elliptical galaxies The majority of galaxies in the Local Group 
are low-luminosity elliptical galaxies of the dE or dSph classes and it is 
likely that such dwarf ellipticals are the commonest type of galaxy in every 
environment. Certainly they exist in great numbers in the nearby clusters of 
galaxies such as the Virgo and Fornax clusters (Binggeli et al. 1985, Wirth
& Gallagher 1984). On account of their low luminosities, few are found 
in magnitude-limited samples and in consequence they have not been as 
extensively studied as the giant ellipticals with which we have been largely 
concerned above. 

There is evidence that dwarf ellipticals fall into two classes, compact 
ones and diffuse ones (Kormendy & Djorgovski 1989). The only compact 
dwarf elliptical in the Local Group is M32, which lies very close to the Milky 
Way’s giant companion, M31. However, numbers of compact dE galaxies 
that are not close companions of any giant galaxy have been found in nearby 
galaxy clusters. The radial surface-brightness profiles of these galaxies, which
are reasonably well fitted by the $R^{1/4}$ law, fall at one end of the sequence of
profiles for giant elliptical galaxies shown in Figure 4.26.
Figure 4.31 The surface-brightness profiles of four early-type
galaxies observed by the HST. [After Lauer et al. (1995) from data
kindly provided by K. Gebhardt]


By contrast, the radial surface-brightness profiles of the other class of 
dE galaxies, the diffuse ones, are best fitted by the exponential law 


$$ I(R) = I_d \exp(-R/R_d). \qquad (4.20) $$


Figure 4.30 illustrates this phenomenon. We shall see below that the exponential
law is the hallmark of galactic disks, with the result that its parameter
$R_d$ is called the disk scale length. However, an analysis, along the
lines described in §4.3.3 below, of the distribution of the apparent ellipticities 
of diffuse dE galaxies indicates that these are not highly flattened objects 
(Ichikawa, Wakamatsu & Okamura 1986). 
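
To make explicit why an exponential profile appears as a straight line in a log-linear plot such as Figure 4.30, the law (4.20) can be rewritten in magnitude units. This is a sketch under stated assumptions: the central amplitude of equation (4.20) is written $I_d$, and $\mu_d$ (a symbol introduced here for illustration) denotes the corresponding central surface brightness in magnitudes.

```latex
% Surface brightness in magnitudes: linear in R
\mu(R) = \mu_d - 2.5\log_{10}\!\left[\frac{I(R)}{I_d}\right]
       = \mu_d + \frac{2.5}{\ln 10}\,\frac{R}{R_d}
       \simeq \mu_d + 1.086\,\frac{R}{R_d}.

% Total luminosity of a circularly symmetric exponential profile
L = 2\pi\int_0^\infty I_d\,e^{-R/R_d}\,R\,dR = 2\pi I_d R_d^{2}.
```

The slope of the straight line in a plot of $\mu$ against $R$ therefore gives the scale length directly: the surface brightness fades by about 1.086 magnitudes per scale length.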


Centers of elliptical galaxies Since it has long been suspected that mas- 
sive black holes lurk at the centers of elliptical galaxies, an enormous amount 
of effort has been devoted to understanding these regions. Much of this work 
has been frustrating because the nearest elliptical galaxies (such as M32) are 
small systems, and the largest specimens lie far away. Consequently, the 
effects of seeing seriously degrade ground-based observations of the crucial 
central regions of these systems and completely convincing observations had 
to wait until the launch of the HST. 

Lauer et al. (1995) and Byun et al. (1996) have analyzed the surface-brightness
profiles that were measured by the HST (unfortunately prior to
its refurbishment in March 1994) for 42 early-type galaxies that have neither
an optically luminous AGN nor conspicuous absorption by nuclear dust.
Each isophote was fitted by an ellipse with semi-axes $a$ and $b$, and then
the isophote's surface brightness was plotted against $r = \sqrt{ab}$. Figure 4.31
shows some of these profiles, which extend to 10″ in radius, with an effective
Figure 4.32 The luminosity densities of 40 early-type galaxies
with HST surface-photometry. The point of sharpest curvature 
on each profile has been shifted on this log-log plot to r = rp, 
j = 1. Each curve is obtained by least-squares fitting a spline to 
the observed surface-brightness profiles and then applying equation 
(4.12) to the resulting spline. [After Gebhardt et al. (1996) from 
data kindly provided by K. Gebhardt] 


spatial resolution of better than 0.1″. This figure shows at a glance (i) that
the central brightness profiles display considerable individuality, and (ii) that 
many brightness profiles continue to rise steeply at the smallest measured 
radii. 

This last observation suggests that many of these galaxies have surface-brightness
profiles that diverge as a power law, $I \sim R^{-\gamma}$ with $\gamma > 0$, when
$R \to 0$. From §4.2.3 we
recall that divergent central surface-brightness profiles are only generated
by luminosity densities that diverge more strongly than $j \sim r^{-1}$. Hence it
seems that many of the galaxies observed by the HST have strongly divergent
central luminosity densities. Moreover the galaxies that show no evidence of
divergent surface-brightness profiles probably have luminosity densities that
diverge as $r \to 0$, but do so sufficiently slowly for $I(R)$ to remain finite as
$R \to 0$.
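
The link between the central slopes of $j$ and $I$ can be illustrated with a pure power law. The sketch below assumes the standard line-of-sight projection $I(R) = 2\int_R^\infty j(r)\,r\,dr/\sqrt{r^2 - R^2}$ for a spherical system and a density $j = r^{-\alpha}$ with $\alpha > 1$ at all radii (an idealization; a real galaxy steepens at large $r$). The function names are hypothetical, and the numerical accuracy degrades as $\alpha \to 1$ because the integrand then decays very slowly.

```python
import math

def projected_power_law(R, alpha, u_max=60.0, steps=6000):
    """Numerically project j(r) = r**(-alpha) along the line of sight.

    With r = R cosh(u) the projection integral becomes
    I(R) = 2 R**(1 - alpha) * integral_0^inf cosh(u)**(1 - alpha) du,
    which converges only for alpha > 1. Simpson's rule, `steps` even."""
    if alpha <= 1.0:
        raise ValueError("a pure power law projects to a finite I only for alpha > 1")
    f = lambda u: math.cosh(u) ** (1.0 - alpha)
    h = u_max / steps
    total = f(0.0) + f(u_max)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * f(i * h)
    return 2.0 * R ** (1.0 - alpha) * total * h / 3.0

def projected_power_law_exact(R, alpha):
    """Closed form of the same integral in terms of gamma functions."""
    coeff = math.sqrt(math.pi) * math.gamma((alpha - 1.0) / 2.0) / math.gamma(alpha / 2.0)
    return coeff * R ** (1.0 - alpha)
```

In this idealized case $I \propto R^{1-\alpha}$, so the surface brightness diverges at the center only when $\alpha > 1$, in line with the statement recalled from §4.2.3; a density cusp with $\alpha \simeq 1.9$ would project to $I \sim R^{-0.9}$.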

Gebhardt et al. (1996) have used equation (4.12) to examine the luminosity
densities of the galaxies observed by Lauer et al. (1995) and Byun
et al. (1996). Figure 4.32 shows their estimate of each galaxy's luminosity
density, scaled in radius and luminosity so as to place the point on the curve
that has the greatest curvature at $j = 1$, $r = 1$. At $r < 1$ the curves clearly
fall into two groups. The curves of one group rise steeply towards $r = 0$,
clustering around the power law $j = r^{-1.9}$. The curves of the second group
cluster around $j = r^{-0.8}$. In each group most curves are reasonably straight,
so that $j$ may be approximated by a power law in $r$, although one curve in
the second group bends steadily downwards as $r \to 0$, suggesting that in this
galaxy $j$ may tend to a finite limit as $r \to 0$. At $r > 1$ all curves appear to
belong to a single group that clusters around $j = r^{-2}$.


4.3.2 Color and line-strength gradients in elliptical galaxies 


Elliptical galaxies are redder near their centers than further out. For example,
Franx, Illingworth & Heckman (1989a) find that the 17 galaxies in
their sample exhibit mean color gradients $d(U-R)/d\log_{10}R \simeq -0.25$ and
$d(B-R)/d\log_{10}R \simeq -0.1$.$^{12}$ Peletier et al. (1990) find similar mean color
gradients in a sample of 39 elliptical galaxies and conclude that contours of
constant color coincide with isophotes. These color gradients are of great 
interest because they are almost certainly due to gradients in the mean ages 
and metallicities of the galaxies’ stars. However, as we saw in §4.2, color 
gradients are exceptionally difficult to determine because they depend sen- 
sitively on the sky backgrounds estimated for each band. Fortunately, they 
should be associated with changes in the strengths of various spectral lines 
in the galaxies’ spectra, and it is easier to measure accurately changes in 
spectrum than in color in the presence of an ill-determined sky background. 
Hence much recent work has probed gradients in the compositions of galaxies 
through spectral indices. 

Davies, Sadler & Peletier (1993) measured several of the spectral indices
defined in Table 3.8 out to $R \sim R_e$ in 13 elliptical galaxies. They found that
the indices tended to be constant on isophotes. H$\beta$ generally increases with
radius while the Mg indices and $\langle\mathrm{Fe}\rangle$ decrease with radius. Using the rather
uncertain calibrations discussed in §3.4, they conclude from the Mg and Fe
indices that

$$ \frac{d[\mathrm{Fe/H}]}{d\log_{10} r} = -0.2 \pm 0.09. \qquad (4.21) $$
This gradient agrees with earlier work on index gradients and on color gra- 
dients. However, it must be stressed that the interpretation of indices in 
terms of metallicity is very uncertain. In particular, variations in the mean 
ages of stars at different radii can have a similar effect on indices to that of 
metallicity variations — see Figure 5.23. 

The origin of the outwards increase in H$\beta$ is controversial. One possibility
is that at small radii extended H$\beta$ emission is filling in the absorption
lines in the stellar spectra, thus diminishing the H$\beta$ index. An alternative
explanation is that the mean age of the stars increases from the center outwards,
perhaps because accreted gas has caused new stars to form at small
radii within the last few Gyr.

Davies, Sadler & Peletier (1993) found only weak correlations within 
their sample between the index-gradients and other measured quantities such 


12 See Figure 5.20 for other evidence of color gradients in ellipticals.
as central velocity dispersion. The best correlation is between the gradient
in Mg and the gradient in $U - R$. However, the measurements are subject to
substantial errors, which are capable of degrading tight correlations between 
the variables into weak ones such as those observed. 


4.3.3 Shapes of elliptical galaxies 


Ellipticity In §4.2 we saw that the elongation of isophotes is commonly
characterized by the ellipticity $\epsilon$. When the effects of seeing discussed in
§4.2.2 are discounted, the ellipticity sometimes increases from the center
outwards (as in cD galaxies), sometimes decreases and sometimes varies
non-monotonically as a function of radius. In short, no general rule seems to
govern the variation of $\epsilon$ with $R$, and it is not clear what physical significance
attaches to the observed variations. Consider therefore assigning a single
characteristic ellipticity to each galaxy. Fasano & Vio (1991) have used
CCD photometry to assign to each of 204 elliptical galaxies the ellipticity of
isophotes near $R_e$ (which is where most of the galaxy's light is concentrated).
Lambas, Maddox & Loveday (1992) have determined average ellipticities for
20 399 galaxies that were automatically measured on survey plates; 2135 of
these galaxies were ellipticals. Figure 4.33 shows the frequency distribution
of the Lambas et al. data for ellipticals. The distribution rises gradually from
zero at $\epsilon = 0.8$ to a broad peak around $\epsilon = 0.25$ and then falls sharply as
$\epsilon = 0$ is approached. The smaller sample of Fasano & Vio (1991) is consistent
with this distribution. 

What can we learn about the true shapes of elliptical galaxies from the
data of Figure 4.33? Let us start by assuming that all elliptical galaxies
are spheroidal (that is, either oblate or prolate figures of revolution). Consider
then a spheroidal galaxy whose isodensity surfaces are described by the
Figure 4.34 Evaluating the apparent axial ratio of a rotationally
symmetric galaxy.


equation
$$ (\zeta x)^2 + (\zeta y)^2 + z^2 = c^2. \qquad (4.22) $$


If $\zeta > 1$, the galaxy will be prolate, and if $\zeta < 1$, it will be oblate. In
either case, $c$ is half the length of the symmetry axis of the isodensity surface
that is defined by equation (4.22); the $x$ and $y$ semi-axes of this surface are
each of length $c/\zeta$. Consider the appearance of the galaxy to an observer
whose line of sight makes an angle $\theta$ with the $z$ axis (Figure 4.34 shows the
geometry of this situation). Without loss of generality, we may take the $y$
axis to be the line of nodes,$^{13}$ so that one apparent semi-axis has length $c/\zeta$
and the other principal semi-axis has length $A$, as shown in Figure 4.34. In
the notation of the figure,

$$ (\zeta x_0)^2 + z_0^2 = c^2, \qquad (4.23) $$

and, by differentiation of this equation, we find for the slope $s$ of the tangent
at $(x_0, z_0)$

$$ s = -\frac{\zeta^2 x_0}{z_0}. \qquad (4.24) $$

Because this tangent is parallel to the line of sight, $s^2 = \cot^2\theta$, so that

$$ \cot^2\theta = \frac{\zeta^4 x_0^2}{z_0^2} = \zeta^2\left(\frac{c^2}{z_0^2} - 1\right), \qquad (4.25) $$

which implies

$$ \frac{c^2}{z_0^2} = \frac{\cot^2\theta}{\zeta^2} + 1. \qquad (4.26) $$

But

$$ \frac{A}{\sin\theta} = z_0 - s x_0 = z_0 + \frac{\zeta^2 x_0^2}{z_0} = \frac{c^2}{z_0}. \qquad (4.27) $$


13 An object's line of nodes is the line through the center of the object that lies in
both the plane of the sky and the object's equatorial plane.
Thus, using equation (4.26), we obtain

$$ A^2 = \left(\frac{c^2}{z_0}\right)^2\sin^2\theta = c^2\left(\sin^2\theta + \frac{\cos^2\theta}{\zeta^2}\right). \qquad (4.28) $$

In the oblate case, the apparent axial ratio (when defined to be less than 1)
is $q = \zeta A/c$, and $q = c/(\zeta A)$ in the prolate case. Therefore we have

$$ \zeta^2\sin^2\theta + \cos^2\theta = \begin{cases} q^2 & \text{(oblate)}, \\ 1/q^2 & \text{(prolate)}. \end{cases} \qquad (4.29) $$
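
Equation (4.29) is easy to evaluate directly. The following minimal sketch (the function name `apparent_axis_ratio` is an illustrative assumption) returns the apparent axis ratio $q \le 1$ for a spheroid with shape parameter $\zeta$ viewed at angle $\theta$ to its symmetry axis.

```python
import math

def apparent_axis_ratio(zeta, theta):
    """Apparent axis ratio q (<= 1) from equation (4.29).

    zeta  : shape parameter of equation (4.22); zeta < 1 oblate, zeta > 1 prolate
    theta : angle between the line of sight and the symmetry (z) axis, in radians
    """
    s = zeta ** 2 * math.sin(theta) ** 2 + math.cos(theta) ** 2
    if zeta <= 1.0:
        return math.sqrt(s)        # oblate: q**2 = s
    return 1.0 / math.sqrt(s)      # prolate: 1/q**2 = s
```

Viewed along the symmetry axis ($\theta = 0$) either kind of spheroid appears round, $q = 1$; viewed from the equatorial plane ($\theta = \pi/2$) an oblate body shows its true axis ratio $\zeta$ and a prolate body shows $1/\zeta$.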


Let us now use equation (4.29) to determine the distribution of apparent
ellipticities that will arise from a large number of spheroidal galaxies whose
symmetry axes are randomly oriented with respect to the line of sight. In
that case, of the $n(\zeta)\,d\zeta$ galaxies with shape parameter near $\zeta$,
a fraction $\sin\theta\,d\theta$ will have
their symmetry axes directed at angle $\theta$ to the line of sight. By equation
(4.29), $q$ is a function $q(\theta)$ of $\theta$. To have $q$ between $q$ and $q + dq$, the
symmetry axis of the galaxy has to be oriented between $\theta$ and $\theta + d\theta$, where
$d\theta = dq/|dq/d\theta|$. Therefore the probability $P(q|\zeta)\,dq$ that a galaxy with true
axis ratio $\zeta$ is observed to have apparent axis ratio in the range $(q, q + dq)$ is

$$ P(q|\zeta)\,dq = \frac{\sin\theta\,dq}{|dq/d\theta|}. \qquad (4.30) $$

Let there be $f(q)\,dq$ galaxies with axial ratios in the range $(q, q + dq)$. This
number is equal to $\int d\zeta\,n(\zeta)P(q|\zeta)\,dq$. So equating these expressions and
dividing out $dq$ we have

$$ f(q) = \int d\zeta\,n(\zeta)\,\frac{\sin\theta}{|dq/d\theta|}. \qquad (4.31) $$

From equation (4.29), one has for oblate geometry,

$$ \left|\frac{dq}{d\theta}\right| = \frac{1}{q}\left[(1 - q^2)(q^2 - \zeta^2)\right]^{1/2}, \qquad
   \sin\theta = \left(\frac{1 - q^2}{1 - \zeta^2}\right)^{1/2}; \qquad (4.32) $$

and for prolate geometry,

$$ \left|\frac{dq}{d\theta}\right| = q\left[(1 - q^2)(\zeta^2 q^2 - 1)\right]^{1/2}, \qquad
   \sin\theta = \frac{1}{q}\left(\frac{1 - q^2}{\zeta^2 - 1}\right)^{1/2}. \qquad (4.33) $$

Figure 4.35 Probability density of observing axis ratio $q$ for a
spheroidal system of true axis ratio $\zeta$. The full curve corresponds to
oblate geometry and $\zeta = 0.3$, while the dashed curve is for prolate
geometry and $\zeta = 0.5$. [Horizontal axis: $q$]


For prolate geometry, equation (4.31) therefore becomes

$$ f(q) = \frac{1}{q^2}\int_{1/q}^{\infty} \frac{n(\zeta)\,d\zeta}{\left[(\zeta^2 - 1)(\zeta^2 q^2 - 1)\right]^{1/2}}. \qquad (4.34) $$

In the prolate case, if the true axial ratio $\beta$ is considered to be a quantity less
than 1, then one has $\beta = 1/\zeta$. Therefore, if we define $N(\beta)$ to be the number
density of galaxies with true axial ratios near $\beta$, we have $N(\beta)\,d\beta = n(\zeta)\,d\zeta$,
and

$$ f(q) = \frac{1}{q^2}\int_0^q \frac{\beta^2 N(\beta)\,d\beta}{\left[(1 - \beta^2)(q^2 - \beta^2)\right]^{1/2}}. \qquad (4.35) $$

In the oblate case, $\beta = \zeta$ and equations (4.31) and (4.32) lead immediately
to

$$ f(q) = q\int_0^q \frac{N(\beta)\,d\beta}{\left[(1 - \beta^2)(q^2 - \beta^2)\right]^{1/2}}. \qquad (4.36) $$

Equations (4.35) and (4.36) are integral equations relating the known (observed)
frequency of galaxies of apparent axial ratio $q$ to the unknown frequency
$N(\beta)$ of galaxies of true axial ratio $\beta$. The only difference between
them is the extra factor of $\beta^2/q^3$ in the prolate case, which reduces the
contribution of highly aspherical ($\beta \ll 1$) galaxies to the observed number of
apparently round galaxies. Lambas, Maddox & Loveday (1992) solve these
equations for polynomial fits to the data of Figure 4.33 and find that, for
both prolate and oblate geometry, $N(\beta)$ turns negative near $\beta = 1$. Since it
is logically impossible for a frequency function such as $N(\beta)$ to be negative,
this result implies that something is seriously wrong with either the data or
its analysis.

The nature of the problem is made clear by the full curve in Figure 4.35,
which shows, for oblate geometry and $\zeta = 0.3$, the probability density $P(q|\zeta)$
that is given by substituting equations (4.32) into equation (4.30). This
declines from very large values near $q = \zeta$ to a plateau that continues to
$q = 1$. When an infinite number of such distributions (one for each value
of $\zeta$) is added together, it is clear that the resulting summed distribution

Figure 4.36 $P(q|\eta,\zeta)$ for $\eta = 0.8$, $\zeta = 0.3$.


will be fairly flat near $q = 1$. By contrast, the observed distribution of
Figure 4.33 shows a pronounced drop in the vicinity of $q = 1$. The result
of Lambas, Maddox & Loveday (1992) is that this drop cannot be produced
by any sum of distributions like the full curve in Figure 4.35. The dashed
curve in Figure 4.35 shows that the apparent axis ratio $q$ of a prolate system
is much more likely to lie near the true axis ratio $\zeta$ and less likely to lie near
unity than in the oblate case. Nevertheless, $P(q|\zeta)$ remains non-zero in the
neighborhood of $q = 1$, so the study of Lambas, Maddox & Loveday (1992)
finds that the drop in the observed distribution of ellipticities near $q = 1$
cannot be reproduced by any distribution of prolate spheroidal systems.
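
The behavior of the two curves of Figure 4.35 near $q = 1$ can be reproduced from equations (4.30), (4.32) and (4.33). The sketch below is illustrative only: the function names are hypothetical, both cases are parameterized by the true axis ratio $\beta < 1$ (so $\beta = \zeta$ for oblate and $\beta = 1/\zeta$ for prolate bodies), and the cumulative forms follow from the fact that $\cos\theta$ is uniformly distributed for random orientations.

```python
import math

def pdf_oblate(q, beta):
    """P(q|beta) for an oblate spheroid of true axis ratio beta < q <= 1."""
    return q / math.sqrt((1.0 - beta ** 2) * (q ** 2 - beta ** 2))

def cdf_oblate(q, beta):
    """Probability that the apparent axis ratio is <= q (oblate case)."""
    return math.sqrt((q ** 2 - beta ** 2) / (1.0 - beta ** 2))

def pdf_prolate(q, beta):
    """P(q|beta) for a prolate spheroid of true axis ratio beta < q <= 1."""
    return beta ** 2 / (q ** 2 * math.sqrt((1.0 - beta ** 2) * (q ** 2 - beta ** 2)))

def cdf_prolate(q, beta):
    """Probability that the apparent axis ratio is <= q (prolate case)."""
    return math.sqrt((q ** 2 - beta ** 2) / (1.0 - beta ** 2)) / q
```

At $q = 1$ these densities equal $1/(1 - \beta^2)$ in the oblate case and $\beta^2/(1 - \beta^2)$ in the prolate case. Both are strictly positive for any $\beta < 1$, which is why no superposition of randomly oriented spheroids of either kind can produce a distribution that falls to zero at $q = 1$.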

Figure 4.36 shows the probability density $P(q|\eta,\zeta)$ of observing axis
ratio $q$ for a triaxial system in which the luminosity density is a function
$j$ of the ellipsoidal radius that is defined by equation (4.17). For the
distribution shown, $\eta = 0.8$ and $\zeta = 0.3$; the body in question is strongly
flattened and only slightly elliptical in the $xy$-plane. The probability density
diverges at two values of $q$, namely $q = \zeta/\eta$ and $q = \eta$, and falls steeply to
zero at both $q = \zeta$ and $q = 1$. It is clear that the steep decline in $P(q|\eta,\zeta)$
as $q = 1$ is approached has the potential to fit the depression near $q = 1$ in
the observed distribution of Figure 4.33, and Lambas, Maddox & Loveday
(1992) do indeed find a distribution of triaxial bodies that can account for
the data. In this distribution $\zeta$ is Gaussianly distributed with central value
$\zeta_0 = 0.55$ and dispersion $\sigma_\zeta = 0.2$, while $\eta$ is likewise Gaussianly distributed
with central value $\eta_0 = 0.95$ and dispersion $\sigma_\eta = 0.35$. (The Gaussian
distributions are obviously truncated at 0 and 1 and are further constrained
such that $\eta > \zeta$.)

The distribution derived by Lambas, Maddox & Loveday (1992) is not 
unique, but it seems likely that any acceptable distribution will be dominated 
by systems which are nearly oblate axisymmetric bodies, rather than nearly 
prolate axisymmetric objects. 

This conclusion is strongly reinforced by the observation that galaxies 
of all types frequently exhibit the phenomenon of isophote twist that was 
discussed from a theoretical view-point in §4.2.3. As an example, Figure 4.37 
shows the twisted isophotes of the E3 galaxy NGC 5831. In §4.2.3 we saw
Figure 4.37 Isophotes (at intervals of 0.5 in $\mu$) of the E3 galaxy
NGC 5831. The heavier lines indicate the semi-major axes of the ellipses
that best fit the isophotes at two radii. These axes have lengths
$a = 4$ arcsec and $a = 40$ arcsec. [From data kindly provided by R. Bender]


that axisymmetric objects cannot have twisted isophotes, whereas triaxial 
systems are expected to exhibit them. Unfortunately, it has not proved 
possible to use observations of isophote twists to quantify the triaxiality of 
galaxies. 

In §11.2.1 we shall see that kinematic data also lead to the conclusion 
that many galaxies are mildly triaxial. 


Deviations from ellipses In §4.2 we saw that deviations of isophotes
from ellipses may be conveniently quantified by the Fourier coefficients $a_n$
defined by equation (4.10). Significantly non-zero values of $a_3/a$, $b_3/a$, $b_4/a$
and $a_4/a$ are frequently measured, but the higher Fourier coefficients in
equation (4.10) are usually smaller than the measurement errors. Peletier et al.
(1990) find that $a_3$ and $b_3$ are sensitive diagnostics for the presence of dust
in elliptical galaxies. As we have seen, the diskiness $a_4/a$ is sensitive to the
presence of an embedded disk.

Normal elliptical galaxies have values of $a_4/a$ that are smaller than a few
per cent. Larger values are easily detectable by eye and an object with large
$|a_4/a|$ has in the past been classified as either an S0 or an irregular object.
The least round galaxies have the largest values of $|a_4/a|$. In fact, galaxies
with $\epsilon \gtrsim 0.35$ almost always have $|a_4/a| > 0.005$ and are therefore either disky
or boxy (Bender et al. 1989). This suggests that any elliptical galaxy will
have non-elliptical isophotes when viewed edge-on, and apparently elliptical
galaxies are those that are viewed at inclinations $i < 60°$. Indeed, we know
from the deprojection studies mentioned in §4.2 that one cannot tell whether
a galaxy is really disky or boxy when one views it at $i \lesssim 60°$.

Bender et al. (1989) discuss some intriguing correlations between the
diskiness $a_4/a$ and other, apparently unrelated, properties of elliptical galaxies.
Figure 4.38 shows plots against $a_4/a$ of (i) radio power at 1.4 GHz, and
(ii) X-ray luminosity. The left panel shows that essentially all disky ellipticals
Figure 4.38 Disky galaxies show neither much X-ray power nor much radio power. In
the left-hand panel, X-ray luminosity is plotted against $a_4/a$, while the right-hand panel
shows a plot of power at 1.4 GHz against $a_4/a$. There is a conspicuous absence of points
in the upper-right quarter of each panel. [From data published in Bender et al. (1989)]


are only weak radio sources ($P_{1.4} < 10^{24}\,\mathrm{W\,Hz^{-1}}$), while boxy ellipticals
are frequently up to a factor of 10000 more powerful at 1.4 GHz. The division
between the two groups is remarkably abrupt, at $a_4/a \sim 0.005$. The
right-hand panel in Figure 4.38 shows that all disky ellipticals in the sample
have X-ray luminosities below $3 \times 10^{33}$ W, while most boxy ellipticals have
larger X-ray luminosities. This correlation becomes even more striking if one
decomposes the measured X-ray luminosities of the galaxies into the probable
contributions from binary stars and the interstellar medium (see §8.3):
this decomposition suggests that only boxy galaxies have X-ray luminous
interstellar media.

The correlations shown in Figure 4.38 are the more striking because 
we know that a4/a depends on the inclination at which we view a galaxy, 
whereas $P_{1.4}$ and $L_X$ do not. Therefore even if there were a perfect correlation
between the true shapes and luminosities of elliptical galaxies, there would 
still be an imperfect correlation between their apparent shapes and their 
luminosities. 

Are the correlations shown in Figure 4.38 reflections of other correla- 
tions, or somehow intrinsic? It is known that both radio power and X-ray 
luminosity are strongly correlated with optical luminosity — see §8.3. Hence 
the correlations of Figure 4.38 would be expected if diskiness were correlated 
with optical luminosity. Bender et al. (1989) show that any such correlation 
is weak. Hence it seems that the correlation of diskiness with radio and X-ray 
luminosities is real. Its origin is not adequately understood. 

A possible key to this puzzle is provided by examination of the connection
between diskiness and the rate at which elliptical galaxies rotate. In
§11.2.1 we define the indicative rotation rate $(v/\sigma)^*$, which is of order unity
for systems that are flattened by rotation, and much less than unity if rotation
is dynamically unimportant. The left-hand panel of Figure 4.39 shows
Figure 4.39 Boxy galaxies frequently rotate very slowly, and then about an axis other
than the apparent minor axis. The left panel shows that the indicative rotation rate
$(v/\sigma)^*$ [equation (11.14)] is invariably of order unity for disky galaxies but sometimes
very small for boxy galaxies. The right-hand panel shows that boxy galaxies frequently
show a non-negligible velocity gradient along their minor axes. [From data kindly provided
by R. Bender]


that $(v/\sigma)^*$ is invariably of order unity for disky galaxies, which suggests that
these systems are indeed flattened by their spins. The right-hand panel of
Figure 4.39 lends support to this conjecture by showing that in these galaxies
the velocity gradient along the apparent major axis is invariably much
larger than that along the apparent minor axis, just as we expect if these
systems are axisymmetric bodies that rotate about their short axes. Since,
by contrast, many boxy galaxies have small values of $(v/\sigma)^*$, rotation is often
dynamically unimportant in these systems. Moreover, the right-hand panel
of Figure 4.39 shows that there is often a significant velocity gradient along
the apparent minor axis of a boxy galaxy. This may be because these objects
are triaxial, in which case the apparent minor axis is not expected to be of
any dynamical significance.

The correlations with $a_4$ that are shown in Figure 4.39 strongly suggest
that many boxy galaxies are triaxial systems with little net rotation.
We would not expect the ISM in such a system to have much net rotation 
either. Moreover, triaxiality of the galaxy’s potential will facilitate the ex- 
change of angular momentum between the stellar component and the ISM. 
Consequently, in a boxy galaxy interstellar gas may not be prevented from 
reaching the nucleus by a centrifugal ‘barrier’ of the type that operates in 
a disky galaxy. This absence of a centrifugal barrier may account for the 
propensity to powerful radio and X-ray emission in boxy galaxies that is 
demonstrated by Figure 4.38. 


Fine structure The three-parameter fitting functions discussed above
provide a good description of the large-scale properties of elliptical galaxies.
However, detailed photometric studies have revealed that some galaxies
contain significant small-scale departures from these smooth radial profiles.
Sadler & Gerhard (1985) estimated that about 40% of ellipticals contain
dust lanes; the fraction of systems in which dust lanes have been detected
is significantly smaller because the visibility of a dust lane varies significantly
Figure 4.40 The elliptical galaxy NGC 3923. This negative image
from the Anglo-Australian Telescope has been manipulated to enhance
the appearance of sharp-edged features; this process reveals
the presence of a number of ripples or shells. [Courtesy of D. Malin]


with its orientation to the line of sight.

Between 10% and 20% of early-type galaxies are found to contain sharp 
steps in their luminosity profiles like those apparent around NGC 3923 in 
Figure 4.40 (Schweizer 1980, Malin & Carter 1983, Sadler & Gerhard 1985). 
These features are called ripples or shells. In many, but by no means all 
systems (Prieur 1990), ripples tend to form arcs of ellipses that are centered 
on the galactic nucleus. In highly developed systems of ripples such as that 
shown in Figure 4.40, the radii of ripples on opposite sides of the galaxy are 
interleaved. That is, if on one side there are ripples at $r = a_1, a_2, \ldots$ and on
the opposite side ripples occur at $r = b_1, b_2, \ldots$, then $a_1 < b_1 < a_2 < b_2 < \cdots$.
The radii of ripples within a single galaxy can differ by a factor of 30 or more;
in the case of NGC 3923 the innermost ripple is at $r \sim 3$ kpc while the
outermost lies at $r \sim 95$ kpc. Ripples generally involve 3-5% deviations
of the galaxy’s surface-brightness profile from a smooth underlying law, and 
their colors indicate that their light comes from the kind of stellar population 
that one would expect to encounter in an early-type galaxy (Pence 1986). 
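
The interleaving property described above amounts to a simple ordering test on two lists of radii. The sketch below is purely illustrative: the function name `ripples_interleaved` is hypothetical, the radii in the usage note are made-up numbers rather than measurements of any galaxy, and the test accepts either side of the galaxy as hosting the innermost ripple.

```python
def ripples_interleaved(side_a, side_b):
    """Return True if ripple radii on the two sides of a galaxy alternate
    when sorted by radius, e.g. a1 < b1 < a2 < b2 < ...

    side_a, side_b : iterables of ripple radii (same units) on opposite sides.
    """
    tagged = [(r, "a") for r in side_a] + [(r, "b") for r in side_b]
    tagged.sort()                              # order all ripples by radius
    sides = [side for _, side in tagged]
    # Interleaved means no two consecutive ripples lie on the same side.
    return all(s1 != s2 for s1, s2 in zip(sides, sides[1:]))
```

For example, hypothetical radii `[3, 10, 30]` on one side and `[5, 18, 60]` on the other are interleaved, whereas `[3, 4]` and `[5, 6]` are not.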

Schweizer & Seitzer (1988) have shown that ripples occur in S0 and
Sa galaxies as well as in ellipticals. Since their detectability in early-type
systems depends upon the contrast between their sharp edges and the smooth
background brightness profiles of the underlying galaxies, ripples would be
very hard to detect in late-type systems, so we do not know how universal a
phenomenon they are.

A stellar system can display a sharp edge only if some parts of its phase 
space (see BT §4.1) are very much more densely populated with stars than 
neighboring parts of phase space. In the classical dynamical model of an 
elliptical (BT §4.4), phase space is populated very smoothly. Therefore the 
existence of ripples directly challenges the classical picture of ellipticals. One 
likely possibility is that ellipticals acquire ripples late in life as a result of 
accreting material from a system within which there are relatively large gra- 
dients in phase-space density. Systems with large density gradients in phase 
space include disk galaxies and dwarf galaxies: in a thin disk the phase-space 
density of stars peaks strongly around the locations of circular orbits, while 
in a dwarf galaxy all stars move at approximately the systemic velocity, so 
that there is only a small spread in velocity space. 

Numerical simulations suggest that ripples can indeed form when mate- 
rial is accreted from either a disk galaxy or a dwarf system — see Barnes & 
Hernquist (1992) for a review. Moreover, simulations have successfully re- 
produced the interleaved property of ripples described above. Despite these 
successes significant uncertainties still surround the ripple phenomenon be- 
cause the available simulations have important limitations, and it is not clear 
how probable their initial conditions are. 

Schweizer et al. (1990) defined an index Σ that quantifies the amount 
of fine structure such as ripples that a galaxy possesses: 

$$\Sigma = S + \log(1 + n) + J + B + X. \tag{4.37}$$

Here S measures the strength of the most prominent ripple on a scale of 0 to 
3; n is the number of detected ripples; J is the number of optical ‘jets’; B is 
a measure of the boxiness of the galaxy’s isophotes on a scale of 0 to 4; X is 
0 or 1 depending on whether the galaxy’s image shows an X structure. For 
a sample of 69 nearby early-type galaxies Σ varies from 0 to 7.6. Notice that 
Σ is defined such that it does not include a term sensitive to the presence of 
dust; this definition is motivated by the observation that while ripples, jets, 
boxiness and X structures tend to occur together, no correlation between 
these features and dust lanes is apparent (Sadler & Gerhard 1985, Schweizer 
& Seitzer 1992). Schweizer et al. (1990) and Schweizer & Seitzer (1992) find 
that Σ is correlated both with line-strengths and with broad-band colors in 
the sense that the larger Σ is, the weaker are a system’s lines and the bluer its 
colors; Figure 4.41 shows the correlation of Σ with B−V for a sample of 32 
elliptical galaxies. As we shall see in §5.4, the most natural interpretation 
of this finding is that galaxies that have significant fine structure contain 
larger numbers of relatively youthful stars than do galaxies that lack such 
structure. This is very much what we would expect if fine structure formed 
during accretion events since there is evidence that such events are associated 
with bursts of star formation (see §4.6.1). 
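
The fine-structure index of equation (4.37) is easy to evaluate once the five scores are in hand. The following is a minimal sketch rather than the procedure of Schweizer et al. (1990): the function name and the example scores are hypothetical, and the logarithm is assumed to be base 10, which the text does not state explicitly.

```python
import math

def fine_structure_index(S, n, J, B, X):
    """Sigma of equation (4.37): S + log(1 + n) + J + B + X."""
    if not 0 <= S <= 3:
        raise ValueError("S is scored from 0 to 3")
    if not 0 <= B <= 4:
        raise ValueError("B is scored from 0 to 4")
    if X not in (0, 1):
        raise ValueError("X is 0 or 1")
    if n < 0 or J < 0:
        raise ValueError("n and J are counts and cannot be negative")
    # Assumption: "log" means the base-10 logarithm.
    return S + math.log10(1 + n) + J + B + X

# Hypothetical scores, not taken from any published table.
smooth = fine_structure_index(S=0, n=0, J=0, B=0, X=0)
rippled = fine_structure_index(S=2, n=9, J=1, B=1, X=0)
print(smooth, rippled)
```

Because n enters only logarithmically, a galaxy with many faint ripples gains less than one with a single strong ripple, a jet, or markedly boxy isophotes.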


4.3.4 Correlations among global parameters of elliptical galaxies 


The basic structural parameters of an elliptical galaxy can be divided into 
two sets. One set consists of the shape parameters ε, a_4/a and (v/σ)* 
discussed above. The other set consists of the shape-independent parameters, 
namely the luminosities, L_X, in various bands X, effective radius, R_e 
(measured in kpc), effective surface brightness, I_e, mean colors, B − V etc., 
line-strength indices, Mg_2 etc., and central velocity dispersion σ_0. Just as 
there are correlations between the shape parameters, there must be correlations 
between the shape-independent parameters. For example, it is obvious that 
more luminous galaxies will be physically larger, so it would be very 
surprising if L_B and R_e were not positively correlated. These correlations are 
enormously important for both practical and theoretical cosmology. Their 
practical importance arises because they relate quantities such as σ_0 that can 
be determined without a knowledge of a galaxy’s distance, to quantities such 
as L_B that can be determined only for galaxies of known distance. Hence 
these correlations can be used to infer the distances to galaxies. Their 
importance for theoretical cosmology is that they provide much-needed 
constraints on theories of galaxy formation. 

We saw above that the observed gradients within galaxies in colors such 
as B − R and line-strength indices such as Mg_2 are consistent with one 
another. Similarly, Bender et al. (1992) report that the B − V colors of E 
and S0 galaxies are tightly correlated with their measured Mg_2 values (see 
Figure 4.42). The tight relationship between colors and line strength in the 
left panel of Figure 4.42 is a non-trivial result because the two variables 
plotted relate to different volumes within each galaxy: the Mg_2 index depends 
only on the spectrum of light from the innermost few arcseconds of the image, 
while the measured value of B − V is dominated by light from R ~ R_e. 
Thus the tight correlation of these variables displayed in the left panel of 
Figure 4.42 does not merely imply that line-strength is tightly correlated 
with broad-band color in a given spectrum. In addition it requires a degree of 
consistency in galactic color and line-strength gradients. We saw independent 
evidence for this consistency in §4.3.2 above. 

Figure 4.42 Both the broad-band colors and the central velocity 
dispersions of elliptical galaxies are strongly correlated with 
line-strength indices such as Mg_2. [From data published in Bender et 
al. (1992)] 

Rather surprisingly, it turns out that the central velocity dispersion 
σ_0 of a galaxy is extremely tightly correlated with the galaxy’s colors and 
line-strength indices (see the right panel of Figure 4.42). The origin of 
this correlation between the speeds with which stars move and the mean 
spectrum of the light that they emit is not properly understood. 

In §11.2.1 we shall see that in a typical elliptical galaxy, the velocity 
dispersion σ varies with radius just as Mg_2 does (Figure 11.6). Davies, Sadler 
& Peletier (1993) find that the relationship σ(Mg_2) defined by the gradients 
in σ and Mg_2 differs slightly from the relationship σ_0(Mg_2) that is defined 
by the right panel of Figure 4.42. 

Figure 4.43 Correlations between four shape-independent parameters of elliptical 
galaxies. The parameters are the effective radius R_e, the mean surface brightness 
within R_e, ⟨I⟩_e, the central velocity dispersion σ_0 and L_e, the luminosity in 
Djorgovski’s G band interior to R_e. The luminosity and the surface brightness are 
expressed in magnitudes and in magnitudes per square arcsecond, respectively. 
[From data published in Djorgovski & Davis (1987)] 

Let us now turn to correlations that depend on the distances to galaxies. 
Kormendy (1977) demonstrated that a correlation exists between R_e and I_e 
in the sense that larger galaxies have fainter effective surface brightnesses. 
This surface brightness–effective radius relation is shown in the bottom 
left panel of Figure 4.43, where we plot ⟨I⟩_e, the mean surface brightness 
interior to R_e, in place of I_e, which is more susceptible to measurement error. 
The mean regression is R_e ∝ ⟨I⟩_e^(−0.83±0.08) (Djorgovski & Davis 1987). Let 
L_e be the luminosity interior to R_e. Then since L_e = π⟨I⟩_e R_e², it follows 
from the correlation of R_e and ⟨I⟩_e that ⟨I⟩_e decreases with L_e; one finds 
⟨I⟩_e ~ L_e^(−3/2). More luminous ellipticals have lower surface brightnesses. 
The lower right panel of Figure 4.43 shows the correlation of which this is 
the mean regression. 
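
The step from the surface brightness–effective radius regression to the exponent −3/2 can be checked in a few lines. This is a sketch under the assumption that the regression is an exact power law, R_e ∝ ⟨I⟩_e^k, so that L_e = π⟨I⟩_e R_e² ∝ ⟨I⟩_e^(1+2k); the function name is hypothetical, and feeding the quoted ±0.08 straight through is only a crude indication of the sensitivity, not a proper error analysis.

```python
def sb_luminosity_exponent(k):
    """Return m such that <I>_e is proportional to L_e**m, given R_e ~ <I>_e**k.

    L_e = pi <I>_e R_e^2 scales as <I>_e**(1 + 2k), so m = 1 / (1 + 2k).
    """
    return 1.0 / (1.0 + 2.0 * k)

best = sb_luminosity_exponent(-0.83)      # central value of the regression
shallow = sb_luminosity_exponent(-0.75)   # k = -0.83 + 0.08
steep = sb_luminosity_exponent(-0.91)     # k = -0.83 - 0.08
print(best, shallow, steep)
```

The central value gives an exponent close to −3/2, but because 1 + 2k is small the result is quite sensitive to k.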

More luminous ellipticals have larger central velocity dispersions. The 
upper right panel in Figure 4.43 illustrates this correlation, which is called 
the Faber–Jackson relation after its discoverers (Faber & Jackson 1976). 
Quantitatively one has 

$$L_e \sim \sigma_0^4. \tag{4.38}$$

Since σ_0 is correlated with L_e and L_e is correlated with R_e it follows that 
σ_0 must be correlated with R_e. The top left panel in Figure 4.43 displays this 
correlation. 

Since σ_0 is strongly correlated with line-strengths and colors, the 
existence of a host of additional correlations involving Mg_2, B − V etc. is 
implied by the top two panels of Figure 4.43. One of the earliest of these to be 
discovered was the color–magnitude relation: Faber (1973) showed that more 
luminous elliptical galaxies have stronger absorption lines, and Visvanathan 
& Sandage (1977) showed that more luminous galaxies are redder. 

In all the correlations of Figure 4.43 there is cosmic scatter; the 
scatter of the points about the mean relations is larger than can be accounted 
for by measurement errors alone. Interestingly, it turns out that the 
residuals between the positions of individual objects and the mean relations are 
correlated. For example, galaxies which lie above the mean Faber–Jackson 
relation in the top right panel tend to lie above the mean 
surface-brightness–magnitude relation in the lower right panel. To understand 
the significance of both the basic correlations and these correlations between 
residuals, it is useful to imagine each galaxy to be represented by a point in a 
three-dimensional space. The Cartesian coordinates x_i of a point in this space 
are the three numbers of the set (log R_e, ⟨I⟩_e, log σ_0), where ⟨I⟩_e is in units 
of μ_B.¹⁴ If there were no correlations between the variables, the points of 
individual galaxies would be fairly uniformly distributed within a cuboidal 
region of our three-dimensional space. Correlations between the variables 
will confine the data points to a sub-volume of this cuboid. For example, if 
two of the variables are genuinely independent and the third dependent on 
these two, the points will be confined to a plane, while if there is only one 
independent variable, the points will lie on a line. Our first step in analyzing 
such data is to ask whether the points are nearly confined to a plane; if we 
can find a suitable plane, we can further enquire whether the points lie on a 
line within that plane. In statistics books this type of investigation is called 
principal component analysis. Box 4.2 explains what has to be done. 

Box 4.2: Principal Component Analysis 

Given M points x^(α) in a space of d dimensions we wish to choose a 
d-dimensional unit vector n and a constant p such that our points lie as 
nearly as possible on the plane n·x = p. To this end we minimize with 
respect to p and the components of n the quantity 

$$S = \sum_\alpha \left[\mathbf{n}\cdot\mathbf{x}^{(\alpha)} - p\right]^2$$

subject to the constraint |n|² − 1 = 0. By the method of Lagrange 
multipliers this problem leads to the equations 

$$0 = \sum_\alpha \left(\mathbf{n}\cdot\mathbf{x}^{(\alpha)} - p\right) x_j^{(\alpha)} - \lambda n_j \quad (j = 1, \ldots, d), \qquad 0 = \sum_\alpha \left(\mathbf{n}\cdot\mathbf{x}^{(\alpha)} - p\right),$$

where λ is the undetermined multiplier. We solve these equations by 
using the last equation to eliminate p from each of the first d equations. 
Then we find that 

$$0 = \sum_\alpha \Big(\mathbf{n}\cdot\mathbf{x}^{(\alpha)} - \frac{1}{M}\sum_\beta \mathbf{n}\cdot\mathbf{x}^{(\beta)}\Big)\, x_j^{(\alpha)} - \lambda n_j.$$

This equation can be rewritten A·n = λn, where 

$$A_{ij} \equiv \sum_\alpha \Big(x_i^{(\alpha)} - \frac{1}{M}\sum_\beta x_i^{(\beta)}\Big)\, x_j^{(\alpha)}. \tag{1}$$

Thus the required vector n is an eigenvector of the real-symmetric matrix 
A. One may show (see Problem 4.7) that the eigenvalues λ are the 
values taken by S for n equal to the corresponding eigenvector. Hence 
the desired n is the eigenvector with the smallest eigenvalue. 

In order to minimize distortion by measuring errors, the x_i should 
have been scaled such that they have numerically equal measuring errors. 
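
The recipe of Box 4.2 can be sketched with the standard library alone. The code below builds the matrix A, diagonalizes it with cyclic Jacobi rotations (a textbook method chosen here for self-containment, not one prescribed by the Box), and returns the eigenvector with the smallest eigenvalue. All function names are hypothetical, and the test points are synthetic: they lie exactly on the plane x + 2y + 2z = 3 and are not galaxy data.

```python
import math

def scatter_matrix(points):
    """Return (A, mean) with A_ij = sum_a (x_i - mean_i)(x_j - mean_j).

    This equals the matrix A of Box 4.2, because the extra mean_j term
    sums to zero over the sample.
    """
    m, d = len(points), len(points[0])
    mean = [sum(p[i] for p in points) / m for i in range(d)]
    a = [[sum((p[i] - mean[i]) * (p[j] - mean[j]) for p in points)
          for j in range(d)] for i in range(d)]
    return a, mean

def jacobi_eigen(a, max_sweeps=50, tol=1e-24):
    """Eigen-decomposition of a small real-symmetric matrix.

    Returns (eigenvalues, vectors); column k of `vectors` is the unit
    eigenvector belonging to eigenvalues[k].
    """
    d = len(a)
    a = [row[:] for row in a]
    v = [[float(i == j) for j in range(d)] for i in range(d)]
    for _ in range(max_sweeps):
        off = sum(a[i][j] ** 2 for i in range(d) for j in range(d) if i != j)
        if off < tol:
            break
        for p in range(d - 1):
            for q in range(p + 1, d):
                if a[p][q] == 0.0:
                    continue
                # Rotation angle that zeroes a[p][q]; keep |theta| <= pi/4.
                theta = 0.5 * math.atan2(2.0 * a[p][q], a[q][q] - a[p][p])
                if theta > math.pi / 4:
                    theta -= math.pi / 2
                elif theta < -math.pi / 4:
                    theta += math.pi / 2
                c, s = math.cos(theta), math.sin(theta)
                for k in range(d):  # rotate columns p, q of A and of V
                    a[k][p], a[k][q] = (c * a[k][p] - s * a[k][q],
                                        s * a[k][p] + c * a[k][q])
                    v[k][p], v[k][q] = (c * v[k][p] - s * v[k][q],
                                        s * v[k][p] + c * v[k][q])
                for k in range(d):  # rotate rows p, q of A
                    a[p][k], a[q][k] = (c * a[p][k] - s * a[q][k],
                                        s * a[p][k] + c * a[q][k])
    return [a[i][i] for i in range(d)], v

def fit_plane(points):
    """Best-fitting plane n.x = p: returns (n, p, S_min)."""
    a, mean = scatter_matrix(points)
    values, vectors = jacobi_eigen(a)
    k = min(range(len(values)), key=lambda i: values[i])
    n = [vectors[i][k] for i in range(len(values))]
    p = sum(ni * mi for ni, mi in zip(n, mean))  # from the p equation
    return n, p, values[k]

# Synthetic points lying exactly on x + 2y + 2z = 3.
points = [(x, y, (3.0 - x - 2.0 * y) / 2.0)
          for x in range(4) for y in range(4)]
n, p, s_min = fit_plane(points)
print(n, p, s_min)
```

For real data the coordinates would first be rescaled to equal measuring errors, as the Box recommends, and the overall sign of n (and hence of p) is arbitrary.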

For the data in Figure 4.43 one finds that the points are as nearly 
confined to a plane as observational errors allow; in the absence of 
observational errors, they might lie precisely on a plane! If the position 
vector of a galaxy in our three-space is g = (log R_e, ⟨I⟩_e, log σ_0), where R_e 
is measured in kpc, ⟨I⟩_e in units of μ_B and σ_0 in km s⁻¹, then the equation of 
this fundamental plane is n·g = 1 where n = (−0.65, 0.22, 0.86). 

14 L_e need not be included in the set since it is related to R_e and ⟨I⟩_e by 
L_e = ⟨I⟩_e R_e², where L_e is measured in W, I_e in W m⁻² sterad⁻¹ and R_e in m. 

Naturally one wants to ‘see’ the fundamental plane. One way to do 
this is to choose an axis, for example the R_e-axis, as the horizontal axis, 
and rotate the space about this axis until the plane appears edge-on. In 
this orientation, the plane’s normal, n, lies in the plane of the projection, 
which is spanned by the unit vector, e_R, that runs parallel to the R_e-axis and 
some other, orthogonal, unit vector e. Thus n = α e_R + β e, where α and β 
are suitable numbers. Comparing this with the value of n given above we 
see that β e = 0.22 e_I + 0.86 e_σ, where e_I and e_σ are the unit vectors along 
the other two axes. Thus the fundamental plane will be seen 
edge-on if we plot log R_e against 0.22(⟨I⟩_e/μ_B) + 0.86 log σ_0. Actually, it 
is conventional to plot 0.26(⟨I⟩_e/μ_B) + log σ_0, which is simply 1/0.86 times 
the linear combination we have derived. Figure 4.44 shows this plot. The 
data points nearly lie on the line which is the edge-on projection of the 
fundamental plane. The equation of this line 

$$\log R_e = 0.36\,(\langle I\rangle_e/\mu_B) + 1.4 \log \sigma_0 \tag{4.39}$$

is also an equation for the fundamental plane, as one may verify by 
multiplying n·g = 1 by 1/0.65. 
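
The edge-on construction amounts to splitting the normal n into its component along the R_e-axis and a remainder that defines the plotted ordinate. A minimal sketch follows, with hypothetical function names and using only the components of n quoted above; the example galaxy values are illustrative, not measurements.

```python
import math

# Components of n along (log R_e, <I>_e / mu_B, log sigma_0), as quoted.
N = (-0.65, 0.22, 0.86)

def edge_on_decomposition(n):
    """Write n = alpha * e_R + beta * e with e a unit vector orthogonal to e_R."""
    alpha = n[0]
    beta = math.hypot(n[1], n[2])
    e = (0.0, n[1] / beta, n[2] / beta)
    return alpha, beta, e

def conventional_ordinate(mean_sb, sigma0_kms, n=N):
    """Ordinate rescaled so that log sigma_0 has unit coefficient."""
    return (n[1] / n[2]) * mean_sb + math.log10(sigma0_kms)

alpha, beta, e = edge_on_decomposition(N)
rebuilt = (alpha, beta * e[1], beta * e[2])
coefficient = N[1] / N[2]                    # close to the conventional 0.26
example = conventional_ordinate(20.0, 100.0)  # illustrative input values
print(rebuilt, coefficient, example)
```

Rescaling the ordinate by 1/0.86 changes only the units of the vertical axis, so the points still fall on a straight line.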

The D_n–σ_0 correlation  Dressler et al. (1987) have defined a readily 
measured photometric parameter that has a tight correlation with σ_0 by 
virtue of the fundamental plane. This parameter, D_n, is the diameter within 
which the mean surface brightness is I_n = 20.75 μ_B. Since D_n is defined in 
terms of a fiducial surface brightness, D_n/R_e is larger for galaxies of high 
surface brightness than for galaxies of low surface brightness but the same 
luminosity. To quantify this idea, let us suppose that all galaxies have the 
same surface-brightness profile, for example the R^(1/4) law. Let this profile be 
I(R) = I_e f(R/R_e). Then from the definition of D_n we have 

$$I_n = \frac{2\pi I_e \int_0^{D_n/2} dR\, R\, f(R/R_e)}{\tfrac{1}{4}\pi D_n^2} = 8 I_e \left(\frac{R_e}{D_n}\right)^2 \int_0^{D_n/2R_e} dx\, x f(x). \tag{4.40}$$

From Figure 4.25 we deduce that most of the light interior to I = 20.75 μ_B 
will come from radii at which f(x) ~ x^(−α) with α ≈ 1.2. Using this 
approximation to evaluate the integral, we obtain 

$$D_n \propto R_e I_e^{1/\alpha} \simeq R_e I_e^{0.83}. \tag{4.41}$$

When equation (4.39) is used to eliminate R_e from (4.41), one obtains¹⁵ 

$$D_n \propto \sigma_0^{1.4} I_e^{-0.07}. \tag{4.42}$$


The weak dependence upon I_e in this equation is responsible for the tight 
correlation between D_n and σ_0 that was discovered by Dressler et al. (1987). 
If we adopt a distance of 16 Mpc for the Virgo cluster (§7.4.2), their mean 
relation becomes 

$$\frac{D_n}{\mathrm{kpc}} = 2.05 \left(\frac{\sigma_0}{100\,\mathrm{km\,s^{-1}}}\right)^{1.33} \tag{4.43}$$

with a 15% scatter from galaxy to galaxy. 
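
Equation (4.43) turns a distance-independent measurement, σ_0, into a physical diameter, so comparing that diameter with the measured angular size of D_n gives a distance. The sketch below shows the arithmetic; the function names are hypothetical, the small-angle approximation is assumed, and the calibration inherits the 16 Mpc adopted for Virgo.

```python
import math

ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi

def dn_kpc(sigma0_kms):
    """Physical D_n in kpc predicted by equation (4.43)."""
    return 2.05 * (sigma0_kms / 100.0) ** 1.33

def distance_mpc(dn_arcsec, sigma0_kms):
    """Distance in Mpc from the angular D_n and sigma_0 (small angles)."""
    theta_rad = dn_arcsec / ARCSEC_PER_RADIAN
    return dn_kpc(sigma0_kms) / theta_rad / 1000.0

# Round trip: a hypothetical galaxy placed at 16 Mpc with sigma_0 = 200 km/s.
theta_arcsec = dn_kpc(200.0) / 16000.0 * ARCSEC_PER_RADIAN
recovered = distance_mpc(theta_arcsec, 200.0)
print(theta_arcsec, recovered)
```

Since the distance is proportional to the predicted D_n, the 15% scatter of the relation translates into roughly a 15% uncertainty in the distance of a single galaxy.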


15 Recall that ⟨I⟩_e/μ_B ∝ −2.5 log(I_e). 

Figure 4.45 Effects of scattering and absorption of light by dust on 
the images of disk galaxies. Light from the top of the bulge reaches 
the observer without obstruction by dust in the disk. Light from 
the lower portion of the bulge is partially absorbed by the disk. 
Some light is forward scattered by the disk into the path to the 
observer. 

Dwarf elliptical galaxies  In §4.3.1 we saw that while the radial 
surface-brightness profiles of giant ellipticals and compact dwarf ellipticals 
obey the R^(1/4) law fairly closely, the exponential law provides a better fit to 
the surface-brightness profiles of diffuse dwarf ellipticals. There are two other 
indications that diffuse dwarf ellipticals are fundamentally different from giant 
ellipticals and compact dwarf ellipticals: 

1. Diffuse dwarf ellipticals do not lie on the fundamental plane (Kormendy 
1987). 

2. In §11.2.1 we shall see that, although the rotational kinetic energy of 
the most luminous ellipticals is dynamically negligible, lower-luminosity, 
higher surface-brightness ellipticals appear to be largely flattened by 
their rotational kinetic energy. By contrast, diffuse dwarfs of compara- 
ble or lower luminosity have dynamically insignificant rotational kinetic 
energy (Figure 11.7). 


4.4 Photometry of Disk Galaxies 


Although elliptical galaxies are the simplest systems observed, we have seen 
that, on closer inspection, even these display considerable variety and individ- 
uality of form. Now we turn to a discussion of the brightness distributions of 
galaxies which possess a conspicuous disk. The complexity of these systems 
can be very great, for in addition to a disk they often possess an elliptical-like 
bulge and/or a bar, and the disk may display pronounced spiral structure. 
Moreover, they usually contain significant quantities of dust, with the result 
that they are far from transparent in some wavebands. This lack of trans- 
parency severely complicates the task of deducing their three-dimensional 
shapes from images. Therefore it is appropriate to start our study with a 
discussion of the photometric effects of dust. 
4.4.1 Photometric effects of dust 


The apparent luminosity of a transparent galaxy is independent of the gal- 
axy’s orientation to the line of sight because the stars emit light isotropically. 
But if the galaxy is filled with dust, the galaxy will appear to be less lumi- 
nous when seen at edge-on orientation than when viewed face-on. This is 
because at edge-on orientation starlight typically has to pass a longer column 
of the galaxy’s dusty ISM to reach us, and is liable to be either absorbed or 
scattered out of the line of sight en route to us.¹⁶ 

Consider how dust affects a galaxy’s measured surface brightness. The 
peak measured surface brightness of a highly flattened but transparent galaxy 
increases as the galaxy is tipped from face-on to edge-on, since at edge-on 
orientation the same luminosity comes from a smaller area of the sky than at 
face-on orientation. By contrast, all the light we receive from a very dusty 
galaxy will have been either emitted or last scattered in a thin layer around 
the galaxy; the thickness of this layer is one optical depth to scattering or 
absorption by dust (see §3.7). Hence a very dusty galaxy will resemble a 
glowing lump of metal in that its surface brightness will be independent of 
orientation, and therefore its apparent luminosity will be proportional to its 
projected area. 
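
The two limiting cases can be put side by side for an idealized disk. This is a sketch, assuming an infinitely thin, uniform, circular disk whose projected area scales as cos i (i = 0 being face-on); the function names are hypothetical, and the idealization fails close to edge-on, where a real disk has finite thickness.

```python
import math

def transparent_thin_disk(inclination_deg, sb_face_on=1.0, luminosity=1.0):
    """Transparent disk: the same light comes from a smaller projected area."""
    cos_i = math.cos(math.radians(inclination_deg))
    return sb_face_on / cos_i, luminosity

def opaque_thin_disk(inclination_deg, sb_surface=1.0, luminosity_face_on=1.0):
    """Very dusty disk: fixed surface brightness, light scales with area."""
    cos_i = math.cos(math.radians(inclination_deg))
    return sb_surface, luminosity_face_on * cos_i

for inclination in (0.0, 30.0, 60.0):
    print(inclination,
          transparent_thin_disk(inclination),
          opaque_thin_disk(inclination))
```

Real galaxies presumably lie between these extremes, which is why the inclination dependence of surface brightness is used as a test of transparency.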

These effects are extremely important because they bias the galaxies 
that are included in catalogues. Consider for example a catalogue that 
includes all galaxies with isophotal major axes larger than some value.¹⁷ If a 
galaxy is transparent, its isophotal major axis will lengthen as it is tipped 
towards edge-on, and its chances of being included in the catalogue will in- 
crease. By contrast, if the galaxy is dusty, its isophotal major axis and thus 
its chances of being included in the catalogue are essentially independent of 
orientation. It is easy to see that corresponding biases occur when galaxies 
that are brighter than some specified apparent magnitude are included in a 
catalogue. Thus it is impossible to compile a catalogue that is unbiased with 
respect to orientation until one has decided just how dusty galaxies are. 
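
The diameter bias can be illustrated for an exponential disk. The sketch below assumes a thin disk with face-on profile I(R) = I_0 exp(−R/R_d), whose major-axis surface brightness is boosted by 1/cos i when the disk is transparent and is unchanged when it is opaque; the function name and the numbers are hypothetical.

```python
import math

def isophotal_semi_major_axis(inclination_deg, central_sb, sb_limit,
                              scale_length=1.0, transparent=True):
    """Major-axis radius at which the surface brightness falls to sb_limit."""
    boost = 1.0
    if transparent:
        # Projection leaves major-axis radii unchanged but raises the
        # surface brightness of a transparent thin disk by 1 / cos(i).
        boost = 1.0 / math.cos(math.radians(inclination_deg))
    ratio = boost * central_sb / sb_limit
    return scale_length * math.log(ratio) if ratio > 1.0 else 0.0

face_on = isophotal_semi_major_axis(0.0, 100.0, 1.0)
tilted = isophotal_semi_major_axis(60.0, 100.0, 1.0)
dusty_face_on = isophotal_semi_major_axis(0.0, 100.0, 1.0, transparent=False)
dusty_tilted = isophotal_semi_major_axis(60.0, 100.0, 1.0, transparent=False)
print(face_on, tilted, dusty_face_on, dusty_tilted)
```

In this toy model the transparent disk gains R_d ln(1/cos i) in isophotal radius when tilted, so a diameter-limited catalogue would over-represent inclined transparent disks but not opaque ones.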

As we saw in §3.7, blue light is more strongly absorbed and scattered 
than red light, and there is a tendency for dust to scatter light through 
only a small angle. Figure 4.45 shows how these effects lead to color and 
brightness asymmetries between the near and far sides of disk galaxies. These 
asymmetries are important because they enable us to determine which is the 
nearer side of a galaxy, and therefore to conclude from line-of-sight velocity 
measurements in which sense a galaxy is rotating — this knowledge is crucial 
for studies of spiral structure, for example. 

In Figure 4.45, the light received by the observer from the part of the 
bulge on the near side of the galaxy’s major axis is more strongly absorbed 
than the light from the far side, because much of the former has to pass right 
through the absorbing dust within the disk. The absorption is so strong in 
type S0_3 galaxies that it produces a crescent of blackness on their near sides. 
This differential absorption also makes the light from the near side of a 
galaxy appear redder than that from the far side. Scattering by dust within 
the disk has precisely the opposite effect. Light emitted by the upper half 
of the bulge in Figure 4.45 is strongly forward scattered off the dust in the 
disk and thus contributes extra blue light to the galaxy’s image on the near 
side of the major axis, but little light is scattered toward the observer by the 
far side of the disk. In consequence the near side of the image tends to be 
bluer and brighter than the far side. The relative importance of these two 
competing effects varies with the angle of inclination i of the galaxy’s disk 
to the plane of the sky. At small inclinations, absorption dominates over 
forward scattering, and the near side appears dim and red. At intermediate 
inclination (80° ≳ i ≳ 60°), forward scattering can make the near side bluer 
and brighter than the far side. At very large inclinations, light from the near 
side is very heavily absorbed, and forward scattering cannot make up for the 
light lost by absorption (which can be so heavy that it produces dark lanes 
across the image of the galaxy). Extensive discussions of this problem will 
be found in Elvius (1956) and van Houten (1961). 

16 See §3.7 for a discussion of absorption and scattering by dust. 

17 The UGC (Nilson 1973) is an important diameter-limited catalogue. However, its 
diameters are eye-estimates, and differ from photometric diameters in just the way 
required for them to be approximately inclination-independent (Huizinga & van Albada 
1992, Giovanelli et al. 1995). 

Unfortunately, the degree to which photometry of disk galaxies is 
affected by dust remains controversial; see Giovanelli et al. (1995) or Davies 
& Burstein (1995) for recent discussions. The classical studies of Holmberg 
(1958) and de Vaucouleurs (1959a) argued that disks are fairly transparent 
by showing that the mean surface brightnesses of samples of galaxies that 
had been selected by apparent diameter are positively correlated with 
inclination. Subsequently, a powerful case was developed that disks are by no 
means transparent (Disney, Davies & Phillipps 1989). In §9.3.1 we shall see 
that the Milky Way’s dust reprocesses roughly half of the Milky Way’s lu- 
minosity, which suggests that much of our disk is not transparent. On the 
other hand, the importance of dust for the dependence of photometric pa- 
rameters, such as scale lengths and total luminosities, upon inclination angle 
depends not only on how much dust a disk contains, but also on how it is 
distributed. For example, if the dust forms a smooth, vertically extended 
layer, it will profoundly affect inclination-dependencies, while if it is con- 
fined to small clumps around luminous stars, it will have a negligible effect 
on inclination-dependencies. 

We shall not submerge ourselves and the reader in these complex and 
unresolved questions, but only remark that in view of the possible importance 
of dust, it is desirable to work as much as possible in the near-IR, where 
absorption by dust is considerably less important than in the optical. 


4.4.2 Overall shapes of disk galaxies 


Figure 4.46 The distribution of apparent axial ratios of the APM galaxies. The 
left-hand panel shows the data for 4782 S0 galaxies, the right-hand panel for 
13482 spiral galaxies. [After Lambas, Maddox & Loveday (1992) from data kindly 
supplied by S. Maddox] 

Figure 4.46 shows the distributions of the apparent axial ratios of a sample of 
~5000 S0 galaxies (left panel) and ~13000 spiral galaxies (right panel) that 
have been automatically measured on survey plates. The two distributions 
are very different: whereas the frequency of S0 galaxies rises steadily from 
near zero at q ~ 0.2 to a sharp peak at q = 0.8, the frequency of spirals 
is roughly constant from q ~ 0.23 to q ~ 0.85. If we hypothesize that 
spiral galaxies are axisymmetric oblate bodies, then from the flatness of 
the distribution in the right-hand panel of Figure 4.46 it follows that the 
distribution of true axial ratios is sharply peaked around some small value 
of the axial ratio q ~ 0.2. In fact, from equation (4.36) one can see that, if 
N(β) is non-zero only in the interval 0 < β < β_0, where β_0 ≪ 1, then f(q) 
is approximately independent of q for q ≫ β_0. This result lends quantitative 
support to the subjective judgment that spiral galaxies are intrinsically quite 
thin. 
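
The flatness argument can be made concrete for a population with a single true axial ratio β. The sketch below uses the standard result for randomly oriented oblate spheroids, q² = cos²i + β² sin²i with cos i uniformly distributed, which gives f(q) = q / [(1 − β²)(q² − β²)]^(1/2) for β < q ≤ 1; this is assumed here to be the single-β case of equation (4.36), which is not reproduced in this section. The function names are hypothetical.

```python
import math

def apparent_axis_ratio(beta, cos_i):
    """Apparent axial ratio q of an oblate spheroid seen at inclination i."""
    return math.sqrt(cos_i ** 2 + beta ** 2 * (1.0 - cos_i ** 2))

def apparent_axis_ratio_pdf(q, beta):
    """f(q) for randomly oriented oblate spheroids of true axial ratio beta."""
    if not beta < q <= 1.0:
        return 0.0
    return q / math.sqrt((1.0 - beta ** 2) * (q ** 2 - beta ** 2))

beta = 0.2
for q in (0.3, 0.5, 0.7, 0.9):
    print(q, apparent_axis_ratio_pdf(q, beta))
flatness = apparent_axis_ratio_pdf(0.5, beta) / apparent_axis_ratio_pdf(0.9, beta)
```

For β = 0.2 the density changes by only a few per cent between q = 0.5 and q = 0.9, while it rises steeply as q approaches β, in line with the behavior described for spirals.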

By contrast, the long upward sloping portion of the left-hand histogram 
in Figure 4.46 implies that S0 galaxies are widely distributed in true axial 
ratio, from q ~ 0.25 to q ~ 0.85. This finding presumably reflects the fact 
that S0 galaxies tend to have prominent bulges, and these are sometimes nearly 
spherical. Clearly, the apparent axial ratio of a galaxy that has a prominent, 
nearly spherical bulge will be close to unity no matter what the galaxy’s 
inclination happens to be. 

In common with the distribution of apparent ellipticities of elliptical 
galaxies that is shown in Figure 4.33, the distributions for both S0s and 
spirals shown in Figure 4.46 dip as they approach axial ratio q = 1 (ε = 0). 
Lambas, Maddox & Loveday (1992) show that this dip makes it impossible 
to interpret these distributions as the distribution of projected ellipticities of 
any group of axisymmetric oblate bodies. Evidently these objects are not 
axisymmetric, a conclusion reached earlier by Binney & de Vaucouleurs (1981) 
from a study of the distribution of apparent axial ratios of disk galaxies that 
appear in the Second Reference Catalogue. This sample is much smaller 
than that which underlies Figure 4.46, but has the advantage that it may 
be subdivided according to Hubble type. In addition to finding that many 
disks are slightly elliptical they concluded that galaxies of type Sdm and later 
are not particularly flat: their true axial ratios are approximately uniformly 
distributed in the interval 0.2–0.8. 

It is not clear whether the ellipticity of a typical disk reflects merely 
spiral structure or ellipticity of the dark halo that may contribute significantly 
to the potential in which the disk rotates, especially at large radii (Binney 
1978). In an attempt to resolve this question, Rix & Zaritsky (1995) analyzed 
K-band photometry of 18 near face-on galaxies. By observing in the near-IR, 
they hoped to maximize the contribution to the measured luminosity from 
old stars relative to that of young stars. Galaxies were selected for study 
on the basis of the widths of their 21-cm lines: this width is proportional to 
v_c sin i, where v_c is a characteristic intrinsic circular speed (§8.2.4), so if v_c 
is known, i can be determined. Rix & Zaritsky estimated v_c from the 
absolute magnitudes of their galaxies and equation (7.26). They found that their 
galaxies were by no means axisymmetric. About one third were significantly 
lop-sided and about half had two-armed spirals with arm–interarm contrasts 
of order unity. They estimated the ellipticity ε_Φ of the underlying potentials 
and concluded that its distribution peaks near ε_Φ = 0.045 and has a 
characteristic width ~ 0.025. Thus few potentials are genuinely axisymmetric. 
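
The selection of near face-on galaxies by line width rests on inverting the proportionality between the 21-cm width and v_c sin i. A minimal sketch follows, assuming the idealized relation W = 2 v_c sin i with no correction for turbulent or instrumental broadening; that factor of two, the function name and the example numbers are assumptions, not values from Rix & Zaritsky (1995).

```python
import math

def inclination_from_linewidth(width_kms, v_c_kms):
    """Inclination in degrees, assuming W = 2 * v_c * sin(i)."""
    sin_i = width_kms / (2.0 * v_c_kms)
    if not 0.0 <= sin_i <= 1.0:
        raise ValueError("line width is inconsistent with the adopted v_c")
    return math.degrees(math.asin(sin_i))

# Hypothetical galaxy: v_c = 200 km/s inferred from its absolute magnitude,
# observed 21-cm width of 100 km/s.
inclination = inclination_from_linewidth(100.0, 200.0)
print(inclination)
```

A narrow line in a galaxy that is luminous enough to rotate rapidly therefore signals a nearly face-on disk.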

Even when a galaxy is not face-on, it is in principle possible to determine 
ε_Φ from an analysis of the shape and velocity field of any gas ring that can be 
studied in sufficient detail (§8.2.4). From an analysis of kinematic data for 
NGC 3198 (Figure 8.35), Schoenmakers, Franx & de Zeeuw (1997) tentatively 
concluded that ε_Φ = 0.019 ± 0.003. 


4.4.3 Bulge—disk decomposition 


The literature contains detailed surface photometry of several hundred nearby 
disk galaxies. Simien & de Vaucouleurs (1985) summarize many results ob- 
tained photographically, while Kent (1984) and de Jong & van der Kruit 
(1994) present a considerable body of CCD data. Other useful references are 
Walterbos & Kennicutt (1987) and Wevers, van der Kruit & Allen (1986). 

Figure 4.47 shows surface brightness plotted against radius for three spi- 
rals — these profiles represent average brightnesses around ellipses of similar 
shape to the galaxies’ outer isophotes. In this plot of log I versus R the 
outermost points for two of the galaxies tend to lie on straight lines; that is, 
at large radii the surface brightness obeys the exponential law (4.20). Since 
this brightness distribution fits the outer brightness profiles of the majority 
of disk galaxies, it has, in fact, come to define the normal disk component of 
all flat galaxies. Deviations from this profile, such as one sees in the profile 
of NGC 5194 shown in Figure 4.47, are generally ascribed to contamination 
of the profile by other components. 
Figure 4.47 Blue surface brightness versus elliptical radius for 
three spiral galaxies: NGC 2841 (open squares); NGC 3898 (filled 
triangles); NGC 5194 (open triangles). The solar symbol in- 
dicates the estimated surface brightness of the Milky Way near 
the Sun. The outer parts of two of the profiles are fairly straight 
in accord with equation (4.20). [From data published in Boroson 
(1981)] 


Figure 4.47 clearly shows that both NGC 2841 and NGC 3898 are 
significantly brighter near their centers than the exponential law predicts. 
Images of the galaxies indicate that in most cases the additional brightness is 
contributed by an elliptical-like bulge, which might well obey the R^(1/4) law. 
Hence we next attempt to fit the observed profile of these galaxies with the 
sum of an R^(1/4) law and an exponential law. Figure 4.48 shows that good 
fits can be obtained in this way. 

The right-hand panel of Figure 4.48 shows that the slope of the derived 
disk component can be significantly smaller than the slope of the straight line 
that is tangent to the measured profile at large radii. This difference in slope 
arises because the contribution of the bulge component to the brightness of 
the galaxy is non-negligible even at large R. In fact, if the bulge were to obey 
the R1/4 law at very large radii, there would be a radius beyond which the 
bulge’s light once again dominated that of the disk; clearly, the exponential 
decrease of disk brightness predicted by equation (4.20) will ultimately lead 
to a lower brightness at large R than the less steeply falling R'/4 law. The 
bulge contribution drops below that of the disk at intermediate radii only 
because the scale length of the bulge is smaller than that of the disk. It is 
therefore dangerous to ignore the brightness of the bulge and fit a straight line 
to the large-R behavior of the composite profile. Such a fit to the composite 
profile will inevitably be steeper than the line of the true disk contribution 
and thus yield a smaller scale length R_d (from the steeper slope) and a higher 
intercept I_d than the true values for an exponential disk [equation (4.20)]. 

Figure 4.48 Fits to the surface-brightness profiles of NGC 2841 
(left) and NGC 3898 (right). The dotted curves show the exponential 
fits to the disks and the dashed curves show the R^{1/4} fits 
to the bulges; the full curves show the sums of these components. 
[From data published in Boroson (1981)] 
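
The bias that results from fitting a straight line to the outer part of a composite profile can be checked numerically. The following is a minimal sketch, assuming the standard forms I(R) = I_e exp{−7.67[(R/R_e)^{1/4} − 1]} for the bulge and I(R) = I_d exp(−R/R_d) for the disk, which are taken here to be what equations (4.18) and (4.20) express. The parameter values are hypothetical and are not fits to NGC 2841 or NGC 3898.

```python
import math

def bulge(R, I_e, R_e):
    """R^{1/4}-law surface brightness (assumed form of equation 4.18)."""
    return I_e * math.exp(-7.67 * ((R / R_e) ** 0.25 - 1.0))

def disk(R, I_d, R_d):
    """Exponential-disk surface brightness (assumed form of equation 4.20)."""
    return I_d * math.exp(-R / R_d)

def apparent_scale_length(R, I_e, R_e, I_d, R_d, h=1e-3):
    """Scale length implied by the local slope of ln I for the bulge + disk sum."""
    def total(r):
        return bulge(r, I_e, R_e) + disk(r, I_d, R_d)
    # Central-difference estimate of d(ln I)/dR at radius R.
    slope = (math.log(total(R + h)) - math.log(total(R - h))) / (2.0 * h)
    return -1.0 / slope

# Hypothetical parameters: radii in kpc, brightness in arbitrary linear units.
I_e, R_e, I_d, R_d = 1.0, 1.0, 0.5, 4.0
for R in (4.0, 8.0, 12.0):
    fitted = apparent_scale_length(R, I_e, R_e, I_d, R_d)
    print(f"R = {R:4.1f} kpc: tangent-line scale length = {fitted:.2f} kpc "
          f"(true R_d = {R_d} kpc)")
```

At each of these radii the bulge profile is still locally steeper than the disk, so the tangent to the composite profile implies a scale length smaller than the true R_d, even where the disk supplies most of the light.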

The largest residuals between disk-bulge decompositions and the data 
occur in the transition region between disk and bulge. Freeman (1970) de- 
fined type I systems to be those in which the data bob above the model in 
the transition region, and type II systems to be those, like NGC 3898 in 
Figure 4.48, in which the data dip below the model. It is not clear how sig- 
nificant this division is physically. Many type II galaxies may have holes in 
their disks. Moreover, the bumps in the profiles of some type I galaxies are 
caused by bars — see below. On the other hand, Prieto et al. (1992) caution 
that type II profiles can be caused by dust lanes, while type I profiles may be 
due to a narrow region of enhanced star formation just outside the bulge. In 
principle these various possibilities can be distinguished if accurate colors are 
available, but this task is complex because there is inevitably a steep color 
gradient in the transition region, as the red bulge gives way to the relatively 
blue disk. 

It turns out that the parameters of the disk and bulge that one obtains 
from fits like that shown in Figure 4.48 depend to some extent on how the 
fitting is done. The only reliable way to choose between possible bulge-disk 
decompositions is to fit the models directly to the two-dimensional surface 
photometry rather than to surface brightness profiles such as those shown in 
Figure 4.47. Figure 4.49 shows an example of this process. The right-hand 
panel in the figure shows the difference between the projection onto the sky of 
the best-fitting model (middle panel) and the data (left-hand panel). The 
quality of the fit is such that only the galaxy’s spiral arms are visible in this 
residual plot. 

Figure 4.49 Modeling photometry of a galaxy in two dimensions. The left panel shows a 
B-band CCD image of NGC 214; the middle panel shows the projection onto the sky of the 
best-fitting combination of an exponential disk and an R^{1/4}-law bulge; and the right panel 
shows the residuals after this model has been subtracted. The tick marks are at 10 arcsec 
intervals. The ellipse in the right panel is the region over which the fit was performed, 
and the empty circles are regions that were excluded because they are contaminated by 
light from other objects. [Data kindly provided by R. de Jong] 

The superiority of the two-dimensional data is connected with the radial 
gradient in ellipticity that marks the transition from the bulge-dominated re- 
gion to the disk-dominated region: since the bulge is a truly three-dimensional 
object, its ellipticity will be less than that of the thin disk.^{18} When the sum 
of a bulge and a disk is fitted directly to the two-dimensional data, the resid- 
uals will be smallest when the ellipticity gradient in the model fits that in 
the data. 
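
The origin of the ellipticity gradient can be illustrated with a short calculation. This is a minimal sketch, assuming the standard projection result for an axisymmetric oblate spheroid of intrinsic axis ratio q seen at inclination i (i = 90° is edge-on), whose apparent axis ratio is sqrt(q² sin² i + cos² i). That formula and the flattening q = 0.6 are assumptions introduced here, not values taken from the text.

```python
import math

def apparent_ellipticity(q, inclination_deg):
    """Ellipticity 1 - b/a of an oblate spheroid with intrinsic axis ratio q.

    inclination_deg = 0 is face-on and 90 is edge-on; q = 0 describes a thin disk.
    """
    i = math.radians(inclination_deg)
    axis_ratio = math.sqrt(q * q * math.sin(i) ** 2 + math.cos(i) ** 2)
    return 1.0 - axis_ratio

for inc in (30.0, 60.0, 80.0):
    eps_bulge = apparent_ellipticity(0.6, inc)  # hypothetical bulge flattening
    eps_disk = apparent_ellipticity(0.0, inc)   # infinitesimally thin disk
    print(f"i = {inc:4.1f} deg: bulge ellipticity {eps_bulge:.2f}, "
          f"disk ellipticity {eps_disk:.2f}")
```

At every inclination the bulge isophotes are rounder than those of the disk, so a two-dimensional model has to reproduce a rise in ellipticity across the transition region, a constraint that a one-dimensional profile does not provide.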

Studies of nearly edge-on galaxies are important for two reasons. First 
they enable us to test the conjecture that the surface-brightness profiles of 
bulges may be fitted with the R^{1/4} law. Second, they enable us to study 
the vertical structures of disks. van der Kruit & Searle (1981b) analyzed 
photographic photometry of the edge-on galaxy NGC 891 and Wainscoat, 
Freeman & Hyland (1989) have analyzed optical and near-IR photometry of 
the edge-on galaxy IC 2531. The images of both galaxies are cut through the 
middle by a dust lane, so that obscuration by dust is a major problem for 
both studies. van der Kruit & Searle (1981b) tried to evade the obscuration 
by fitting model profiles to their data away from the dust lane. These profiles 
were line-of-sight projections of the model luminosity density^{19} 

    j(R, z) = \begin{cases} j_0 \, e^{-R/R_d} \, \mathrm{sech}^2(z/2z_0) & \text{for } R < R_{\max}, \\ 0 & \text{for } R > R_{\max}. \end{cases}    (4.44)

Here (R, z) are cylindrical polar coordinates that define the symmetry axis of 
the model, and j_0, R_d, z_0 and R_max are constants to be fitted to the data. At 


18 If the bulge is triaxial, its ellipticity will be greater than that of the disk for sufficiently 
face-on orientations. 

19 Our definition of z_0 differs from that of van der Kruit & Searle (1981b) by a factor 
2 in order that j ∝ e^{-z/z_0} at z ≫ z_0. 

face-on inclination this model obeys the exponential surface-brightness law 
at R < R_max. The sech^2 functional form in (4.44) was motivated by theory; 
see Problem 4-25 of BT. Remarkably, van der Kruit & Searle (1981b) were 
able to fit their data for NGC 891 without allowing any dependence of the 
parameter z_0 upon R: the characteristic thickness of the disk appears to 
be independent of radius. They found that their data could be satisfactorily 
fitted by the model with R_d = 4.9 kpc, z_0 = 0.5 kpc, R_max = 21 kpc. They 
concluded that the observed profile drops extremely steeply at R_max, where 
the surface brightness of the galaxy is about 4 mag below the estimated sky 
level. Barteldrees & Dettmar (1994) reach a similar conclusion on the basis 
of CCD photometry of a sample of 27 more distant edge-on galaxies. This 
cutoff is harder to see in face-on galaxies, but by going to surface brightnesses 
as low as 26 μ_R, Morrison, Boroson & Harding (1994) detected cutoffs in two 
galaxies. By contrast, de Jong & van der Kruit (1994) found no evidence of 
cutoffs in a larger sample of galaxies. 
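
Two properties of the model of equation (4.44) can be verified numerically. The sketch below takes the model to read j = j_0 e^{−R/R_d} sech²(z/2z_0) inside R_max and zero outside, and checks that (i) far from the plane the density falls by a factor e over one z_0, and (ii) its integral over z, which is the face-on surface brightness if extinction is ignored, equals 4 z_0 j_0 e^{−R/R_d} and is therefore exponential in R. The default parameters are the values quoted for NGC 891; the normalization j_0 = 1 and the trapezoidal integration are arbitrary choices made for illustration.

```python
import math

def j_sech2(R, z, j0=1.0, R_d=4.9, z0=0.5, R_max=21.0):
    """Model luminosity density of equation (4.44); all lengths in kpc."""
    if R >= R_max:
        return 0.0
    return j0 * math.exp(-R / R_d) / math.cosh(z / (2.0 * z0)) ** 2

def face_on_surface_brightness(R, z0=0.5, n=4000, z_lim=20.0):
    """Trapezoidal integral of j over z from -z_lim to +z_lim (kpc)."""
    dz = 2.0 * z_lim / n
    total = 0.0
    for k in range(n + 1):
        z = -z_lim + k * dz
        weight = 0.5 if k in (0, n) else 1.0
        total += weight * j_sech2(R, z, z0=z0)
    return total * dz

R = 8.0
numeric = face_on_surface_brightness(R)
analytic = 4.0 * 0.5 * math.exp(-R / 4.9)  # 4 z0 j0 exp(-R/R_d) with j0 = 1
print(f"face-on surface brightness: numeric {numeric:.6f}, analytic {analytic:.6f}")

# Far from the plane, moving up by one z0 = 0.5 kpc lowers j by a factor close to e.
ratio = j_sech2(R, 5.0) / j_sech2(R, 5.5)
print(f"j(z = 5.0) / j(z = 5.5) = {ratio:.5f}, e = {math.e:.5f}")
```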

Because they possessed near-IR photometry, Wainscoat, Freeman & Hy- 
land (1989) were able to model their data for IC 2531 right down to the 
middle of the dust lane. They concluded that a successful model has to 
contain three components: an old disk of B − V color 0.78, a young disk of 
B − V color −0.04, and a dust layer. The vertical density profiles of these 
components need to be more sharply peaked near z = 0 than the model of 
(4.44) allows. The data can be successfully modeled if the density profiles of 
all three components are assumed to be of the form 

    j(R, z) = \begin{cases} j_0 \, e^{-R/R_d - |z|/z_0} & \text{for } R < R_{\max}, \\ 0 & \text{for } R > R_{\max}, \end{cases}    (4.45)

with R_d = 6.4 kpc and R_max = 4R_d. The old disk had z_0 = R_d/12, the 
young disk had z_0 = R_d/96, and the dust layer had z_0 = R_d/48. The luminosity 
density j_0 was the same for both the old and the young stellar disks. For the 
dust layer j_0 = 1.6 mag kpc^{-1}. 
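
For orientation, the quoted ratios can be turned into physical scale heights. The sketch below simply evaluates z_0 = R_d/12, R_d/96 and R_d/48 for R_d = 6.4 kpc, together with the vertical factor e^{−|z|/z_0} of equation (4.45). The dictionary labels and the sampling height of 0.2 kpc are choices made here for illustration.

```python
import math

R_d = 6.4          # kpc, radial scale length quoted for IC 2531
R_max = 4.0 * R_d  # kpc, truncation radius

# Each component has z0 = R_d / divisor.
divisors = {"old disk": 12.0, "young disk": 96.0, "dust layer": 48.0}
scale_heights = {name: R_d / d for name, d in divisors.items()}

def vertical_profile(z, z0):
    """Vertical factor exp(-|z|/z0) of equation (4.45), equal to 1 at z = 0."""
    return math.exp(-abs(z) / z0)

print(f"R_max = {R_max:.1f} kpc")
for name, z0 in scale_heights.items():
    # Fraction of the mid-plane density that remains 0.2 kpc above the plane.
    remaining = vertical_profile(0.2, z0)
    print(f"{name}: z0 = {1000.0 * z0:.0f} pc, j(z = 0.2 kpc)/j(0) = {remaining:.3f}")
```

The three scale heights come out at roughly 530 pc, 67 pc and 130 pc respectively, so the young disk is the thinnest component and the dust layer is intermediate between the two stellar disks.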

Burstein (1979) examined the distribution of light along lines that run 
perpendicular to the major axes of five highly inclined S0 galaxies. The 
crosses in Figure 4.50 show three representative profiles for one of these 
galaxies, NGC 4570: those along lines that cut the major axis at R = 6”, 42” 
and 60”. The diamonds show the corresponding surface brightness profiles 
that are predicted by modeling the major-axis brightness profile with a sum of 
an infinitesimally thin exponential disk and a bulge that obeys the R^{1/4} law. 
In each case the diamonds fit the crosses down to I ~ 23.5 μ_B, confirming 
that the inclination of the disk and the flattening of the bulge have been 
correctly determined. Fainter than I ~ 23.5 μ_B the crosses lie systematically 
above the diamonds. From this observation Burstein concluded that NGC 4570 
has a thick disk in addition to a thin disk and an R^{1/4} bulge.^{20} The 


20 In her 1977 University of Texas thesis, Tsikoudi suggested that some S0 galaxies had 
thick disks. 

Figure 4.50 Perpendicular profiles of NGC 4570 at major-axis 
intercepts R = 6”, R = 42”, R = 60” [After Burstein (1979) 
courtesy of D. Burstein] 


luminosity density of a thick disk is often modeled by equation (4.45) with 
z_0 ≈ 1 kpc. From the observation that in Figure 4.50 the thick disk starts to 
dominate the thin disk at I ~ 23.5 μ_B, it follows that the radial scale length 
R_d must significantly exceed the radial scale length of the thin disk. 

The phenomenon detected by Burstein (1979) in five S0 galaxies has 
since been observed in many but not all disk galaxies. It is now thought that 
most S0 galaxies and many spirals have perpendicular profiles of Burstein’s 
type. However, the correct physical interpretation of these profiles is unclear. 
One possibility is that the light Burstein attributes to the thick disk should 
be assigned to the bulge. Indeed, even if bulges were physically identical to 
low-luminosity elliptical galaxies, their luminosity profiles would have been 
distorted from those characteristic of elliptical galaxies by the strong gravi- 
tational fields of the embedded disks (Freeman 1978, Rowley 1988). Another 
possibility is that not all disks have exponential vertical profiles; there is no 
theoretical reason to expect an exponential dependence on z — see Prob- 
lem 4.10. In §10.4.3 we shall see that the Milky Way displays a similar 
phenomenon to that studied by Burstein, and important additional clues as 
to its physical origin are provided by studies of the chemical composition of the 
stars as a function of their distance above the Galactic plane. 

The minor-axis luminosity profiles of nearly edge-on galaxies enable us 
to test the conjecture that bulges can be adequately fitted with the R^{1/4} 
law. Kent (1985) shows the minor-axis profiles of 105 disk galaxies of all 
morphological types, together with fits to the data by the sum of exponential 
and R^{1/4} laws. The great majority of these fits are of high quality, although 
there are significant deviations between model and observations in a few 
cases. Similarly, Burstein (1979) was able to obtain good fits to his minor-axis 
data with the sum of exponential and R^{1/4} laws. However, the minor-axis 
profiles of some galaxies are definitely not adequately fitted by this 
model and are better fitted by an exponential profile. The classic example is 
the nearly edge-on Sb galaxy NGC 4565 (Kormendy & Bruzual 1978, van der 
Kruit & Searle 1981a). Bahcall & Kylafis (1985) have shown that existing 
data are usually unable to distinguish between bulges that follow the R^{1/4} 
law and ones that take the form of thick disks. 

Once surface photometry of large numbers of disk galaxies has been 
decomposed into disk and bulge components, it is interesting to look for 
correlations between the values of the parameters that are determined by the 
fits, and also seek correlations with Hubble type (Simien & de Vaucouleurs 
1985, Kent 1985). The fits involve four parameters: I_e and R_e for the bulge 
and I_d and R_d for the disk; see equations (4.18) and (4.20). From these it 
is interesting to calculate a fifth parameter, the bulge fraction, 

    B/T \equiv \frac{R_e^2 I_e}{R_e^2 I_e + 0.28 R_d^2 I_d},    (4.46)

which is the fraction of the total luminosity that is contributed by the bulge. 
B/T is related to the disk-to-bulge ratio, which is frequently cited in the 
literature, by D/B = (B/T)^{-1} − 1. The advantage of B/T is that it is 
confined to the interval (0, 1). 
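
The coefficient 0.28 in equation (4.46) can be understood from the total luminosities of the two components. Assuming the standard results L_bulge = 7.22π R_e² I_e for the R^{1/4} law and L_disk = 2π R_d² I_d for the exponential disk (neither is derived in this section), the ratio of the numerical factors is 2/7.22 ≈ 0.28. The sketch below evaluates B/T and D/B on that basis; the input values are hypothetical.

```python
def bulge_fraction(I_e, R_e, I_d, R_d):
    """Bulge-to-total luminosity ratio B/T of equation (4.46)."""
    bulge = R_e ** 2 * I_e
    disk = 0.28 * R_d ** 2 * I_d
    return bulge / (bulge + disk)

def disk_to_bulge(b_over_t):
    """Disk-to-bulge ratio D/B = (B/T)^-1 - 1."""
    return 1.0 / b_over_t - 1.0

# Hypothetical decomposition: radii in kpc, intensities in arbitrary linear units.
bt = bulge_fraction(I_e=1.0, R_e=1.0, I_d=5.0, R_d=3.0)
print(f"B/T = {bt:.3f}, D/B = {disk_to_bulge(bt):.2f}")
print(f"2 / 7.22 = {2.0 / 7.22:.3f}")
```

Whereas D/B is unbounded for bulge-poor systems, B/T stays between 0 and 1 for any positive inputs, which is the practical advantage noted above.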

Figure 4.51 shows that the bulge fraction B/T is quite tightly correlated 
with Hubble type, falling from a mean value of 0.65 for lenticular galaxies, to 
a mean value of 0.15 for galaxies of type Sc and later. This correlation 
quantifies the basic idea of Hubble’s classification according to the prominence 
of the bulge. From the point of view of cosmology it is important because 
it argues against the possibility that all S0 galaxies are simply typical spiral 
galaxies that have somehow lost their gas and therefore ceased to display 
spiral structure or form new stars. Of course, a disk that ceases to form new 
stars will gradually fade relative to its associated bulge, but estimates of this 
effect (see §5.4) indicate that it is insufficiently powerful to account for the 
steepness of the trend displayed in Figure 4.51. 

The left-hand panel in Figure 4.52 shows that the bulges of galaxies of 
Hubble type Sb and earlier have a rather similar distribution in the (R_e, I_e) 
plane to that of elliptical galaxies; compare the bottom left-hand panel of 
Figure 4.43. In fact, if the elliptical galaxies of Figure 4.43 were to be plotted 
in the left-hand panel of Figure 4.52, they would cluster around the dashed 
line shown.^{21} The bulges of Sc galaxies lie systematically lower in the (R_e, I_e) 
plane. 

Figure 4.52 The distribution of 66 disk galaxies in analogs of the 
lower left panel of Figure 4.43. The left panel shows the surface 
brightnesses and effective radii of the bulges, while the right panel 
shows the corresponding disk parameters. Different symbols correspond 
to galaxies of different Hubble types: S0 (crosses); Sa-Sab 
(open triangles); Sb (filled triangles); Sbc (open squares); 
Sc-Scd (filled squares). If the ellipticals of Figure 4.43 were to 
be plotted in the left-hand panel, they would cluster around the 
dashed line. [From data published by Kent (1985), who adopts 
H_0 = 100 km s^{-1} Mpc^{-1}] 

The right-hand panel of Figure 4.52 shows that the corresponding parameters 
for the disk, R_d and I_d, are tightly correlated in the same sense: 
physically larger systems have lower central surface brightnesses. Freeman 
(1970) noted that the measured central surface brightnesses of disk galaxies 
clustered around I = 21.7 μ_B; this result is often called Freeman’s 
law. Subsequently, it turned out that this clustering was a selection effect: 
the apparently largest and brightest galaxies were the first to be measured 
carefully. In reality there exist significant numbers of intrinsically fainter 
galaxies, but one has to look carefully for such objects. Schombert et al. 
(1992) and Impey et al. (1996) have systematically searched for low surface 
brightness (LSB) galaxies, which may usefully be defined to be objects 


21 The dashed line in Figure 4.52 has been calculated by displacing the ridge-line of 
the distribution of Figure 4.43 to allow for the different distance scales and measures of I 
employed. 

with I(0) fainter than 23 μ_B. Since for a bulge 23 μ_B ~ 21.7 μ_r, only a few 
LSB galaxies contribute to the right-hand panel in Figure 4.52. 

It is worth noting that Peletier et al. (1994) have shown that near-infrared 
photometry generally yields smaller values of R_d than B-band photometry. 
This phenomenon reflects the fact that disks tend to become bluer 
at larger radii; see §4.4.5. 


4.4.4 Shapes of bulges 


Ideally, we would like to investigate the three-dimensional shapes of bulges 
using the same sort of analysis that we applied to the apparent ellipticities 
of elliptical galaxies in §4.3.3. Unfortunately, the contribution of light from 
the other components in disk galaxies complicates matters. For example, if 
a disk galaxy contains a bulge that is highly flattened towards its galactic 
plane, it will be very difficult to disentangle the bulge light from that of 
the disk. The distinction between bulges and galactic bars is even harder to 
draw: a prolate spheroidal structure whose major axis lies in the plane of a 
disk galaxy could equally well be identified as a prolate bulge or a bar. It is 
therefore perhaps unsurprising that, as we shall see in §4.4.7, the properties 
of bulges and bars seem to be very closely related. 

Despite these complexities, several studies have estimated the shapes 
of galactic bulges. In his photometric decomposition of a sample of 66 disk 
galaxies, Kent (1985) estimated the shapes of the bulges as well as their 
radial profiles, and found ellipticities spanning the range 0 < ε_B < 0.7. As 
Figure 4.53 illustrates, this sample shows no correlation between ε_B and 
the characteristic surface brightness, I_e, of the R^{1/4} law that fits the bulge 
[equation (4.18)]. If the bulges were all identical oblate spheroids viewed 
from different directions, the more edge-on systems would have larger values 
of both I_e and ε_B. Even if the bulges were oblate spheroids with a range 
of flattenings, one would expect I_e to be correlated with ε_B in a sample of 
galaxies with a range of inclinations. The absence of such a correlation might 
therefore be taken as evidence that bulges have more complex shapes than 
simple oblate spheroids. Alternatively, it might simply reflect the small size 
of the sample, systematic difficulties in disentangling the various components 
in disk galaxies, or the pervasive influence of obscuration by dust on measured 
surface brightnesses (see §4.4.1). 

One piece of evidence suggesting that bulges can be represented by 
simple oblate spheroids has come from studies of their kinematics. As we 
shall see in §11.3.1, analyses of the kinematics of bulges in edge-on disk galax- 
ies show that flatter bulges rotate more rapidly. In fact, their ellipticities turn 
out to be very close to what one would expect for simple axisymmetric oblate 
spheroids that have been flattened by rotational motion around axes that are 
parallel to those of the surrounding disks. 

As for elliptical galaxies, further information on the properties of bulges 
can be gleaned from any departures in their isophotes from perfect ellipses. 
Once again, the presence of the other components in disk galaxies complicates 
the analysis: if part of the disk’s light is erroneously attributed to the bulge, 
then, not surprisingly, the bulge will appear to have disky isophotes with 
a_4 > 0 [see equation (4.10)]. Similarly, the presence of non-axisymmetric 
structure such as spiral arms and patchy obscuration by dust can distort 
the shapes of fitted isophotes. Studies of the shapes of bulge isophotes have 
therefore generally been restricted to disk galaxies that are close to edge-on: 
away from the plane of such a galaxy bulge light is relatively uncontaminated 
by other components. Since bulges tend to be flattened towards the plane of 
their surrounding disks, the exclusion of emission close to the plane makes it 
significantly harder to detect deviations of isophotes from ellipses — see Fig- 
ure 4.20. Nevertheless, such analyses have revealed departures from elliptical 
isophotes in a significant fraction of bulges. de Souza & dos Anjos (1987) 
found that ~ 25% of a sample of edge-on galaxies contained bulges with dra- 
matically boxy isophotes. In some cases, the isophotes have been squashed to 
the degree that they have a double-lobed structure, rather like an unshelled 
peanut. The image of NGC 5746 in Figure 4.60 provides an example of such 
a peanut-shaped bulge. As we shall see in §4.4.7, this phenomenon seems to 
be related to the presence of a bar in the galaxy. 


4.4.5 Color and metallicity gradients in disk galaxies 


Figure 4.54 shows three color indices as a function of radius in the Andromeda 
nebula, M31. Interior to ~ 6 kpc the light is dominated by the bulge and 
the colors are those of an elliptical galaxy. Slightly further out there is a 
steep drop in U − B and a smaller drop in B − V as young stars begin to 
make a significant contribution to the overall surface brightness. All three 
color indices show the disk to be getting steadily bluer from ~ 9 kpc to at 
least 20 kpc. However, the gradients are shallow and require highly smoothed 
data for their detection. Few galaxies have been as carefully observed and 
reduced as M31. Consequently the evidence for systematic color gradients in 
the disks of other galaxies is weak; see Prieto et al. (1992) for an indication 
of how varied major-axis color profiles typically are. It is reassuring that 
Peletier et al. (1994) have shown that near-infrared photometry generally 
yields smaller values of R_d than B-band photometry, which is what one 
expects if disks become bluer outward. 

There are three potential causes of color gradients in disks: (i) gradients 
in the degree of internal extinction by dust, (ii) gradients in the mean ages of 
stars, and (iii) metallicity gradients — see §8.2.6. In practice all three factors 
are probably in competition in any given galaxy, with the result that many 
major-axis color profiles show no systematic trend. 


4.4.6 Spiral structure in disk galaxies 


Spiral structure can take many forms. Some galaxies display grand-design 
spiral structure; that is, in the B band their disks are dominated by two great 
arms. M51, M81 and M100 are famous examples of grand-design spirals. 
Since these objects are photogenic, they tend to feature in coffee-table books 
and on posters. Grand-design spirals make up arm-class 12 in the Elmegreen 
& Elmegreen (1982) classification. Only ~ 10% of spiral galaxies fall into 
this class (Elmegreen & Elmegreen 1987); most spiral galaxies have more or 
less ratty or flocculent spiral structure. 

A good way to study spiral structure is to analyze the difference between 
an image and the azimuthal average of that image. Figure 4.55 shows such 
difference images for the grand-design spiral M51. These images show two 
general properties of spiral structure: (i) spiral structure is present in both 
blue and red images but it has larger amplitude in blue images; (ii) spiral 
structure is smoother in red than in blue images. They also show a general 
characteristic of grand-design structure: the amplitude of the arms is modulated 
in a systematic way as a function of angle around the arm. Specifically, 
if in the lower panel of Figure 4.55 one follows the arm whose inner end is at 
3 o’clock, one sees that it passes through a minimum in amplitude around 5 
o’clock, and then another at 9 o’clock. Each of these minima is mirrored in 
the other spiral arm. 

Figure 4.55 Spiral structure in M51. These images are the differences between 
images of M51 and the azimuthal averages of those images. The top panel is for 
the B band and the bottom panel is for the I band. [After Elmegreen, Elmegreen 
& Seiden (1989) by courtesy of B. Elmegreen] 
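
The difference-image technique is straightforward to sketch. The following minimal example, in plain Python, subtracts from each pixel the mean brightness of the circular annulus in which it lies. It assumes a face-on disk (no deprojection), a known center and annuli one pixel wide; the synthetic two-armed image and all of its parameters are hypothetical and serve only to exercise the function.

```python
import math

def subtract_azimuthal_average(image, xc, yc, bin_width=1.0):
    """Return image minus its azimuthal average about the pixel position (xc, yc).

    image is a list of rows; the average is taken in circular annuli of
    width bin_width pixels, so a face-on disk is assumed.
    """
    sums, counts = {}, {}
    for y, row in enumerate(image):
        for x, value in enumerate(row):
            k = int(math.hypot(x - xc, y - yc) / bin_width)
            sums[k] = sums.get(k, 0.0) + value
            counts[k] = counts.get(k, 0) + 1
    residual = []
    for y, row in enumerate(image):
        out = []
        for x, value in enumerate(row):
            k = int(math.hypot(x - xc, y - yc) / bin_width)
            out.append(value - sums[k] / counts[k])
        residual.append(out)
    return residual

# Synthetic image: exponential disk modulated by a two-armed logarithmic spiral.
size, centre = 101, 50
image = []
for y in range(size):
    row = []
    for x in range(size):
        r = math.hypot(x - centre, y - centre)
        phi = math.atan2(y - centre, x - centre)
        spiral = 0.3 * math.cos(2.0 * phi - 6.0 * math.log(r + 1.0))
        row.append(math.exp(-r / 15.0) * (1.0 + spiral))
    image.append(row)

residual = subtract_azimuthal_average(image, centre, centre)
print("largest positive residual:", max(max(row) for row in residual))
print("largest negative residual:", min(min(row) for row in residual))
```

By construction the residuals in every annulus sum to zero, so the arms appear as positive ridges and the inter-arm regions as negative troughs.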

Figure 4.56 shows U, B and O azimuthal profiles for the giant Sb galaxy 
M81 at r = 475”, together with two of the associated color indices. Two 
points should be noted about this diagram. (i) The two spiral arms which 
feature so clearly in the top panel appear in the U band to be made up of 
a fairly symmetrical broad wave, on top of which is superimposed a pair of 
narrower peaks. In the B and O bands, these narrower peaks are much less 

Box 4.3: Pitch Angles 

The spiral arms in Figure 4.55 are roughly logarithmic spirals, φ ∝ u, 
where φ is the usual azimuthal angle and u = ln(R). This observation 
suggests that it might be useful to express the brightness I of a disk as 
a Fourier sum of logarithmic spirals: 

    I(u, \phi) = \sum_{m=-\infty}^{\infty} \int_{-\infty}^{\infty} dp \, I_m(p) \, e^{\mathrm{i}(pu + m\phi)}. 

I_m(p) is the amplitude of the m-armed spiral of pitch angle i = 
−arctan(m/p). For a grand-design spiral I_m(p) peaks strongly at m = 2 
and some value p_0 of p, and one says that the galaxy has pitch angle 
i_0 = −arctan(2/p_0). Pitch angles have been estimated for many galaxies; 
see Garcia Gomez & Athanassoula (1993) for details. They increase 
with Hubble stage from ~ 10° at Sa to ~ 18° at Sc. 
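
The relation between the radial wavenumber p and the pitch angle can be made concrete with a short calculation. The sketch below uses the box's relation, pitch angle = −arctan(m/p), and the fact that along the ridge of a single component the phase pu + mφ is constant. The function names are introduced here for illustration only.

```python
import math

def pitch_angle_deg(m, p):
    """Pitch angle -arctan(m/p), in degrees, of the m-armed component of wavenumber p."""
    return -math.degrees(math.atan(m / p))

def wavenumber_for_pitch(m, pitch_deg):
    """Inverse relation: p = -m / tan(pitch angle)."""
    return -m / math.tan(math.radians(pitch_deg))

def log_spiral_azimuth(R, m, p, phi0=0.0):
    """Azimuth (radians) along the ridge on which p*ln(R) + m*phi stays constant."""
    return phi0 - (p / m) * math.log(R)

for pitch in (10.0, 18.0):
    p = wavenumber_for_pitch(2, pitch)
    recovered = pitch_angle_deg(2, p)
    print(f"pitch {pitch:.0f} deg -> p = {p:.2f} -> recovered {recovered:.1f} deg")
```

With this sign convention a two-armed pattern with a pitch angle of 10° corresponds to |p| ≈ 11, and one with 18° to |p| ≈ 6, so more tightly wound arms put their power at larger |p|.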


conspicuous. (ii) The bottom panel of Figure 4.56 shows that the (B − O) 
color averages ~ 0.8 mag and changes by no more than ±0.2 mag across an 
arm. 

In his classic study of six giant spirals Schweizer (1976) found that, in 
the O (orange) passband, the arms typically contribute about 17% of the 
total brightness at 3 kpc, rising to 50% of the brightness at 15 kpc. The 
arms are generally 20% stronger in the B passband and 50% stronger in 
the U passband than they are in the O band. The underlying disks prove 
to be very uniform in color and rather red. For the disks in his sample, 
(B − V) ≈ 0.75 ± 0.05, which is redder than many old galactic clusters, 
although it is a little over 0.1 mag bluer than the central parts of giant 
elliptical galaxies. As Figure 4.55 shows, the ridges of arms are bluer than a 
disk, so the growth in strength of the arms relative to the disk with increasing 
radius causes the overall colors of the galaxy to become slightly bluer at large 
radii. 

If we anticipate results concerning the interstellar media of galaxies that 
will be discussed in §8.2.3, the observations described above are fairly easy 
to understand. First, in all cases in which it is possible to determine which is 
the nearer and which the further side of a spiral galaxy (see §4.4.1), velocity 
measurements show that the spiral arms are trailing.^{22} That is, stars and gas 
enter the arms on their concave inner edges and leave them on their convex 
outer edges. 

Second, surface brightness at the blue end of the spectrum is sensitive 


22 In the majority of cases it is not clear which is the nearer side of a galaxy. Then one 
determines this by assuming that the arms trail. NGC 4622 has both leading and trailing 
arms (Buta, Crocker & Byrd 1992). 

Figure 4.56 Spiral structure in M81. U, B, and O azimuthal 
profiles at r = 475 arcsec. The angle φ is measured from the apparent 
major axis in the direction of rotation. The profile has been 
continued periodically beyond φ = 360°. Notice how the U profile 
consists of narrow peaks superposed on a broader wave pattern 
which resembles the B and O profiles. Much of the small fluctuation 
in the (B − O) color can be accounted for by an instrumental 
malfunction. [After Schweizer (1976)] 


to the presence of short-lived stars. These form on the inner edges of spiral 
arms as a response to shock-compression of interstellar gas, blaze briefly 
as very blue objects as they traverse the peak of the arm, and are then 
extinguished. Surface brightness in the red and infrared is determined by a 
wider range of stars, including objects that do not evolve significantly in the 
time it takes a star to drift across a spiral arm. Hence O- and I-band surface 
brightnesses give truer impressions of the enhancement in the mass density 
that underlies spiral structure. This is why in Figure 4.55 arms are fuzzier 
and less pronounced in I than in B. 

The uniformity and numerical magnitude of the color of the broad com- 
ponents in Figure 4.56 is of some importance since it very strongly suggests 
that old disk stars (of relatively late spectral types) contribute about 40% 
of the increased brightness found near the spiral arms.^{23} If we accept this 
conclusion, it follows that spiral structure is not an entirely gas-dynamic 


23 In principle the observed uniformity of color could be consistent with all the arms’ 
light coming from young stars if the latter were unusually heavily reddened, by A_V ≈ 
0.5 mag. 

phenomenon, which leaves the bulk of the mass in the disk quite unmoved. 
Instead it must be associated with some sort of gravity wave that propagates 
through the stellar disk. 

Finally, the minima in the amplitudes of the arms as a function of az- 
imuth have been convincingly interpreted as nodes in the interference pattern 
of two spiral waves, a dominant trailing wave and a subordinate leading wave 
(Toomre 1981). That is, at the minima, a trough of the leading wave crosses 
the peak of the trailing wave, partially canceling it. In this picture the disk 
forms a resonant cavity within which energy is carried in and out by leading 
and trailing spiral waves. 
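
The interference picture can be illustrated with a toy calculation. The sketch below superposes a dominant wave and a weaker wave with the opposite sense of winding, both written as two-armed logarithmic spirals in the notation of Box 4.3, and evaluates their sum along the crest of the dominant wave. The amplitudes and the wavenumber are hypothetical, and the calculation is only meant to show how the arm strength is modulated between A_t − A_l and A_t + A_l; it is not a model of M51.

```python
import math

def combined_on_dominant_crest(u, A_t=1.0, A_l=0.3, p=5.0):
    """Sum of the two waves evaluated on the crest of the dominant wave.

    Dominant wave:    A_t * cos(m*phi + p*u)
    Subordinate wave: A_l * cos(m*phi - p*u)   (opposite sense of winding)
    with u = ln(R). On the dominant crest m*phi = -p*u.
    """
    m_phi = -p * u
    dominant = A_t * math.cos(m_phi + p * u)
    subordinate = A_l * math.cos(m_phi - p * u)
    return dominant + subordinate

# Sample the crest from u = 0 to u = 2 in steps of 0.01.
samples = [combined_on_dominant_crest(0.01 * k) for k in range(201)]
print(f"amplitude on the crest varies between {min(samples):.2f} "
      f"and {max(samples):.2f}")
```

The minima occur where a trough of the weaker wave crosses the crest of the stronger one, which is the partial cancellation described above.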

It is thought that grand-design spiral structure is usually generated by 
either a rotating bar within the galaxy (see below) or the tidal gravitational 
field of a companion galaxy. In the case of M51, the perturbing 
companion, NGC 5195, is clearly visible to the north in Figure 4.55. 

The more common but less photogenic ratty spiral structure is probably 
caused by ephemeral gravitational instabilities within the disk (Sellwood & 
Kahn 1991). The dynamical details are complex and not fully understood but 
we know that in disks both stars and gas have small random velocities (see 
§8.1.4, §9.2.6 and §11.3.2) with the consequence that any initial irregularities 
are quickly amplified into short spiral arms. These arms convert some of the 
disk’s ordered rotational kinetic energy into random kinetic energy, which 
inhibits the formation of further arms. Hence galaxies can continue to display 
spiral structure only if they have a mechanism for dissipating disordered 
kinetic energy. In gas-rich systems, interstellar gas provides this mechanism. 
In §8.2.5 we shall see that S0 galaxies have relatively little interstellar gas, 
and this may explain why they do not display conspicuous spiral structure. 


4.4.7 Barred galaxies 


Of order one half of all disk galaxies are barred. In particular the Milky Way 
is believed to be barred (see §10.1). Therefore the study of barred galaxies 
is an important field. It is, however, a difficult field and the effort that has 
so far been devoted to it is small compared with the effort that has been 
devoted to the study of elliptical galaxies, for example. 

Fortunately, over the last decade advances in the techniques of surface 
photometry have opened up the field and the literature now contains high- 
quality photometry of a fair number of barred systems. Bars occur in the 
bright inner parts of galaxies, so study of them benefits considerably from 
the large dynamic range of CCD detectors relative to photographic plates. 
As we saw in §4.1, dust lanes are often prominent on blue images of barred 
galaxies. While a dust lane can be of considerable interest when modeling 
the flow of interstellar gas, it does make it hard to determine the luminosity 
distribution. Hence, near-IR photometry, in which dust lanes are much less 
prominent, is extremely valuable and is now becoming accessible. 

Figure 4.57 B-band isophotes of two typical SB0 galaxies, NGC 4371 (left) and NGC 
4477 (right). The fields are 5.3 arcmin on a side and the isophotes are 0.5 mag apart. 
[After Wozniak & Pierce (1991) courtesy of H. Wozniak] 


Elmegreen & Elmegreen (1985) and Baumgart & Peterson (1986) be- 
tween them published photographic photometry of 24 barred spiral galaxies 
of all types. Jarvis et al. (1988), Forbes & DePoy (1992) and Wozniak & 
Pierce (1991) have published CCD photometry for over a dozen early-type 
barred galaxies. Figure 4.57 shows the B-band isophotes of two typical SB0 
galaxies. At small radii nearly elliptical isophotes crowd closely together. 
Then the isophotes become suddenly more elongated and less densely spaced. 
This is the region of the bar. Slightly further out still there is an abrupt 
change in the direction of the isophotes’ major axis as the bar gives way to 
an inclined disk. Figure 4.58 shows the results of fitting ellipses to the iso- 
photes of these two galaxies by plotting the surface-brightness and ellipticity 
of each ellipse versus the length of its semimajor axis. In these plots each 
bar is associated with characteristic features: a slight bump in the luminosity 
profile and a sharp peak in the ellipticity profile. 

Notice in Figure 4.57 that the isophote of NGC 4371 that most nearly 
delineates the bar is not at all elliptical; rather it resembles an oval tray 
with two handles sticking out of it. This morphology is characteristic of 
bars, although it is not always observed as the example of NGC 4477 shows. 
Athanassoula et al. (1990) show that good fits to such isophotes can be 
obtained with generalized ellipses, which are the curves defined by 


    (|x|/a)^c + (|y|/b)^c = 1,    (4.47)


where a, b and c are constants characterizing the curve. The larger c is, the 
squarer is the generalized ellipse. 
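
The effect of the exponent c in equation (4.47) is easy to see numerically. The sketch below uses one convenient parametrization of the curve, x = a sgn(cos t)|cos t|^{2/c} and y = b sgn(sin t)|sin t|^{2/c}, which satisfies (|x|/a)^c + (|y|/b)^c = 1 identically. The parametrization and the values of a, b and c are choices made here for illustration and are not taken from Athanassoula et al. (1990).

```python
import math

def generalized_ellipse_point(t, a, b, c):
    """Point (x, y) on (|x|/a)^c + (|y|/b)^c = 1 for parameter t in radians."""
    cos_t, sin_t = math.cos(t), math.sin(t)
    x = a * math.copysign(abs(cos_t) ** (2.0 / c), cos_t)
    y = b * math.copysign(abs(sin_t) ** (2.0 / c), sin_t)
    return x, y

def lhs(x, y, a, b, c):
    """Left-hand side of equation (4.47); equals 1 on the curve."""
    return (abs(x) / a) ** c + (abs(y) / b) ** c

a, b = 3.0, 1.0
for c in (2.0, 3.0, 5.0):
    x, y = generalized_ellipse_point(math.pi / 4.0, a, b, c)
    # c = 2 is an ordinary ellipse; larger c moves this point toward the corner (a, b).
    print(f"c = {c}: x/a = {x / a:.3f}, y/b = {y / b:.3f}, "
          f"check = {lhs(x, y, a, b, c):.6f}")
```

As c increases, the point at t = π/4 moves toward the corner (a, b) of the bounding rectangle, which is the sense in which a larger c gives a squarer curve.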

It is obviously important to determine the true shapes of bars. There are 
two difficulties. First one must determine the inclination of the system. This 
is usually done by assuming that the outermost isophotes, which should be 
Figure 4.58 The results of fitting ellipses to the isophotes of NGC 
4371 and NGC 4477 shown in Figure 4.57. Here a and ε are, respectively, 
the length of an ellipse’s semimajor axis and its ellipticity 
1 − b/a. The bar in NGC 4371 produces a bump in the luminosity 
profile near a = 0.5 arcmin while that in NGC 4477 produces 
a bump at a = 0.4 arcmin. There are corresponding peaks in the 
ellipticity profiles. The bulge of NGC 4371 is triaxial and gives rise 
to a peak in the ellipticity profile at a ~ 0.2 arcmin. [After Wozniak 
& Pierce (1991) from data kindly supplied by H. Wozniak] 


generated by the outer disk, beyond the influence of the bar, are intrinsically 
round. The second and more difficult problem is making allowance for the 
presence of the bulge, which is obviously prominent in early-type galaxies 
such as those shown in Figure 4.57. Athanassoula et al. (1990) simply mask 
out the center of their galaxies and fit generalized ellipses to what remains of 
the isophotes. This procedure indicates that the ellipticity of the bar declines 
roughly linearly with radius, from values of order 0.7 at the smallest radius 
for which an estimate can be obtained. Thus they conclude that bars have 
axis ratios in the plane of 3:1 or greater. 
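
The step from an ellipticity of order 0.7 to an axis ratio of about 3:1 uses only the definition of ellipticity as 1 − b/a. A minimal Python sketch of the conversion follows; the function name is an assumption made for illustration.

```python
def axis_ratio_from_ellipticity(eps):
    """Convert ellipticity eps = 1 - b/a into the axis ratio a/b."""
    if not 0.0 <= eps < 1.0:
        raise ValueError("ellipticity must lie in [0, 1)")
    return 1.0 / (1.0 - eps)

# Ellipticity of order 0.7, as quoted for the inner part of a bar.
ratio = axis_ratio_from_ellipticity(0.7)
print(f"a/b = {ratio:.2f}")
```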

The presence of a prominent bulge makes it hard to determine how 
surface brightness varies as a function of distance down the major axis of 
a bar. Nonetheless, Elmegreen & Elmegreen (1985) and Elmegreen et al. 
(1996) have estimated this quantity for about two dozen galaxies. Figure 4.59 
shows results for two characteristically different galaxies. In each panel the 
full curve shows the surface-brightness profile of the galaxy along the bar’s 
apparent major axis, while the dashed curve shows surface-brightness along 
the bar’s minor axis. Double-headed arrows show the extent of each bar along 
the major axis. The left-hand panel is for the SBbc galaxy NGC 1300. In 
this galaxy the surface brightness is essentially independent of radius outside 
of a nuclear cusp. The right-hand panel in Figure 4.59 is for the SBcd galaxy 
NGC 7741, in which the surface brightness falls at least as steeply within 
Figure 4.59 J-band surface brightness profiles of two barred galaxies; 
full curves are for the major axes, dashed curves are for the minor 
axes. The arrows indicate the lengths of the bars. The major-axis 
profile of the SBbc galaxy NGC 1300 at left is characteristic of 
early-type galaxies. In it the surface brightness drops less steeply in 
the bar than outside it. The major-axis profile of the SBcd galaxy 
NGC 7741 at right is characteristic of late-type systems. [Adapted 
from Elmegreen et al. (1996) courtesy of B. Elmegreen.] 


the bar as it does outside it. Elmegreen & Elmegreen (1985) suggested 
that in early-type systems the surface brightness declines more slowly with 
increasing radius within the bar than in the surrounding disk, as in the left- 
hand panel of Figure 4.59, while in late-type galaxies the surface brightness 
declines as steeply within the bar as outside it, as in the right-hand panel of 
Figure 4.59. 

A key dynamical parameter of a stellar bar is its pattern speed, that 
is, the angular velocity with which its figure rotates. It is also useful to define 
the corotation radius, R_CR, which is the radius at which a star can move 
on a circular orbit at the same angular frequency as the bar’s pattern (see 
§3.3.2 of BT). Elmegreen et al. (1996) summarize the available estimates of 
the ratio of R_CR to the length of the bar’s semi-major axis, and conclude 
that, with one exception, this ratio is 1.2 ± 0.2. Thus, bars extend to about 
80% of the corotation radius. 
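
The figure of about 80% is simply the reciprocal of the quoted ratio. The short Python sketch below makes the inversion explicit and shows the range implied by the quoted scatter of 0.2; the function name is hypothetical, and treating the scatter as a simple interval is an assumption made for illustration.

```python
def bar_fraction_of_corotation(rcr_over_bar):
    """Invert the ratio (corotation radius / bar semi-major axis)."""
    if rcr_over_bar <= 0:
        raise ValueError("ratio must be positive")
    return 1.0 / rcr_over_bar

central = bar_fraction_of_corotation(1.2)          # quoted central value
shortest = bar_fraction_of_corotation(1.2 + 0.2)   # upper end of the ratio
longest = bar_fraction_of_corotation(1.2 - 0.2)    # lower end of the ratio

print(f"bar length / corotation radius = {central:.2f} "
      f"(range {shortest:.2f} to {longest:.2f})")
```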


Vertical structure of bars In order to study the structure of galactic 
bars in the third dimension, perpendicular to the planes of their host galaxies, 
we must obtain photometry of edge-on barred galaxies. Unfortunately, the 
structure that we identify as a bar is only discernible in a reasonably face-on 
galaxy, and so it is almost impossible to tell whether an edge-on galaxy con- 
tains a bar. The development of our understanding of the three-dimensional 
structure of bars is therefore largely dictated by the results of numerical sim- 
ulations. The most important discovery from these calculations is that thin 
bars seem to be generically unstable structures which buckle about their 
midpoints out of the plane of the bar (Combes & Sanders 1981, Raha et 
al. 1991). The simulations show that the bar initially bends into an asymmetric 
banana-shape, but that it ultimately regains its symmetry about the 
midplane of the galaxy, forming a thickened double-lobed structure rather 
like an unshelled peanut. Viewed end-on such fattened bars appear round, 
but when viewed from the side they reveal their distinctive double-lobed ap- 
pearance. These numerical results are intriguing, since, as we saw in §4.4.4, 
~ 25% of edge-on bulges seem to have boxy or peanut-shaped isophotes. It 
is therefore tempting to suggest that these boxy bulges contain buckled bars 
that are viewed more-or-less edge-on, while the remaining ~ 75% of galaxies 
with elliptical bulges are made up from unbarred galaxies and barred galaxies 
where the bar happens to be oriented approximately end-on. 

Circumstantial support for this hypothesis has come from a study of 
the boxy-bulged S0 galaxy NGC 1381. de Carvalho & da Costa (1987) 
have applied photometric decomposition techniques (§4.4.3) to images of the 
galaxy, and they find that this system appears to contain a third component 
in addition to the usual disk and bulge. This component, intermediate in 
size between the bulge and the disk, has an approximately uniform surface 
brightness that cuts off sharply at the same radius on either side of the 
galaxy. As we have seen above, bars in early-type disk galaxies have exactly 
this structure, and so these observations have been interpreted as implying 
that this boxy-bulged galaxy also contains a bar. A more extensive study 
of the photometry of edge-on galaxies by Dettmar & Barteldrees (1990) has 
confirmed that extra bar-like components in the light distribution are closely 
associated with boxy and peanut-shaped bulges. However, this evidence 
is not conclusive: the extra component could be an entirely axisymmetric 
distribution of light rather than a bar. Similarly, it is possible to construct 
axisymmetric bulge models with a toroidal structure which appear peanut- 
shaped in projection (Rowley 1988). Thus, the photometric observations do 
not provide direct evidence for non-axisymmetric structures in these edge-on 
galaxies. 

A further probe of non-axisymmetric structure in edge-on galaxies has 
been proposed by Kuijken & Merrifield (1995). They point out that the orbits 
followed by material in a barred potential are more complicated than those 
in an axisymmetric system. They have therefore obtained optical spectra of 
edge-on galaxies in order to study the motions of the gas within these systems 
via their Doppler shifts. As Figure 4.60 illustrates, galaxies with elliptical 
bulges tend to have simple kinematic structures, with all the material on one 
side of the galaxy redshifted due to its motion away from us, whilst all the 
material on the other side is blueshifted due to its motion towards us. These 
simple kinematics imply that the material is following the circular orbits that 
we would expect in an axisymmetric galaxy. Galaxies with peanut-shaped 
bulges, on the other hand, reveal much more complex structure in the line- 
of-sight motions of their constituent gas, implying that this material follows 
the more complicated orbits associated with a barred potential. As we shall 
see in §9.4, similar reasoning has been applied to spectral observations of 
the Milky Way to infer that our own galaxy is also barred; it is therefore 
Figure 4.60 Illustration of the complex gas kinematics associated with peanut-shaped 
bulges. The top panels show images of a round-bulged galaxy (NGC 1055) and a galaxy 
with a peanut-shaped bulge (NGC 5746). The dust lanes which cut across the bulges of 
these galaxies indicate that they are close to edge-on. The lower panels show the 658.3 nm 
[NII] emission line as observed along the major axes of these systems (dashed lines in top 
panels). The plots show the observed wavelength λ of the emission line (either redshifted 
or blueshifted due to the line-of-sight motion of the gas) as a function of position along 
the major axis. 


interesting to note that the bulge of the Milky Way has a distinctly boxy 
appearance (see Figure 1.8). Thus, there is a clear kinematic link between 
peanut-shaped bulges and barred galaxies, which strongly suggests that the 
vertical structure of bars produces this double-lobed structure. 


Rings in SB galaxies In §4.1 we saw that barred galaxies often contain 
rings and/or lenses. Buta (1986) measured the diameters and orientations of 
these features in a sample of over 1200 objects. He found that ~ 75% of his 
galaxies have an inner ring. From the distribution of apparent axial ratios of 
these rings he concluded that the rings have intrinsic axial ratio q ≳ 0.9 in 
SA galaxies and q ~ 0.8 in SB galaxies. The measured distribution of angles 
between the apparent major axes of the bar and the ring can be reproduced 
by a model in which the true major axis of the ring coincides with the true 
major axis of the bar. 

Buta found outer rings in ~ 50% of his galaxies, almost all in clearly 
barred systems. These rings are on average slightly less elliptical than inner 
rings. He was unable to reproduce his distribution of angles between the 
apparent major axes of the bar and outer ring by assuming that the corre- 
sponding true axes are aligned; he required that some be mutually parallel 
and some mutually perpendicular, with the majority being perpendicular. 

Some galaxies have more than one ring and it is interesting to inspect the 
Figure 4.61 The Small Magellanic Cloud. Notice the egg-shaped 
bar which is clearly displaced from the center of the surrounding 
luminosity. [Photograph from the Carnegie Atlas courtesy of 
A. Sandage] 


distribution of the ratio of the diameters of these rings. This distribution is 
strongly peaked around 2, independent of Hubble type, but the width of the 
peak in the distribution does seem to exceed that expected from measuring 
errors alone. 

Since rings are visible in near-IR frames as well as in B-band frames, 
they cannot be exclusively hydrodynamical phenomena that influence only 
the distribution of young stars. The mechanism responsible for rings must 
alter the distribution of older, redder stars. 


Dust lanes in SB galaxies Many barred galaxies display characteristic 
dust lanes. These often run almost but not quite radially from the ends of 
the bar; they keep toward the side of the bar which leads in the rotation of 
the whole galaxy. An excellent example of this phenomenon can be seen in 
the photograph of NGC 1300 in Figure 4.7. In galaxies later than the SBb 
galaxy NGC 1300, the dust lanes run along lines further removed from the 
principal axis of the bar, and through the less-well-illuminated disk. In very 
late-type specimens, the absorption lanes may be seen to curve in toward 
the nucleus as they reach the bar’s minor axis, thus partially obscuring the 
center of the galaxy. In this event, a contour map of the galaxy’s brightness 
will appear double peaked. These dust lanes are produced by the streaming 
of gas along the length of the bar. 


Lop-sidedness in SB galaxies Barred galaxies are commonly lopsided. 
This phenomenon is most clearly seen in the Magellanic-type galaxies, which 
usually have rather egg-shaped bars that are often not situated at the cen- 
ter of the surrounding light distribution (de Vaucouleurs & Freeman 1970). 
The photograph of the Small Magellanic Cloud shown in Figure 4.61 illus- 
trates this phenomenon, which may also be detected in giant SB systems — 
see, for example, the photograph of NGC 4548 in the Hubble Atlas. This 
phenomenon is very ill-understood. It may be related to lop-sidedness in 
unbarred galaxies, which is discussed in §8.2.3. 


4.5 Globular cluster systems 


The Milky Way possesses at least 160 globular clusters. These objects, which 
have been intensively studied for more than a century and have played a 
prominent rôle in the development of our understanding of the Universe, 
are the subject of Chapter 6. Most external galaxies also possess significant 
numbers of globular clusters and the study of the globular cluster systems 
of external galaxies has gradually become an important branch of galactic 
astronomy. 

In Chapter 6 we shall see that a typical Galactic globular cluster has 
absolute magnitude M_V ~ −7.3 and half-light radius R_h ~ 5 pc. At the 
distance of M31 (700 kpc) a similar cluster would have V = 16.9 and R_h = 
1.5″. As such it would be identifiable from the ground as a moderately 
faint, slightly non-stellar object. The HST would be able to discern its 
non-stellar nature at distances of order 10 Mpc, but at greater distances it would 
be merely a faint stellar object. Nevertheless, since the pioneering work of 
Hanes (1977) and Strom et al. (1981) globular clusters have been studied 
in galaxies as distant as ~ 100 Mpc. The technique by which this is done is 
as follows. 
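
The apparent magnitude and angular size quoted above for a cluster at the distance of M31 follow from the standard distance-modulus relation and the small-angle approximation. The Python sketch below reproduces them; the function names are illustrative, and extinction is ignored, which is an assumption since the text states no correction.

```python
import math

ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi

def apparent_magnitude(abs_mag, distance_pc):
    """Distance modulus: m = M + 5 log10(d / 10 pc), no extinction."""
    return abs_mag + 5.0 * math.log10(distance_pc / 10.0)

def angular_size_arcsec(size_pc, distance_pc):
    """Small-angle approximation for the angle subtended by size_pc."""
    return size_pc / distance_pc * ARCSEC_PER_RADIAN

d_m31 = 700e3  # distance of M31 in pc, as quoted in the text

V = apparent_magnitude(-7.3, d_m31)
theta = angular_size_arcsec(5.0, d_m31)

print(f"V = {V:.1f}, half-light radius = {theta:.1f} arcsec")
```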

A helpful first step is to subtract from a deep image of the galaxy under 
study a model of the galaxy’s own luminosity distribution. This step is 
most satisfactory if the galaxy is an elliptical and therefore has a smooth 
luminosity distribution. Globular clusters will manifest themselves in the 
galaxy-subtracted image by an enhancement in the density of faint objects 
towards the position of the galaxy’s nucleus. Some of these faint objects will 
be distant galaxies, but these can be identified as such by the non-stellar na- 
ture of their images. Some of the remaining faint objects will be foreground 
stars in the Milky Way, but most of these will be redder than any globular 
cluster. When non-stellar objects and red objects are eliminated, spectro- 
scopic observations have shown that in the case of an elliptical galaxy, of 
order 90% of the remaining objects will be globular clusters (Zepf, Geisler 
& Ashman 1994). If colors are not available, it will be necessary to estimate 
the number of foreground stars in the surviving sample from a model of the 
Milky Way (§10.6), and then make a statistical correction to obtain an esti- 
mate of the number of clusters in the sample. This correction will be large 
and uncertain for the faintest magnitude bins and will dominate the error in 
the number of clusters present. 

In general the detection of globular clusters in a spiral galaxy is very 
much harder than in a similarly distant elliptical galaxy, because it will 
be impossible to model accurately the uneven luminosity distribution of the 
disk. Moreover, even when globular clusters have been successfully identified, 
it is hard to determine the appropriate corrections to their magnitudes for 
extinction by dust in the spiral’s disk. These problems are minimized if the 
galaxy is edge-on. 


4.5.1 Globular cluster luminosity function 


Once the apparent magnitudes of the globular clusters around a galaxy have 
been determined, it is natural to plot a histogram of the numbers of clusters in 
each of several magnitude intervals. It has been found that such a histogram 
is invariably well represented by a Gaussian that has been clipped at the 
faint end by virtue of the impossibility of detecting apparently faint clusters. 
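That is, if the limiting magnitude of the survey is m_lim, then the number dN of 
clusters with magnitudes in (m, m+dm) is given by 


$$\frac{dN}{dm} = A \times \begin{cases} e^{-(m-m_0)^2/2\sigma_m^2} & \text{for } m < m_{\rm lim}, \\ 0 & \text{otherwise.} \end{cases} \qquad (4.48)$$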


Figure 4.62 shows the luminosity distribution of the clusters in the E0 galaxy 
NGC 3379, which is sufficiently close (cz = 630 km s⁻¹) for m_lim for ground-based 
counts (B ~ 24) to be significantly fainter than the peak magnitude 
m_0 in (4.48). The best-fitting Gaussian to the data shown in Figure 4.62 has 
B_0 = 22.67 and σ_m = 1.2 mag. 
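
As an illustration of the clipped Gaussian (4.48), the following Python sketch evaluates the counts per unit magnitude and the fraction of the full Gaussian that lies brighter than the survey limit. The function names are hypothetical and the normalization A = 1 is an arbitrary assumption; the peak, dispersion and limit are the NGC 3379 values quoted above.

```python
import math

def gclf_counts(m, A, m0, sigma_m, m_lim):
    """Counts per unit magnitude: a Gaussian clipped at the faint limit."""
    if m >= m_lim:
        return 0.0
    return A * math.exp(-((m - m0) ** 2) / (2.0 * sigma_m ** 2))

def detected_fraction(m0, sigma_m, m_lim):
    """Fraction of the full Gaussian that is brighter than m_lim."""
    z = (m_lim - m0) / (math.sqrt(2.0) * sigma_m)
    return 0.5 * (1.0 + math.erf(z))

# Peak and dispersion quoted for NGC 3379; limit B ~ 24 for ground-based counts.
m0, sigma_m, m_lim = 22.67, 1.2, 24.0

frac = detected_fraction(m0, sigma_m, m_lim)
print(f"counts at peak (A = 1): {gclf_counts(m0, 1.0, m0, sigma_m, m_lim):.2f}")
print(f"fraction of clusters brighter than the limit: {frac:.2f}")
```

With the limit more than one dispersion fainter than the peak, most of the distribution is sampled, which is why the turnover can be located in this galaxy.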

If m_lim is too bright, the counts will not turn over and one cannot 
simultaneously determine both σ_m and m_0 in (4.48). That is, satisfactory fits 
to the counts will be obtainable with a one-parameter family of Gaussians 
along which σ_m will increase as m_0 increases. In these circumstances the 
value of σ_m is usually set to σ_m = 1.25 mag on the basis that this value 
fits the data for nearby galaxies (including the Milky Way; see §10.5.1) 
Figure 4.63 The specific frequency of globular clusters in 56 galaxies plotted against M_V 
(left-hand panel) and against Hubble type (right-hand panel). [From data published in 
Harris (1991) and Zepf, Geisler & Ashman (1994).] 


and then one determines the value of m_0 that provides the best fit of (4.48) 
to the data (Hanes & Whittaker 1987). When this is done, one finds that 
m_0 varies with the distance d to a galaxy in a way that is consistent with 
M_0 = m_0 − 5 log d being a universal constant. This suggests that there is a 
universal luminosity function for globular clusters of the form 


$$\Phi(M) = \text{constant} \times e^{-(M-M_0)^2/2\sigma_m^2}. \qquad (4.49)$$


Harris (1991) finds M_0(V) = −7.27 ± 0.23. This result gives rise to one 
method for determining the distances to galaxies (see §7.3.2). 
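
A minimal Python sketch of how a universal turnover magnitude yields a distance is given below. It assumes the usual 10 pc zero point of the distance modulus and ignores extinction; the observed V-band turnover of 23.73 is a hypothetical value invented for the example, and the function name is illustrative. Only the uncertainty in M_0 is propagated.

```python
import math

M0_V = -7.27      # turnover absolute magnitude from Harris (1991)
M0_V_ERR = 0.23   # its quoted uncertainty

def distance_from_turnover(m0, M0=M0_V):
    """Distance in pc from m0 - M0 = 5 log10(d / 10 pc), no extinction."""
    return 10.0 * 10.0 ** ((m0 - M0) / 5.0)

# Hypothetical observed V-band turnover magnitude (not a measured value).
m0_obs = 23.73

d_pc = distance_from_turnover(m0_obs)
# Fractional distance error from the uncertainty in M0 alone:
# delta_d / d = 0.2 ln(10) * delta_M0.
frac_err = 0.2 * math.log(10.0) * M0_V_ERR

print(f"d = {d_pc / 1e6:.1f} Mpc, fractional error = {frac_err:.2f}")
```

The uncertainty in M_0 alone limits the accuracy of such a distance to roughly ten percent, before any error in the measured turnover is included.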


4.5.2 Specific frequency of globular clusters 


A quantity that is of considerable significance for theories of galaxy forma- 
tion is the number of globular clusters that galaxies of different types and 
sizes possess. Since by default we expect more luminous galaxies to contain 
more globular clusters than less luminous galaxies, we define the specific 
frequency of globular clusters in a galaxy of absolute magnitude M_V to 
be 

$$S_N = N_t\,10^{0.4(M_V+15)}, \qquad (4.50)$$


where N_t is the number of globular clusters that the galaxy is estimated 
to possess. Clearly, S_N = N_t for a galaxy of absolute magnitude M_V = 
−15 (for example the Local-Group dwarf elliptical NGC 147). Since it is 
usually possible to detect only the more luminous globular clusters, N_t is 
determined by fitting (4.48) to the observed number counts and then setting 
N_t = √(2π) σ_m A. Figure 4.63 plots S_N versus absolute magnitude M_V and 
versus Hubble type. Several conclusions can be drawn from these plots: (i) 
at both high and low luminosity S_N varies by at least an order of magnitude; 
(ii) spiral and Magellanic galaxies never have large values of S_N; (iii) the 
largest values of S_N occur in luminous early-type galaxies. There is one 
striking exception to this last rule: the galaxy with the largest value of S_N 
is the dwarf elliptical Fornax. The corresponding point (S_N = 73 ± 12) lies 
way off the top of the scales in Figure 4.63. Actually Fornax only possesses 
five globular clusters, but for this galaxy the second factor on the right of 
equation (4.50) is large because Fornax’s absolute magnitude is a piffling 
M_V = −12.3. 
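
The definition (4.50) and the normalization of the Gaussian fit can be written down in a few lines of Python. The function names below are illustrative assumptions. Applied to the rounded Fornax numbers quoted above (five clusters, absolute magnitude −12.3), the formula gives a specific frequency of about 60, somewhat below the quoted 73 ± 12, which presumably rests on slightly different adopted values.

```python
import math

def total_clusters(A, sigma_m):
    """Integral of the full Gaussian (4.48): sqrt(2 pi) * sigma_m * A."""
    return math.sqrt(2.0 * math.pi) * sigma_m * A

def specific_frequency(N_t, M_V):
    """Equation (4.50): cluster number scaled to a galaxy with M_V = -15."""
    return N_t * 10.0 ** (0.4 * (M_V + 15.0))

# Rounded values quoted in the text for the Fornax dwarf.
S_fornax = specific_frequency(5, -12.3)
print(f"S_N(Fornax) = {S_fornax:.0f}")

# By construction the specific frequency equals the cluster count at M_V = -15.
print(specific_frequency(100, -15.0))
```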

It has been argued that the lower values of S_N measured for spiral galaxies 
arise because globular clusters are associated with spheroidal components 
and not disks, so the specific frequency should be defined such that M_V in 
(4.50) refers to the luminosity of a galaxy’s spheroidal components (which is 
essentially the total luminosity in the case of an elliptical). This procedure would 
clearly increase the S_N values for spirals. However, there are two objections 
to defining S_N in this way. First, reliable absolute magnitudes for spheroidal 
components are in some cases hard to obtain because the spheroidal components 
are so faint; the Local-Group galaxy M33 is such a case. Second, in 
§10.5.1 we shall see that a large fraction of the Milky Way’s globular clusters 
seem to be more closely associated with the disk than the halo, and, 
moreover, globular clusters are seen to be forming in the disk of the LMC 
(Mould, Xystus & Da Costa 1993, Elson & Fall 1988). Thus the conventional 
definition of S_N makes perfect sense and a successful theory of galaxy 
formation must explain why some luminous early-type galaxies contain very 
large numbers of globular clusters. 

A classic example of such a well-endowed galaxy is M87, which sits at 
the middle of the Virgo cluster. This galaxy is estimated to possess ~ 16 000 
globular clusters, of which ~ 6000 have been identified to magnitude B = 
24.2. The total luminosity of these clusters is L_V ~ 2.6 × 10^9 L_⊙, which may be 
compared with the total luminosity of the galaxy, L_V = 10^{0.4[M_V(⊙) − M_V]} L_⊙ = 
1.0 × 10^11 L_⊙. Thus even in this unusually well-endowed system, globular 
clusters contribute only a couple of percent of the system’s light. 
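
The comparison for M87 can be checked with a few lines of Python. In this sketch the solar absolute magnitude M_V(⊙) = 4.83 is an assumed standard value that the text does not state, and the function names are illustrative; the luminosities and cluster number are those quoted above.

```python
import math

M_V_SUN = 4.83  # assumed solar absolute V magnitude (not given in the text)

def luminosity_in_solar_units(M_V):
    """L / L_sun = 10^{0.4 [M_V(sun) - M_V]}."""
    return 10.0 ** (0.4 * (M_V_SUN - M_V))

def absolute_magnitude(L_solar):
    """Inverse of luminosity_in_solar_units."""
    return M_V_SUN - 2.5 * math.log10(L_solar)

L_clusters = 2.6e9   # total cluster luminosity quoted for M87, in L_sun
L_galaxy = 1.0e11    # total luminosity quoted for M87, in L_sun
N_total = 16000      # estimated number of globular clusters

fraction = L_clusters / L_galaxy
mean_cluster_L = L_clusters / N_total
M_V_galaxy = absolute_magnitude(L_galaxy)

print(f"cluster light fraction = {100 * fraction:.1f}%")
print(f"mean cluster luminosity = {mean_cluster_L:.2e} L_sun")
print(f"implied M_V of the galaxy = {M_V_galaxy:.2f}")
```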


4.5.3 Radial density profiles and shapes 


The number of globular clusters per unit area of sky, N(R), can only be 
readily studied as a function of projected radius R in well-endowed elliptical 
galaxies since (i) we have seen that it is hard to detect globular clusters 
against the irregular background of a late-type galaxy, and (ii) the surface 
density in several annuli can be accurately determined only if the galaxy 
possesses many globular clusters. The available studies of N(R) show that 
N falls with R less rapidly than does the galaxy’s luminosity profile I(R). 
There are two aspects to this difference. First, in a plot of log N versus log R 
the slope at large R is generally slightly shallower than the corresponding 
slope in a plot of log I versus log R (Harris 1987). Second, the surface density 
of globular clusters flattens off near the center in a way that I does not. For 
example, N(R) for M87 rises very little inside R ~ 50 arcsec, whereas the 
stellar light rises steeply in to a radius < 7 arcsec (Lauer & Kormendy 1986). 
Although the Virgo elliptical M49 has only a third of the number of globular 
clusters that M87 possesses, the shapes of the plots of N(R) for these two 
galaxies are indistinguishable — the curve for M49 simply lies below that of 
M87. This observation clearly constrains any theory of why some galaxies are 
much better endowed with globular clusters than other, otherwise similar, 
galaxies. 

The shapes of globular clusters can only be studied in the nearest galax- 
ies since such a study requires well-resolved images. Han & Ryden (1994) 
describe the available data, which are effectively confined to the globular clus- 
ter systems of the Milky Way, M31 and the Magellanic Clouds. Figure 4.64 
illustrates the major result of these studies: while the globular clusters of the 
Milky Way and M31 are very nearly round, those of the LMC and SMC are 
significantly elongated. Specifically, half of the Milky Way’s globular clusters 
have undetectable ellipticity, while the modal ellipticity of the LMC’s 
clusters is ε = 1 − q ~ 0.85. It is likely that this difference in the shapes of 
globular clusters reflects differences in their ages: dynamical processes tend 
to make globular clusters more nearly spherical over time and most of the 
LMC’s globular clusters appear to be less than ~ 3 Gyr old (Da Costa 1990) 
while those in the Milky Way are more like 15 Gyr old (§6.1.4). 


4.5.4 Color distributions 


The ages and metallicities of globular clusters in external galaxies are clearly 
of considerable interest. In §5.4 we shall see that high-resolution spectra of in- 
dividual stars are required if one is to distinguish observationally between the 
effects of changes in the age and the metallicity of an old stellar population. 
On account of the faintness of the globular clusters of all galaxies outside 
the Local Group, such spectra are not available and the physical interpre- 
tation of the available color and line-strength data is inevitably ambiguous. 
Nevertheless such interpretations are routinely given in the literature in that 
Figure 4.66 The mean metallicity 
measured values of [Fe/H] are reported. These values are derived by (i) measuring 
colors or line-strength indices, and then (ii) assigning to the observed 
globular clusters the value of [Fe/H] of a Galactic globular cluster that has 
the same colors or line-strength indices. The implicit assumption employed 
is that the ages of all observed globular clusters are similar to those of the 
Milky Way. In some cases this is certainly untrue: young globular clusters 
are known in both the LMC and in the giant E/S0 galaxy NGC 1275 that 
sits at the center of the Perseus cluster (Zepf et al. 1995). 

Lee & Geisler (1993) used Washington photometry (Table 2.1) to de- 
termine [Fe/H] for over 400 globular clusters in M87 and give references to 
much earlier work on metallicity determinations. A major result of these 
studies is that globular clusters are systematically bluer by ~ 0.25 in B—V 
than the light from the underlying galaxy. If this color difference arises from 
a metallicity difference, the globular-cluster stars have to be on average more 
metal-poor by ~ 0.5 in [Fe/H]. 

The colors of individual globular clusters within a given galaxy span a 
wide range. Figure 4.65 illustrates this result in the case of M87, by show- 
ing the metallicity distribution that Lee & Geisler derive from their colors. 
As regards shape, this is not dissimilar to the metallicity distribution of Galactic 
globular clusters shown in Figure 10.32, but (i) the steep rise in N([Fe/H]) 
on the left of the plot occurs at values of [Fe/H] that are larger for M87 
by ~ 0.5 than they are for the Milky Way, and (ii) on the right of the plot 
the distribution for M87 extends to values of [Fe/H] that are larger by ~ 1. 
Thus M87 seems to possess significantly more metal-rich globular clusters 
than does the Milky Way, although the metal-poorest globular clusters in 
M87 would not be thought remarkable if they lay in the Milky Way. Figure 4.66 
suggests that the higher metallicities of the M87 globular clusters are 
connected with the large luminosity of M87 itself by showing that there is a 
general correlation between the mean metallicity of globular clusters, ⟨[Fe/H]⟩, 
and the luminosity of the parent galaxy. 

We have seen that the integrated light from elliptical galaxies becomes 
bluer with increasing galactocentric distance R. There is now fairly convinc- 
ing evidence that the mean color of the globular clusters at R also becomes 
bluer with increasing R — see Lee & Geisler (1993) for details. 


4.6 Abnormal galaxies 


One goal of any classification scheme is to discover which galaxies are abnor- 
mal and therefore merit further investigation. The Hubble classification sys- 
tem and its modifications and derivatives refer principally to normal galaxies 
— Hubble labeled all abnormal galaxies as Type II Irregulars (Irr II galaxies). 
In this section we take a look at the objects in this ‘rag-bag’ bin. 

There are broadly two very different ways in which a galaxy can stand 
out from the crowd and be classified as ‘abnormal’: (i) it can have an unusual 
spectral distribution, such as being anomalously powerful at radio wave- 
lengths, or showing strong, broad optical emission lines; (ii) it can look 
‘funny’ in optical images perhaps by having a patchy, irregular brightness 
distribution, displaying a long luminous structure of some sort, or by pos- 
sessing an anomalously bright nucleus. We shall find that funny-looking 
galaxies frequently have abnormal spectra, so studies of abnormal galaxies 
frequently focus on the same objects irrespective of whether their starting 
point is spectroscopy or imaging. 


4.6.1 Starbursting systems 


Several astronomers have published catalogs of funny-looking galaxies (Arp 
1966, Arp, Madore & Roberton 1987, Vorontsov-Velyaminov et al. 1962–1974, 
Zwicky & Zwicky 1971). As Figure 4.67 shows, these systems often 
involve jets, tails, or ringlike structures, but are extremely varied in form; 
what they have in common is that they don’t conform to any set pattern. 
An important clue to the nature of funny-looking galaxies was provided by 
Larson & Tinsley (1978), who showed that they are more widely distributed 
in the (B − V, U − B) plane than normal galaxies (Figure 7-19 of BT). In §5.4.2 
we shall see that these galaxies tend to be blue and interpret this as evidence 
that these systems have formed significant numbers of stars within the last several Gyr. 
Figure 4.67 A page from the atlas of Arp, Madore & Roberton (1987) reproduced courtesy 
of H. Arp. 


Many of the features such as tails and ringlike structures visible in Fig- 
ure 4.67 can be convincingly explained in terms of colliding and/or merging 
galaxies — see Schweizer (1990) and Barnes & Hernquist (1992). It is likely 
that many galaxies have, at one time or another, suffered a catastrophic col- 
lision of this type (Toomre & Toomre 1972), so developing an understanding 
Figure 4.68 The main photograph is a deep image of the galaxy NGC 7252. Two tails 
are conspicuous and the main body of the galaxy is seen to be highly irregular. The insert 
at top left shows a much shallower image of the galaxy on the same scale. The central part 
of the galaxy appears as smooth as an elliptical galaxy. [After Schweizer (1982) courtesy 
of F. Schweizer] 


of what happens when galaxies collide and merge is a central rather than a 
peripheral branch of galactic astronomy. 

A convincing demonstration that when two large spiral galaxies merge 
an elliptical is formed was provided by Schweizer (1982), who studied the 
peculiar galaxy NGC 7252 shown in Figure 4.68. Two enormous tails of 
luminosity stick out of the main body of the galaxy. Dynamical simulations 
clearly demonstrate that each of these has formed by tidal stretching of a 
galactic disk (e.g. Barnes & Hernquist 1992). So we can be pretty sure that 
NGC 7252 represents the wreckage of two large disk galaxies. In Figure 4.68 
the main body of the galaxy is clearly highly disturbed, but Schweizer showed 
that near the center of this blob the radial luminosity profile obeys the R^{1/4} 
law rather accurately. So it looks as if NGC 7252 will in time settle down to 
being a normal elliptical galaxy. 

A rather general dynamical argument indicates that an elliptical galaxy 
cannot form from the merger of two purely stellar disks. The essence of this 
argument is that in a giant elliptical galaxy stars are more tightly packed 
together than in a spiral, and the tightness of packing of the stars can only 
decrease during a merger. Hence the formation of a giant elliptical out of two 
disk galaxies requires the participation of gas as well as stars — gas can pack 
down during a merger. In fact, simulations indicate that during a merger a 
large fraction of the progenitor galaxies’ gas rushes headlong to the center 
of the merger remnant. The simulations cannot tell us what it does when it 
gets there, but three observations indicate that a substantial fraction of it is 
turned into stars. 

First, there is Schweizer’s demonstration that the center of NGC 7252 
obeys the R1/4 law, as well it might if sufficient new stars can form. Second, 
the spectra of suspected merger remnants frequently display Balmer absorp- 
tion, which is characteristic of A stars (Table 3.1). A stars have masses in 
the range 2-3 $M_\odot$ (Table 3.13) and have lifetimes < 1 Gyr (Table 5.2), 
so galaxies with Balmer absorption have formed stars recently. Third, in 
the 1980s the ‘IRAS’ satellite — see §8.1.6 for details — discovered that some 
galaxies have extremely large infrared luminosities. The infrared radiation 
observed from these IR-luminous systems is believed to be energy originally 
radiated by young stars that has been absorbed and then reradiated by dust 
grains (see §8.1.6). The most luminous of these objects must have converted 
much of their interstellar gas into luminous stars fairly recently, and will 
fade markedly within a Gyr as these stars burn out. Since it is found that 
galaxies that appear to be interacting are likely to be IR-luminous (Joseph 
1990), it is natural to conclude that bursts of star formation are induced by 
interactions. 


4.6.2 Systems with active galactic nuclei 


Many abnormal galaxies contain peculiar point-like sources at their centers. 
These sources are called active galactic nuclei or AGN for short. An AGN 
can be so bright that it outshines the entire surrounding galaxy. Yet AGN 
must be physically small because their huge luminosities frequently change 
dramatically in less than a year: even if the luminosity density throughout 
an extended object of size a were to double simultaneously, the observed 
brightness of the object would only adjust to its new level over a time that is 
comparable to the time $\tau \sim a/c$ that it takes light to pass from the back to 
the front of the source. Thus, the observed rapid variations in the brightness 
of AGN imply that their emitting regions are significantly less than a parsec 
across. These bizarre objects, which can pack an entire galaxy’s worth of 
luminosity into less than the space between the sun and its nearest stellar 
neighbor, provide a fascinating field of astronomical study. The details of 
our understanding of their physics lie beyond the scope of this text, and the 
interested reader is referred to the book by Robson (1996) for an introduction 
to the subject, the AGN section of Osterbrock (1989) for a more quantitative 
discussion, and the paper by Antonucci (1993) for a thorough review. Here, 
we describe only the bare bones of what has become known as the AGN 
paradigm and its associated taxonomy. 
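
The light-travel-time argument above can be made concrete with a little arithmetic. The following is a minimal sketch, not part of the original argument, that evaluates the bound a ≲ c t_var for a few illustrative variability timescales. The function name and the rounded physical constants are assumptions chosen for the illustration.

```python
# Light-travel-time bound on the size of a variable source: a <~ c * t_var.
# The constants below are rounded values (assumptions for this illustration).
C_KM_S = 299_792.458          # speed of light in km/s
SECONDS_PER_YEAR = 3.156e7    # approximate length of a year in seconds
KM_PER_PARSEC = 3.0857e13     # kilometres in one parsec


def max_source_size_pc(t_var_years: float) -> float:
    """Upper limit on the source size (pc) for a variability timescale in years."""
    return C_KM_S * t_var_years * SECONDS_PER_YEAR / KM_PER_PARSEC


if __name__ == "__main__":
    # Illustrative timescales: one year, one month, one day.
    for t_var in (1.0, 1.0 / 12.0, 1.0 / 365.0):
        size = max_source_size_pc(t_var)
        print(f"t_var = {t_var:.4f} yr  ->  a < {size:.4f} pc")
```

For a timescale of one year the bound is about 0.3 pc, which is smaller than the roughly 1.3 pc that separates the sun from its nearest stellar neighbor.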

Carl Seyfert (1943) discovered the first class of systems with AGN. A 
Seyfert galaxy is a galaxy whose nucleus shows strong, fairly broad emis- 
sion lines of high excitation. There are two types of Seyfert galaxy. The 
spectra of type 2 Seyferts contain both permitted and forbidden lines (see 
Box 8.1) that appear to be broadened by Doppler velocities of the order of 
500 km s$^{-1}$. Type 1 Seyferts differ from type 2 Seyferts in that their 
permitted spectral lines have very broad wings, suggesting that they are Doppler 
broadened by velocities of 1000-5000 km s$^{-1}$.$^{26}$ Both the forbidden lines 
and the cores of the permitted lines in the spectra of type 1 Seyferts are of 
similar width to the lines in the spectra of type 2 Seyferts. About one-third 
of the Seyferts are of type 2. Seyferts of both types usually show strong, 
variable X-ray emission and emit strongly in the infrared. 

Since Seyfert’s pioneering work, a whole zoo of AGN types have been 
classified. As with the normal galaxies, the classification schemes fall short 
of the ideal, since they are often non-unique and somewhat subjective. For 
example, Markarian et al. (1978) made a wide-area survey which picked out 
galaxies which emitted more strongly in the ultraviolet than would be ex- 
pected from their optical emission if their luminosity were dominated by 
stars. We now know that the ultraviolet excesses of many of these Markarian 
galaxies are emitted by an AGN, so it is not surprising that 10% of 
Markarian galaxies are also classified as Seyfert galaxies. 

Many galaxies have nuclei that resemble those of type 2 Seyferts except 
that their forbidden lines tend to arise in less highly ionized atoms. These 
low-ionization nuclear emission-line regions, or LINERs, are found 
at the centers of some 80% of Sa and Sb galaxies, and in smaller proportions 
of Sc galaxies and ellipticals. LINERs were originally treated as a distinct 
phenomenon, but they are now thought to represent the low-luminosity tail 
of the Seyfert phenomenon. 

Radio telescopes have proved powerful detectors of AGN. The angular 
resolution of early radio telescopes — which had single-dish radio antennae 
— was low with the consequence that they could not give accurate positions 
for the sources they detected. The nomenclature adopted for the sources 
detected in early studies reflects the limited spatial resolution of the ob- 
servations: for example, the first discrete source of radio emission detected 
outside the solar system was initially only localized to somewhere within the 
constellation Cygnus, and so it was designated Cygnus A. With the devel- 
opment of radio interferometers the angular resolution of radio observations 
improved dramatically. In particular a position for Cygnus A became avail- 
able that allowed Baade & Minkowski (1954) to identify the source with 


26 Seyferts which display only very weak broad components are sometimes given such 
fractional designations as Seyfert 1.5 — see Osterbrock (1989) for a thorough discussion of 
these type designations. 

an optically-peculiar elliptical galaxy. As radio telescopes gained sensitivity, 
more and more extragalactic radio sources were discovered: the most 
famous study was the third survey carried out at Cambridge [designated the 
3C catalog; Edge et al. (1959)], which covered the northern hemisphere at a 
frequency of 177 MHz and turned up some 471 radio sources brighter than 
9 Jy; its successor, the 4C catalog, contains almost 5000 sources (Pilkington 
& Scott 1965, Gower, Scott & Wills 1967).$^{27}$ Over time almost all of the 
objects detected in the 3C catalog were unambiguously identified with optical 
sources. Some are associated with Galactic sources, particularly supernova 
remnants, but the majority are extragalactic in origin, and a large fraction 
are at redshifts in excess of z = 0.5. Some two-thirds of the bright extra- 
galactic radio sources are associated with what appear to be more-or-less 
normal elliptical galaxies (Spinrad et al. 1985), and these objects are known 
generically as radio galaxies. 

Radio interferometers have been able to resolve structure in most radio 
galaxies. In addition to a nuclear compact source, nearly all radio galaxies 
possess two amorphous regions of radio brightness, or radio lobes. These 
lobes are often placed roughly symmetrically on opposite sides of the nu- 
cleus and hundreds to millions of parsecs from it. At low radio frequencies 
($\lesssim 1.4$ GHz) the lobes are most prominent and the central source is not 
always detected. Conversely, at high frequencies the nuclear source stands out 
prominently. High-quality radio images reveal that the nuclear source is of- 
ten connected to one or both lobes by a thin structure that is called a radio 
jet. In favorable cases jets can be traced over several decades in radius and 
remain narrow and remarkably straight through most of their length (see 
Figure 4.70). Occasionally, for example in the giant E0 galaxy M87 (which 
is identified with the radio source 3C 274), a jet is also visible at optical 
frequencies. 

Within a lobe the surface brightness usually peaks at a well defined 
hot spot. On the basis of the relative location of these hot spots, Fanaroff 
& Riley (1974) divided radio galaxies into FR I and FR II sources. The 
distance between the hot spots of FR I sources is less than half of the 
maximum diameter of the source, while those of FR II sources are separated 
by more than half the maximum diameter of the source. Very roughly, the 
hot spots of an FR II source lie on the outside of the system, while those of 
an FR I source do not. Fanaroff & Riley observed that FR I sources tend to 
have $P_{1.4} \lesssim 10^{24.5}$ W Hz$^{-1}$, where $P_{1.4}$ is radio power at 1.4 GHz, while FR II 
sources tend to have $P_{1.4} \gtrsim 10^{24.5}$ W Hz$^{-1}$. Owen & Ledlow (1994) showed 
that FR I and FR II sources occupy distinct regions of the ($L_R$, $P_{1.4}$) plane; 
FR I sources occupy the region that corresponds to large R-band luminosities 
and relatively small radio powers, and the FR II the other half of the plane 

27 Somewhat confusingly, the 3C Catalog is arranged in right ascension order, so that 
the source 3C 279 lies to the east of 3C 278, while the 4C catalog is sorted by declination, 
so the source 4C 34.16 is the sixteenth source at a declination of 34 degrees. 




Figure 4.69 The Fanaroff & Riley (1974) morphological classification of 
radio sources leads to a sharp division of these sources in a plot of radio 
luminosity at 1.4 GHz against R-band luminosity. FR I sources are plotted 
as filled triangles and FR II sources as open squares. [After Ledlow & Owen 
(1996) from data kindly supplied by M. Ledlow] 
[Plot axes: $P_{1.4}$ (vertical) against $M_R$ (horizontal).] 


Figure 4.70 Image showing the emission from the radio galaxy 
Cygnus A at 0.06 m as mapped by the VLA. Note the narrow, 
straight jet joining the central unresolved emission to the large-scale 
radio lobes. [After Carilli & Barthel (1996) using data kindly pro- 
vided by C. Carilli and R. Perley] 


(see Figure 4.69). Like Seyferts, radio galaxies can be divided on the basis 
of their optical spectra into narrow-line radio galaxies which emit only 
the narrow emission lines characteristic of type 2 Seyferts, and broad-line 
radio galaxies which also emit the broad lines seen in the spectra of type 
1 Seyferts. 
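
The morphological criterion of Fanaroff & Riley described above reduces to a comparison of two lengths. The sketch below is purely illustrative: the function name, the argument names and the handling of the boundary case (a ratio of exactly one half, which the criterion as stated leaves undefined) are assumptions, not part of the published scheme.

```python
def fanaroff_riley_class(hot_spot_separation: float, max_diameter: float) -> str:
    """Classify a radio galaxy from the separation of its hot spots.

    Both lengths must be in the same units (for example arcseconds or kpc).
    A ratio below one half gives "FR I"; otherwise "FR II" is returned.
    Treating a ratio of exactly 0.5 as FR II is an arbitrary choice made here.
    """
    if max_diameter <= 0 or hot_spot_separation < 0:
        raise ValueError("lengths must be non-negative and the diameter positive")
    if hot_spot_separation > max_diameter:
        raise ValueError("hot spots cannot be further apart than the source is wide")
    ratio = hot_spot_separation / max_diameter
    return "FR I" if ratio < 0.5 else "FR II"


if __name__ == "__main__":
    # Hypothetical measurements, in arbitrary but consistent units.
    print(fanaroff_riley_class(30.0, 100.0))   # hot spots close to the nucleus
    print(fanaroff_riley_class(90.0, 100.0))   # hot spots near the outer edges
```

In practice the classification is made from radio maps and is sensitive to resolution and sensitivity, so a hard threshold of this kind is only a caricature of the procedure.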

The huge power of the radio lobes is believed to flow from the AGN 
along the narrow channels that are defined by the jets. Within the jets, 
energy is stored as the ordered kinetic energy of bulk flow. The observed 
radio waves are emitted by the synchrotron process (see §8.1.4) after this 
energy has been converted by shocks into the random motions of electrons 
and ions. This crucial disordering of the kinetic energy that is emitted by 
the AGN occurs in part within the jets but is believed to be concentrated 
in the hot spots. Models of radio lobes show that enormous quantities of 
random kinetic energy must be stored within the lobes in plasma that has 
passed through the hot spots. 

Although most radio galaxies appear to be normal ellipticals when ob- 
served at optical wavelengths, approximately 10% contain exceptionally- 
bright central nuclei (Spinrad et al. 1985). Galaxies that contain such bright 
central nuclei (which also include Seyferts) have been given the generic title 
of N galaxies (Morgan 1972). 

At optical wavelengths the locations of some radio sources are marked 
by an unresolved point source rather than an elliptical galaxy. It was initially 
believed that these quasi-stellar radio sources, or quasars for short, were 
some strange new kind of star within the Milky Way. Their spectra, however, 
contain broad, strong emission features, totally unlike the spectral lines of 
normal stars. Schmidt (1963) conclusively demonstrated that quasars are 
extragalactic by identifying such features in the spectrum of 3C 273 with 
emission lines, similar to those seen in type 1 Seyferts, but redshifted by 
$z = \Delta\lambda/\lambda = 0.16$. Once this breakthrough was made, the spectra of other 
quasars were recognized as highly redshifted. The huge Doppler velocities 
implied by these redshifts strongly suggest that quasars do not lie within 
the Galaxy since they are moving much too fast to be confined by the Gal- 
axy’s gravitational field. Furthermore, if they were being shot out from the 
Galactic center, we would expect to see some blueshifted spectra and none 
has ever been found. 
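
To see how large the implied velocities are, the redshift of 3C 273 can be converted into an observed wavelength and an equivalent Doppler velocity. The sketch below is an illustration under stated assumptions: it uses the standard special-relativistic Doppler relation $1 + z = \sqrt{(1+\beta)/(1-\beta)}$, which is not spelled out in the text, and takes the rest wavelength of Hβ to be about 4861.3 Å. The function names are hypothetical.

```python
C_KM_S = 299_792.458  # speed of light in km/s


def observed_wavelength(rest_wavelength: float, z: float) -> float:
    """Observed wavelength for a line at rest_wavelength seen at redshift z."""
    return rest_wavelength * (1.0 + z)


def doppler_velocity_km_s(z: float) -> float:
    """Recession velocity if z is read as a special-relativistic Doppler shift."""
    ratio = (1.0 + z) ** 2
    return C_KM_S * (ratio - 1.0) / (ratio + 1.0)


if __name__ == "__main__":
    z_3c273 = 0.16          # redshift quoted in the text for 3C 273
    h_beta_rest = 4861.3    # approximate rest wavelength of H-beta in angstroms
    print(f"H-beta observed at {observed_wavelength(h_beta_rest, z_3c273):.1f} A")
    print(f"Doppler velocity ~ {doppler_velocity_km_s(z_3c273):.0f} km/s")
```

The resulting velocity is of order 44000 km/s, roughly a hundred times larger than the few hundred km/s typical of stellar motions in the Galaxy, which is why a Galactic origin for such objects is implausible.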

The similarity between the spectra of quasars and those of type 1 
Seyferts suggested a more natural interpretation of quasars: they are dis- 
tant active galaxies whose large redshifts simply reflect the cosmological ex- 
pansion of the Universe (see §7.1). They are so distant that their optical 
angular sizes are comparable to the regions over which the light of their nu- 
clei is spread by seeing (§4.2.2). Consequently, the brilliance of their nuclei 
completely swamps their stellar light and from the ground they appear to be 
unresolved point sources.$^{28}$ 

Radio astronomy has also played a key role in identifying a further class 
of AGN, ‘BL Lac objects’, or ‘BL Lacs’. BL Lacertae is an unresolved optical 
point source which can vary in luminosity by an order of magnitude in less 
than a month. It was originally believed that this object was some new 
kind of extremely variable star — hence its stellar name. Early spectra of 
BL Lacertae showed a non-thermal power-law spectrum that was devoid of 
spectral lines in either absorption or emission. Hence the heliocentric velocity 
of the object could not be deduced from these spectra. Radio observations, 
however, suggested that it is an AGN by showing that BL Lacertae is a 
strange compact radio source which varies rapidly and is strongly linearly 
polarized. This linear polarization is associated with large Faraday rotation 
(§8.1.5), and this suggests that BL Lacertae lies outside the Milky Way. 
Adams (1974) finally proved that BL Lac is indeed an AGN by showing that 


28 Actually 3C 273 has an extended optical image, but the extended structure is due 
to a jet similar to that seen in M87. High-resolution radio observations also show this jet. 

the bright point source is surrounded by a faint nebulosity whose brightness 
distribution and color are consistent with its being an elliptical galaxy. The 
spectrum of this nebulosity is that of a normal elliptical galaxy at a redshift 
of ~ 0.07 (Miller, French & Hawley 1978). AGN which like BL Lacertae 
combine rapid variability, a non-thermal optical spectrum and strong radio 
emission are called BL Lacs. 

BL Lacs are now frequently bracketed with another class of AGN, the 
optically violently variable quasars, or OVVs for short. These objects 
share many of the properties of BL Lacs: they are strong radio emitters; 
they can vary in brightness by large factors on timescales of weeks; and 
their radio and optical emissions are strongly polarized. The distinguishing 
factor between OVVs and BL Lacs is that OVVs possess the broad optical 
emission lines that are characteristic of quasars. However, the distinction 
between the two classes is made less than clear-cut by the variability of all 
these objects: an object’s emission lines may stand out clearly when the 
underlying continuum is faint, but be almost invisible when the continuum 
is bright. Hence at different times a single object may be classified as an 
OVV or as a BL Lac. The two types of objects are therefore often grouped 
into a single category known as blazars. 

A further class of powerful AGN comprises the quasi-stellar objects 
or QSOs, which have the same observable properties as quasars except that 
they are not strong radio sources.$^{29}$ These radio-quiet AGN are very similar 
to type 1 Seyferts, except that their nuclei are optically so bright that they 
outshine the surrounding galaxy. Thus, the distinction between these two 
classes is somewhat artificial, but it is customary to describe objects with 
absolute magnitudes brighter than $M_V \sim -23$ as QSOs, and fainter 
radio-quiet AGN as Seyferts. 

Since QSOs have a stellar appearance and no radio signature, distin- 
guishing them from stars is difficult. However, the non-thermal continua of 
AGN do not cut off in the ultraviolet as stellar spectra do, so QSOs stand out 
against stars by their very blue colors.$^{30}$ Further, the broad emission lines of 
QSOs are apparent in even poor-quality spectra, so QSOs can be identified 
in objective prism surveys, which disperse the light of each object in a 
field into a low-dispersion spectrum. Finally, QSOs, like quasars, are strong 
emitters at X-ray wavelengths, so surveys carried out with X-ray satellites 
such as ROSAT have proved a fruitful method for identifying QSOs. By 
gathering the sources detected by these various techniques, large catalogs 
of QSOs have been produced: Hewitt & Burbidge (1993) published a list 
of 7315 objects detected by one or more of these methods. Such studies 
have shown that most powerful AGN are not strong radio sources: for every 


29 There is a certain amount of confusion in the nomenclature of these objects: some 
authors refer to all bright AGN as quasars. The original acronym definition of quasars is 
then ignored, and such oxymoronic terms as “radio-quiet quasars” appear. 

30 At redshifts z > 2.5, quasars become faint in the U band on account of Ly$\alpha$ and 
Lyman continuum absorption. Hence high-redshift quasars do not have blue optical colors. 

quasar, there are perhaps twenty QSOs. Surveys also confirm the rarity of 
BL Lacs: only ~ 1% of the objects in the Hewitt & Burbidge (1993) catalog 
are classified as BL Lacs. However, some care must be taken in interpreting 
such statistics, since BL Lacs are even harder to identify than QSOs, so they 
may be under-represented in catalogs of identified objects. 


4.6.3 Host galaxies of AGN 


Since this book is primarily concerned with the large-scale structure of galax- 
ies, we now turn to the morphologies of the galaxies that host AGN. In the 
case of nearby low-power AGN such as LINERs and Seyfert nuclei, it is 
straightforward to produce images of the surrounding galaxies. Although 
some lower-powered LINERs reside in the centers of elliptical galaxies, the 
majority of these systems appear to be fairly normal spiral galaxies, with the 
usual complement of bulge, disk, and, in some cases, a bar. Radio galaxies 
are also relatively straightforward to image, and almost without exception 
they are found to be elliptical galaxies, although in many cases their mor- 
phologies are somewhat disturbed. 

The host galaxy of a quasar or a QSO is much harder to image, since 
it is obscured by the glare of the central point source. However, several 
technological advances have enabled astronomers to image the host galaxies 
of even powerful AGN — these advances include the introduction of CCDs 
and infrared arrays, the limitation of the degradation of ground-based ob- 
servations by seeing, and especially the refurbishment of the HST. Quasars 
all seem to be hosted by giant elliptical galaxies. Consequently they repre- 
sent the luminous end of the radio-galaxy phenomenon. The morphologies 
of the QSO host galaxies are less clear-cut: although some of these galaxies 
are giant ellipticals, others are probably disk galaxies since their luminosity 
profiles are best described by the exponential law (4.20). Thus it is not clear 
whether QSOs are more closely related to quasars or Seyferts. There are in- 
dications that the host galaxies of some luminous AGN have been disturbed 
by interactions with other galaxies: imaging with HST has revealed multiple 
nuclei, close companions and tidal tails in several systems (see Figure 4.71). 

A reasonably consistent picture of AGN host galaxies is beginning to 
emerge, but the number of objects studied is still small and by no means 
all issues are resolved. To every apparent rule there are exceptions: for 
example, the vast majority of BL Lacs, like other radio-bright AGN, ap- 
pear to be hosted by elliptical galaxies, but an HST image of one BL 
Lac, PKS1413+135, reveals that it is located in an edge-on spiral galaxy 
(McHardy et al. 1994). Even more unexpectedly, a V-band study of eight 
nearby bright quasars and QSOs with HST revealed the host galaxies of only 
three of these objects — no stellar light could be detected in the other five 
systems (Bahcall, Kirhakos & Schneider 1995). These observations of apparently 
“naked” AGN imply that any host galaxies must be significantly 

Figure 4.71 An HST image of the 
quasar PKS 2349-014. Note the 
strongly disturbed morphology of the 
diffuse host galaxy light surround- 
ing the bright point-like quasar, and 
the large number of nearby galaxies. 
[Figure courtesy of STScI] 


fainter than average field galaxies. A subsequent study by Disney et al. 
(1995) based on R-band HST imaging was more successful in detecting QSO 
host galaxies, and concluded that these objects are found in elliptical galaxies 
of average brightness. However, since the objects observed were amongst the 
brightest nearby AGN, yet are housed in galaxies of no more than 
average brightness, it seems that there is no direct correspondence between 
the luminosity of an AGN and the luminosity of its host galaxy. 


4.6.4 The unified model of AGN 


While it is in principle possible that the different types of AGN that we 
have encountered are physically unrelated objects, it is now generally ac- 
cepted that they are physically very similar objects and that the differences 
in their observational properties may be largely explained by three factors: 
(i) some objects are significantly more luminous than others; (ii) some objects 
produce jets, while others don’t; (iii) a given AGN can have very different 
observational properties depending on the direction from which one views it. 
This last chameleon-like property of AGN reflects their strongly anisotropic 
radiation patterns. 

Direct evidence that the radiation pattern of AGNs is anisotropic comes 
from the discovery near some active galaxies of wedge-shaped regions of 
photo-excited gas (see Figure 4.72). The sharp edges of these wedges are 
interpreted as the boundaries between gas that is exposed to ionizing radia- 
tion and gas that is shielded from it. The wedge-shaped geometry suggests 
that the ionizing radiation is being beamed out from an AGN in a cone. 

Figure 4.73 illustrates the geometry inferred from these observations. 
A powerful energy source, probably a supermassive black hole, lies at the 
center of a torus of gas and dust. The source’s radiation is absorbed by 
the torus in all directions except along a cone around the torus’s symmetry 

Figure 4.72 HST image showing 
the emission from ionized material 
in the core of the Seyfert 2 gal- 
axy NGC 1068. [Figure courtesy 
of STScI] 


[Figure 4.73 diagram; labels: central engine, broad-line region cloud, 
narrow-line region cloud] 

Figure 4.73 Schematic cross-section of the geometry invoked by 
the unified AGN model. The high velocity, high density clouds 
in the broad-line region are obscured unless viewed from close to 
the axis of symmetry. The low velocity, low density clouds in the 
narrow-line region, which are ionized by the radiation from the 
central engine, are visible from all directions. 

axis. Physically, this cone is probably a vortex tube akin to the one that 
often forms within the plug-hole when a bath-tub is emptied. When we can 
see down the vortex tube, we classify the system as a type 1 Seyfert or a 
QSO depending on luminosity because we see dense, fast-moving gas that is 
located in the broad-line region close to the AGN — this gas is responsible 
for the broad permitted lines in the object’s spectrum. When we cannot see 
down the vortex tube, we classify the object as a type 2 Seyfert because in 
this case the spectrum is dominated by emission from low-density, relatively 
slowly moving, photo-excited gas that lies in the narrow-line region at 
some distance from the central energy source, as shown in Figure 4.73. 

This picture gained enormously in credibility when Miller & Antonucci 
(1983) obtained a spectrum for the component of the light of NGC 1068 which 
is polarized with its electric vector perpendicular to the long-axis of the cone 
of emission. They found that the permitted emission lines in this polarized 
spectrum have broad wings similar to those seen in a type 1 Seyfert. The 
significance of this result is the following: since NGC 1068 is a type 2 Seyfert, 
it should be oriented such that we cannot see the region within which the 
wings of broad emission lines are formed. However, photons emitted deep 
in the vortex can pass up it and then be scattered into our line of sight 
by free electrons that lie within our field of view. In order to be scattered 
in this way, light must be polarized with its electric vector perpendicular 
to the plane containing our line of sight and the direction to the original 
point of emission. Thus, by measuring the spectrum of the component of the 
radiation of NGC 1068 that is polarized in this direction, Miller & Antonucci 
were picking out precisely light that had been scattered up from deep in the 
vortex. The unified model therefore successfully predicted that this spectrum 
should contain the emission from the broad-line region that is usually hidden 
in type 2 Seyferts. 

Unification in terms of orientation also accounts successfully for the phe- 
nomenology of radio-loud AGN. We imagine radio jets emerging along the 
vortex tube shown in Figure 4.73. When a jet is pointing towards us we clas- 
sify the system as a blazar because we then see to very small radii, deep in 
the core of the AGN, and the observed brightness can vary rapidly. Near the 
central engine jets move at relativistic speeds. In a blazar this motion is al- 
most straight toward us with the result that synchrotron emission is Doppler 
shifted to markedly higher frequencies. Domination of the optical spectrum 
by such Doppler-boosted synchrotron radiation explains the featureless op- 
tical continua of BL Lacs. When our line of sight does not pass right down 
the jet, our picture of the galaxy is dominated by the radio lobes and we 
classify the system as a radio galaxy. The comparative rarity of blazars is 
explained by the small fraction of lines of sight that pass right down a jet. 
Narrow-line radio galaxies are interpreted as systems that are seen rather 
nearly edge-on so we do not see far down the vortex tube, while broad-line 
radio galaxies are systems that are seen sufficiently pole-on that we see deep 

Figure 4.74 Very high dynamic range radio map of the blazar 
3C 371 obtained using the VLA. Note the faint extended emission, 
and its limited angular extent (which amounts to only ~ 40 kpc 
at the distance of this source). [Reproduced from Wrobel & Lind 
(1990)] 


down the vortex tube, although not so precisely pole on that we look right 
along the jet. 
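
Two quantitative ingredients of this picture are easy to evaluate: the Doppler shift of emission from a relativistic jet, and the fraction of random lines of sight that pass close to a jet axis. The sketch below is illustrative only. It assumes the standard special-relativistic Doppler factor $\delta = 1/[\gamma(1 - \beta\cos\theta)]$, which is not given in the text, and it assumes two oppositely directed jets, so that the fraction of sight lines within an angle $\theta$ of either jet is $1 - \cos\theta$. The jet speed and the angles used are hypothetical values.

```python
import math


def doppler_factor(beta: float, theta_deg: float) -> float:
    """Ratio of observed to emitted frequency for bulk speed beta = v/c.

    theta_deg is the angle between the jet velocity and the line of sight.
    """
    if not 0.0 <= beta < 1.0:
        raise ValueError("beta must satisfy 0 <= beta < 1")
    gamma = 1.0 / math.sqrt(1.0 - beta * beta)
    return 1.0 / (gamma * (1.0 - beta * math.cos(math.radians(theta_deg))))


def sightline_fraction(theta_deg: float) -> float:
    """Fraction of random sight lines within theta of either of two opposite jets."""
    return 1.0 - math.cos(math.radians(theta_deg))


if __name__ == "__main__":
    beta = 0.99  # hypothetical jet speed in units of c
    for theta in (0.0, 10.0, 45.0, 90.0):
        print(f"theta = {theta:5.1f} deg  delta = {doppler_factor(beta, theta):7.3f}")
    print(f"fraction of sight lines within 10 deg: {sightline_fraction(10.0):.4f}")
```

With these assumed numbers, emission seen straight down the jet is shifted up in frequency by a factor of about 14, while emission seen from the side is shifted down, and only about 1.5% of randomly oriented systems would be viewed within 10 degrees of a jet.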

We can test the unified model of radio-loud AGN observationally: if 
blazars are nearly end-on radio galaxies, they should have radio lobes but 
in projection these should lie closer to the nucleus than they do in normal 
radio galaxies. In fact, in many cases one would expect a projected lobe to 
surround the nucleus. Since the diffuse radiation from lobes should come 
from plasma that is approximately stationary in the galaxy’s rest frame, it 
should be fairly isotropic. Hence the lobes of blazars should be of comparable 
luminosity to those of normal radio galaxies. Observationally, the detection 
of diffuse radio emission against the glare of the nucleus is far from trivial, 
however, just as it is difficult to image QSO host galaxies optically. None the 
less, Figure 4.74 shows that the blazar 3C 371 does possess diffuse emission of 
limited angular extent, just as the unified model predicts. Moreover, in this 
and other blazars the total power associated with diffuse emission is com- 
parable to that observed from the lobes of normal radio galaxies. Thus the 
data are consistent with the distinction between blazars and radio galaxies 
being purely one of orientation. 

The unified model predicts that the mid-infrared and hard X-ray emis- 
sions of an AGN should be rather isotropic, because the jet should not 
dominate such radiation and the accretion torus, which obscures the cen- 
tral source at optical wavelengths, should be transparent to hard X-rays and 
mid-infrared radiation. Since hard X-ray and mid-infrared luminosities differ 
by orders of magnitude between the weakest and strongest AGN, it is clear 
that AGN differ in intrinsic luminosity as well as in orientation. 

The power emitted by an AGN is believed to be derived from the 
accretion of matter on to a black hole. For a black hole of mass $M_{\rm BH}$ an 
approximate upper limit on this power is given by the Eddington 
luminosity. This is (see Problem 4.12) 

$$L_{\rm E} = 3 \times 10^{4} \left(\frac{M_{\rm BH}}{M_\odot}\right) L_\odot. \qquad (4.51)$$


For example, a bright QSO with a luminosity of $\sim 10^{12}\,L_\odot$ requires a black 
hole of mass $\gtrsim 3 \times 10^{7}\,M_\odot$. One possible explanation for the range of AGN 
luminosities is that AGN are powered by black holes which span a wide range 
of masses and are radiating at close to their respective Eddington limits. 
However, since the luminosity of an individual AGN can vary by orders 
of magnitude on timescales of days to months, and there is no mechanism 
by which the responsible black hole can vary its mass in a corresponding 
manner, the mass of the black hole cannot be the sole factor which dictates 
the luminosity of the AGN. 
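
The numbers quoted above can be checked with a short calculation. The sketch below is not the derivation asked for in Problem 4.12; it simply evaluates the standard expression $L_{\rm E} = 4\pi G M m_p c/\sigma_T$, which assumes fully ionized hydrogen and Thomson scattering, using rounded SI constants. The function names and the constant values are assumptions made for the illustration.

```python
import math

# Rounded SI constants (assumed values for this illustration).
G = 6.674e-11            # gravitational constant, m^3 kg^-1 s^-2
M_PROTON = 1.6726e-27    # proton mass, kg
C = 2.998e8              # speed of light, m/s
SIGMA_T = 6.652e-29      # Thomson cross-section, m^2
M_SUN = 1.989e30         # solar mass, kg
L_SUN = 3.828e26         # solar luminosity, W


def eddington_luminosity_lsun(mass_msun: float) -> float:
    """Eddington luminosity in solar luminosities for a mass in solar masses."""
    l_edd_watts = 4.0 * math.pi * G * mass_msun * M_SUN * M_PROTON * C / SIGMA_T
    return l_edd_watts / L_SUN


def minimum_mass_msun(luminosity_lsun: float) -> float:
    """Smallest black-hole mass (solar masses) able to sustain a given luminosity."""
    return luminosity_lsun / eddington_luminosity_lsun(1.0)


if __name__ == "__main__":
    print(f"L_E per solar mass ~ {eddington_luminosity_lsun(1.0):.3g} L_sun")
    print(f"M_BH for 1e12 L_sun >~ {minimum_mass_msun(1e12):.3g} M_sun")
```

Under these assumptions the coefficient comes out at roughly $3 \times 10^{4}\,L_\odot$ per solar mass, and a luminosity of $10^{12}\,L_\odot$ requires a mass of about $3 \times 10^{7}\,M_\odot$, in line with the figures in the text.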

The primary factor which affects the luminosity produced by a given 
black hole is the rate at which it accretes material. Once a black hole has 
gobbled up everything that lies within its sphere of influence, it must shut 
down for lack of fuel. If its host galaxy now undergoes a close encounter with 
a companion galaxy, the distribution of gas and stars around the black hole 
may be radically restructured, and fresh material may enter the black 
hole’s sphere of influence. Thus, there may be a link between the brightness 
of an AGN and the length of time since its host galaxy was last gravitationally 
disturbed. Support for such a connection comes from the large fraction of QSO 
host galaxies with disturbed morphologies mentioned in §4.6.3. 

Many other questions about AGN are also as yet unresolved. Perhaps 
the biggest gap in our understanding at present is that we have no good 
explanation as to why some AGN produce radio jets while others do not. 
The fact that radio-loud AGN are generally located in elliptical galaxies 
while radio-quiet AGN tend to be found in spirals must be relevant. The 
causal factor may be the very different structures of the interstellar media of 
ellipticals and spirals (see Chapter 8). Another possibility is that the black 
holes in ellipticals have different spin properties from those in spirals and 
that this difference renders them more liable to produce jets. 


Problems 


4.1 For a Gaussian PSF, and an image that is circularly-symmetric about 
the point R = 0, show that equation (4.6) can be written 


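$$I_{\rm app}(R) = \frac{1}{2\pi\sigma^2}\int_0^\infty dR'\, R'\, I_{\rm t}(R') \exp\left(-\frac{R^2 + R'^2}{2\sigma^2}\right) \int_0^{2\pi} d\phi\, \exp\left(\frac{R R' \cos\phi}{\sigma^2}\right).$$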


Hence obtain equation (4.8). 




4.2 If the coefficient a3 in equation (4.10) is greater than zero, what does 
this tell us about the shape of the isophote? 


4.3 One way of fitting an ellipse to an isophote is to perform a least-squares 
fit of the polar equation for an ellipse, $R^{-2} = C - A\cos 2\phi - B\sin 2\phi$, to the 
points $(R_i, \phi_i)$ that lie on the isophote. Express the semi-major axis length, 
$a$, the ellipticity, $\epsilon$, and the position-angle of the major axis, $\phi_0$, in terms of 
$A$, $B$ and $C$. 


4.4 Peletier et al. (1990) measure the diskiness of an isophote by fitting an 
ellipse to it, and then writing the surface brightness as a function of angle $\phi$ 
around this ellipse as a Fourier series, 

$$
I(\phi) = I_0(a)\left(1 + \sum_{n=1}^{4} C_n \cos(n\phi) + \sum_{n=1}^{4} S_n \sin(n\phi)\right), \qquad (4.52)
$$

where $a$ is the semi-major axis of the ellipse. Assuming that all the $C_n$ and 
$S_n$ are small, show that 

$$
\frac{a_4}{a} \simeq \frac{C_4}{0.4 \ln 10\,(\mathrm{d}\mu/\mathrm{d}a)}, \qquad (4.53)
$$

where $\mu(a)$ is the galaxy’s surface brightness (measured on a magnitude scale) 
at distance $a$ down the major axis, and $a_4/a$ is the diskiness parameter as 
defined in equation (4.10). 


4.5 Using equation (4.12), show that if the luminosity density follows a 
power law $j \sim r^{-\alpha}$ then the surface brightness $I$ varies with projected radius 
$R$ as $I \sim R^{-(\alpha-1)}$. 


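4.6 Show that the luminosity $L(R)$ that is predicted by the $R^{1/4}$ law to lie 
inside projected radius $R$ is 

$$
L(R) = L(\infty)\left[1 - e^{-y}\left(1 + y + \frac{y^2}{2!} + \frac{y^3}{3!} + \frac{y^4}{4!} + \frac{y^5}{5!} + \frac{y^6}{6!} + \frac{y^7}{7!}\right)\right], \qquad (4.54)
$$

where $y = 7.67 (R/R_{\rm e})^{1/4}$. Hence verify equation (4.19), and show that the 
mean surface brightness inside $R = R_{\rm e}$ is $\langle I \rangle_{\rm e} = 3.61 I_{\rm e}$. 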


4.7 Show that the matrix A defined in Box 4.2 is M times the covariance 
matrix of the observations. Hence show that its eigenvalues, $\lambda$, are the values 
taken by the quantity S for each eigenvector n. 


4.8 An edge-on view of the fundamental plane is required with $\log \sigma_0$ plotted 
horizontally. What linear combination of $\log R_{\rm e}$ and $\langle I \rangle_{\rm e}$ should be plotted 
vertically? 

4.9 Obtain an expression for the total luminosity $L$ of the model defined by 
equations (4.44). Show that the edge-on surface brightness of this model is 

$$
I(R, z) = I(0, 0)\,\frac{R}{R_{\rm d}}\,K_1(R/R_{\rm d})\,\mathrm{sech}^2(z/2z_0), \qquad (4.55)
$$

where $K_1$ is a modified Bessel function. Determine the dependence of $I(0, 0)$ 
upon $L$ and the model’s scale lengths. 


4.10 In an isothermal distribution of particles, the particle density varies 
as $\exp(-\beta\Phi)$, where $\beta$ is the inverse temperature of the system, and $\Phi$ is 
the gravitational potential. Investigate the isothermal particle density, $n(z)$, 
when $\Phi(z)$ is generated by: (a) an infinitesimally-thin uniform-density sheet 
occupying the plane $z = 0$; and (b) an infinite medium of uniform density. 


4.11 Consider the globular cluster population of a galaxy at a distance d 
from us. Let &(m)dm be the luminosity emitted by clusters with apparent 
magnitudes in the range (m+dm,m). Given that the luminosity function of 
the globular clusters satisfies equation (4.49), show that U(m) is a Gaussian 
distribution that peaks at m; = mo — 0.9202,. Hence, show that the total 
luminosity of N; globular clusters is 


2,2 
Ly = Nvexp (7) x 107!8(4/pe) (4.56) 
™m 


Suggest how one might observationally determine m, for a globular cluster 
system so distant that mo is fainter than the limiting magnitude of cluster 
counts. 


4.12 Consider a fully ionized and optically thick cloud of material that lies 
near a point source of mass M and luminosity L. Given that the dominant 
interaction of the cloud with the radiation field is Thomson scattering by free 
electrons (§8.1.4), show that radiation pressure on the cloud exactly balances 
gravity when L equals the Eddington luminosity 


4nGMc & 
OT N.’ 


Leaa = (4.57) 


where &/N, is the mass per free electron in the cloud. Hence derive equation 
(4.51). 

Evolution of Stars and Stellar Populations 


The theory of stellar structure and evolution is fundamental for our under- 
standing of almost every aspect of star clusters and galaxies. We start this 
chapter by reviewing the understanding of the color-magnitude (CM) dia- 
gram that has emerged from the theory. Then we summarize its results 
concerning the origin of the chemical elements - where and how they were 
synthesized, and in what quantities. Finally, after describing practical tech- 
niques for determining the physical and chemical characteristics of stars from 
observations, we use our results to predict the observational characteristics 
of entire populations of stars within which the star-formation rate has some 
simple time dependence. These models will assist greatly in the interpreta- 
tion of observations of both the Milky Way and external galaxies. 

Since we only use the results of the theory of stellar structure as interpre- 
tive tools, we merely state them without detailed justification. The subject’s 
principles and detailed results are ably expounded in the monographs of 
Kippenhahn & Weigert (1990), Shapiro & Teukolsky (1983), Clayton (1968) and 
Cox & Giuli (1968), while valuable review articles include Iben (1967), Iben 
(1974) and Chiosi, Bertelli & Bressan (1992). 

5.1 Stellar evolution and the CM diagram 


In §3.5 we defined the CM diagram and saw that stars congregate near par- 
ticular points and lines of its plane. The goal of the theory of stellar evolution 
is to explain the arrangement of stars in the CM diagram and to reproduce 
certain other regularities, such as the main-sequence (MS) mass-luminosity 
relation (Table 3.9). A basic theoretical picture of stellar evolution can be 
developed by assuming that stars are spherically symmetric and in hydro- 
static equilibrium, and by writing a set of four nonlinear partial differential 
equations that describe the structure of a star as a function of radius from 
the star’s center and as a function of time. One of these equations describes 
the increase with radius r of the mass interior to r, another describes the 
increase with r of the luminosity passing through the sphere of radius r, and 
the other two equations describe the decrease with r of the pressure and 
temperature within the star. When these equations are supplemented with 
an equation of state for the star’s gas and the assumption that the luminos- 
ity is generated at the center, one can show (as was done by Russell, Vogt, 
Eddington, and others) that the structure of the star is determined by its 
mass and its chemical composition. 

One of the key factors affecting the course of stellar evolution is the 
nature of the energy-production mechanism. Early in the 20th century, it was 
believed that the energy radiated by a star was produced by the conversion 
of gravitational energy into thermal energy by a (relatively) slow contraction 
of the star. But, when it was discovered, on the basis of geophysical evidence, 
that the Earth is at least 1 Gyr old and that therefore the Sun must be at 
least this old, it was realized that the time interval over which gravitational 
energy release could sustain the Sun’s luminosity is two orders of magnitude 
too short to match the geophysical scale. The problem was solved when it 
was recognized that the required energy could be provided by thermonuclear 
fusion of light elements into heavier elements at the centers of stars. In 
particular, the MS can be identified with an episode of conversion of hydrogen 
(cosmically the most abundant element) into helium. 
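
The mismatch in timescales described above can be checked with a rough estimate. The sketch below is not taken from the text: it assumes that the gravitational (Kelvin-Helmholtz) timescale is of order G M^2 / (R L), with factors of order unity dropped, and that hydrogen burning releases about 0.7% of the rest-mass energy of the roughly 10% of the Sun's mass that is burnt on the MS. Both the 10% core fraction and the function names are illustrative assumptions.

```python
# Rough timescales for the Sun, in SI units (order-of-magnitude sketch only).
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8          # speed of light, m s^-1
M_SUN = 1.989e30     # solar mass, kg
R_SUN = 6.957e8      # solar radius, m
L_SUN = 3.828e26     # solar luminosity, W
YEAR = 3.156e7       # seconds per year


def kelvin_helmholtz_time(mass, radius, luminosity):
    """Time (s) for which contraction alone could supply the luminosity, ~ G M^2 / (R L)."""
    return G * mass**2 / (radius * luminosity)


def nuclear_time(mass, luminosity, efficiency=0.007, core_fraction=0.1):
    """Time (s) for which H burning could supply the luminosity.

    efficiency: fraction of rest mass released when H fuses to He.
    core_fraction: assumed fraction of the star's mass that is burnt.
    """
    return efficiency * core_fraction * mass * C**2 / luminosity


t_kh = kelvin_helmholtz_time(M_SUN, R_SUN, L_SUN) / YEAR
t_nuc = nuclear_time(M_SUN, L_SUN) / YEAR
print(f"Kelvin-Helmholtz time ~ {t_kh:.1e} yr")
print(f"Nuclear time          ~ {t_nuc:.1e} yr")
```

With these inputs the contraction timescale comes out at a few times 10^7 yr and the nuclear timescale at about 10^10 yr, so only the latter is compatible with the geophysical age.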

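There are two important channels for the conversion of hydrogen to 
helium. The p-p chain starts with this pair of nuclear reactions:$^1$ 

$$
\begin{aligned}
p + p &\to {}^2{\rm H} + e^+ + \nu \\
{}^2{\rm H} + p &\to {}^3{\rm He} + \gamma.
\end{aligned} \qquad (5.1)
$$

In the final stage of the p-p chain the $^3$He created by the reactions above 
is destroyed either following the collision of two $^3$He nuclei or through the 
collision of a $^3$He nucleus with a $^4$He nucleus: 

$$
\text{either}\quad {}^3{\rm He} + {}^3{\rm He} \to {}^4{\rm He} + 2p \quad\text{or}\quad
\begin{aligned}
{}^3{\rm He} + {}^4{\rm He} &\to {}^7{\rm Be} + \gamma \\
{}^7{\rm Be} + e^- &\to {}^7{\rm Li} + \nu \\
{}^7{\rm Li} + p &\to 2\,{}^4{\rm He}.
\end{aligned} \qquad (5.2)
$$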


1 See §5.2.1 for an explanation of the notation used for nuclear species. 
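
The other important channel for the conversion of hydrogen to helium 
is the CNO cycle, which involves the reactions$^2$ 

$$
\begin{aligned}
{}^{12}{\rm C} + p &\to {}^{13}{\rm N} + \gamma \\
{}^{13}{\rm N} &\to {}^{13}{\rm C} + e^+ + \nu \\
{}^{13}{\rm C} + p &\to {}^{14}{\rm N} + \gamma \\
{}^{14}{\rm N} + p &\to {}^{15}{\rm O} + \gamma \\
{}^{15}{\rm O} &\to {}^{15}{\rm N} + e^+ + \nu \\
{}^{15}{\rm N} + p &\to {}^{12}{\rm C} + {}^4{\rm He}.
\end{aligned} \qquad (5.3)
$$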


The net effect of these reactions is to convert four protons into a helium 
nucleus; the C, N, and O nuclei catalyze the reaction without themselves 
being consumed. 

The speed with which both the p-p chain and the CNO cycle convert 
hydrogen to helium is proportional to the square of the density and increases 
rapidly with the temperature within the star. The CNO cycle is the more 
temperature sensitive of the two processes, with the result that it dominates 
in the hottest stellar cores. The p-p chain is the dominant process in the 
Sun. 
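
To give a feel for what "more temperature sensitive" means, the sketch below treats each energy-generation rate as a local power law in temperature at fixed density. The exponents are assumptions, not values from the text: rough textbook-style figures of about 4 for the p-p chain and about 16 for the CNO cycle near solar-core temperatures. The true exponents vary with temperature, so this is only an illustration of the trend.

```python
def rate_ratio(t_ratio, exponent):
    """Factor by which a rate proportional to T**exponent changes
    when the temperature is multiplied by t_ratio (density held fixed)."""
    return t_ratio ** exponent


# Assumed local power-law exponents (illustrative, not from the text).
N_PP = 4.0
N_CNO = 16.0

for boost in (1.1, 1.5, 2.0):
    pp = rate_ratio(boost, N_PP)
    cno = rate_ratio(boost, N_CNO)
    print(f"T x {boost}: p-p x {pp:.3g}, CNO x {cno:.3g}, "
          f"CNO gains a further factor {cno / pp:.3g} over p-p")
```

Under these assumed exponents even a modest rise in core temperature shifts the balance strongly towards the CNO cycle, which is why it takes over in the hottest cores.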

When the equations of stellar structure are solved, zero-age, chemically 
homogeneous, H-burning stars of different masses are found to be arranged 
along the MS as observed; by changing the assumed chemical composition of 
the material, families of theoretical main sequences can be generated. While 
a star is on the MS, its luminosity is supported by the slow conversion of 
hydrogen into helium. When an appreciable fraction of the original hydro- 
gen in a star’s core has been consumed, the star moves away from the MS in 
the first phase of a series of structural changes. These reflect changes in its 
chemical composition that are produced by successive episodes of thermonu- 
clear burning. The structural changes in stars as they evolve are manifested 
in the CM diagram. Hence, by analysis of CM diagrams, we are able to infer 
significant information about stellar evolution. 

When using the results of stellar-evolution calculations, it is worthwhile 
bearing in mind that the errors and uncertainties in these calculations ac- 
cumulate from early to late evolutionary stages. The largest sources of un- 
certainty lie in the treatment of convection and mass loss. Convection is 
assumed to occur at some radius r within a star when the Schwarzschild 
criterion is satisfied at r. The Schwarzschild criterion is that the temper- 
ature gradient that is required at r for the entire heat flux to be carried by 
the outward diffusion of photons is greater than the temperature gradient 
that one would have at r if the specific entropy of the gas were constant in 


2 The last reaction in the sequence (5.3) gives only the most important process for the 
recovery of $^{12}$C from $^{15}$N; the CNO cycle can have other endings. 
the neighborhood of r. In these circumstances cells of hot gas move upwards 
and are replaced by cells of cooler gas that fall downwards. This interchange 
of gas transports energy extremely efficiently, so evolutionary calculations 
assume that in a convective region the temperature gradient is the minimum 
gradient at which convection will work, namely the adiabatic gradient. 
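
The Schwarzschild criterion can be sketched numerically. The code below is an illustration under stated assumptions rather than part of any evolutionary code: it uses the standard expression for the radiative gradient written as d ln T / d ln P, which is equivalent to comparing temperature gradients at a common pressure gradient, and it takes the adiabatic gradient to be 0.4, the value for a monatomic ideal gas. The shell properties are made-up, solar-like numbers chosen only to exercise the functions.

```python
import math

G = 6.674e-11          # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8            # speed of light, m s^-1
A_RAD = 7.566e-16      # radiation constant, J m^-3 K^-4
GRAD_AD_IDEAL = 0.4    # d ln T / d ln P at constant entropy, monatomic ideal gas


def radiative_gradient(opacity, luminosity, pressure, mass_interior, temperature):
    """d ln T / d ln P needed for photons alone to carry the luminosity (SI units)."""
    return (3.0 * opacity * luminosity * pressure
            / (16.0 * math.pi * A_RAD * C * G * mass_interior * temperature**4))


def is_convective(grad_rad, grad_ad=GRAD_AD_IDEAL):
    """Schwarzschild criterion: convection sets in where grad_rad exceeds grad_ad."""
    return grad_rad > grad_ad


# Illustrative (made-up but solar-like) values for a shell deep inside a star.
shell = dict(opacity=0.1,                  # m^2 kg^-1
             luminosity=0.5 * 3.828e26,    # W
             pressure=2.3e16,              # Pa
             mass_interior=0.1 * 1.989e30, # kg
             temperature=1.57e7)           # K
grad = radiative_gradient(**shell)
print(f"grad_rad = {grad:.2f}, convective: {is_convective(grad)}")
```

Raising the opacity or the luminosity carried through the shell raises the radiative gradient, which is how a region can tip from radiative to convective.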

In the standard phenomenological model of convection, which is known 
as mixing length theory, all rising and falling convective cells travel one 
mixing length $l$ before dissolving into the ambient medium. This 
length-scale is reduced to a dimensionless parameter $\alpha = l/H$ by dividing it by the 
local scaleheight$^3$ of the star’s atmosphere, $H$. Stellar-evolution theory is 
sensitive to the assumed value of $\alpha$ because this parameter determines the 
speed with which convection erodes gradients in chemical composition. 
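
As a rough illustration of the sizes involved, the sketch below evaluates the pressure scaleheight and the corresponding mixing length. It assumes an ideal gas in hydrostatic equilibrium, for which H = P / (rho g) = k T / (mu m_H g); the photospheric temperature, surface gravity, mean molecular weight and the value of the mixing-length parameter are all illustrative assumptions rather than calibrated values.

```python
K_B = 1.381e-23    # Boltzmann constant, J K^-1
M_H = 1.673e-27    # mass of a hydrogen atom, kg


def pressure_scale_height(temperature, gravity, mean_molecular_weight):
    """H = k T / (mu m_H g) for an ideal gas in hydrostatic equilibrium (SI units)."""
    return K_B * temperature / (mean_molecular_weight * M_H * gravity)


def mixing_length(alpha, scale_height):
    """Mixing length l = alpha * H."""
    return alpha * scale_height


# Illustrative solar-photosphere values: T ~ 5800 K, g ~ 274 m s^-2, mu ~ 1.3.
H = pressure_scale_height(5800.0, 274.0, 1.3)
print(f"H ~ {H / 1e3:.0f} km, l(alpha = 1.5) ~ {mixing_length(1.5, H) / 1e3:.0f} km")
```

With these inputs the scaleheight is of order 100 km, tiny compared with the solar radius, so a mixing length of order H is a very local quantity near the surface.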

Convective regions usually have a top and a bottom within the star, and 
these boundaries often coincide with a region of exceptionally strong chem- 
ical inhomogeneity. For example, the lower boundary may coincide with a 
shell within which nuclear burning is taking place, with the consequence that 
a particular nuclear fuel is abundant above the shell and exhausted below 
it. Now a convective zone cannot have a sharp boundary, since turbulence 
generated by rising and falling cells is bound to spill over into adjacent con- 
vectively (marginally) stable regions. There is now considerable evidence 
that such convective overshoot plays an important role in the evolution 
of stars and the results of stellar-evolution calculations are sensitive to the 
size, $\beta$, of the overshoot region (measured in units of $H$) that is assumed to 
exist. 

The standard strategy is to determine appropriate values for $\alpha$ and $\beta$ 
by tuning stellar structure calculations so that they yield the best possible 
match to the observational characteristics of the Sun and other well-observed 
stars. On account of this empirical calibration, we should not read too 
much into the ability of mixing-length theory to model the properties of 
other stars. It is also not obvious that the appropriate values of the theory’s 
parameters are the same for stars of every type; by picking a single value we 
may be distorting our predictions in a systematic way. 

Except at the extreme ends of the sequence, MS stars only contain 
small convective regions, so the uncertainties in their structure are small. 
By contrast, stars that have evolved on to the subgiant branch and beyond 
have large convective envelopes and correspondingly large uncertainties in 
their structure. 

Matter flows away from most stars in the form of a stellar wind — a 
roughly spherically symmetric flow of plasma from the surface of the star 
out into the interstellar medium. This mass loss, like convection, becomes a 
more important effect as the star evolves away from the MS, where mass loss 
tends to be very slow. At the present time it is extremely hard to predict 


3 The scaleheight is the distance $H = |\mathrm{d}\ln P/\mathrm{d}r|^{-1}$ over which the pressure $P$ declines 
by a factor $e = 2.7\ldots$ 
the rate of mass loss theoretically, and it is conventional to subject a stellar 
model to a mass-loss rate that has been estimated from observations of stars 
that are at a similar observational stage. 

Next in importance to the uncertainties introduced by convection and mass 
loss is uncertainty in the rate at which elements heavier than H settle 
towards the centers of stars: in thermal equilibrium a star would be stratified 
with the heaviest elements at the center and the lightest, namely H, floating 
on the surface. To a large extent nuclear burning establishes this sort of 
stratification naturally, but most of the He in any star is primordial, and 
starts out homogeneously mixed with H. The detailed results of evolutionary 
calculations depend on the assumed rate at which He sinks towards the cen- 
ter. The downward motion of He (helium diffusion) reduces the fraction 
of H in the core and brings forward in time the moment at which nuclear 
burning exhausts the supply of H in the core and a star turns off the MS. 

Any error that is introduced into a model at one evolutionary stage, when 
convection, mass loss or diffusion is inadequately described, clearly 
undermines the validity of the model at all subsequent evolutionary stages, 
even if no further error is introduced. 


5.1.1 Placing models in the CM diagram 


The final difficulty in comparing stellar-evolution models to observed CM di- 
agrams is the conversion of the temperature and bolometric luminosity of a 
model into a color and an absolute magnitude. There are two fundamentally 
different approaches to this task. In one approach, we assemble a sample of 
nearby stars for which effective temperatures, absolute magnitudes, metal- 
licities and colors have been observationally determined. Then we assign to 
our theoretical model the spectral properties of the star within the sample 
that has the nearest values of $M_{\rm bol}$ and $T_{\rm eff}$. In fact, by interpolating in a 
table of values of $T_{\rm eff}$ and colors (such as Table 3.7) that has been assembled 
from the observational results, we can estimate the colors of our theoretical 
star rather accurately even if our observational sample does not contain a 
precisely identical star. 
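
The interpolation step described above might look like the following minimal sketch. The calibration table here is hypothetical, with round numbers invented for illustration; it is not Table 3.7. A real calibration would also depend on luminosity class and metallicity, which this one-dimensional example ignores.

```python
from bisect import bisect_left

# Hypothetical calibration table (NOT Table 3.7): effective temperature in K
# against B-V color, sorted by increasing temperature.
TEFF_GRID = [4000.0, 5000.0, 6000.0, 7000.0, 8000.0]
BV_GRID = [1.40, 0.90, 0.58, 0.35, 0.15]


def interpolate_color(teff, teff_grid=TEFF_GRID, color_grid=BV_GRID):
    """Linearly interpolate a color at the given T_eff; refuse to extrapolate."""
    if not teff_grid[0] <= teff <= teff_grid[-1]:
        raise ValueError("T_eff lies outside the calibrated range")
    i = bisect_left(teff_grid, teff)
    if teff_grid[i] == teff:
        return color_grid[i]          # exact grid point, nothing to interpolate
    t0, t1 = teff_grid[i - 1], teff_grid[i]
    c0, c1 = color_grid[i - 1], color_grid[i]
    return c0 + (c1 - c0) * (teff - t0) / (t1 - t0)


print(f"B-V at 5500 K ~ {interpolate_color(5500.0):.2f}")
```

Refusing to extrapolate mirrors the limitation discussed in the text: outside the range covered by well-observed stars the table simply has nothing to say.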

Clearly, the procedure just outlined for determining the colors of the- 
oretical stars is viable only if a sufficient sample of well-observed stars of 
similar metallicity can be assembled. Unfortunately, this requirement effec- 
tively restricts the method to stars that do not differ too much in metallicity 
from the Sun. In particular, very metal-rich or metal-poor stars, stars that 
have markedly different abundance ratios from those in the Sun, and very 
old stars are very scarce near the Sun, although they may dominate the light 
from a distant object. Since reliable bolometric magnitudes and effective 
temperatures are not available for large samples of such stars, the spectral 
properties of their theoretical counterparts cannot be obtained by interpola- 
tion in an appropriate table of observational values. An alternative procedure 
involves tuning a stellar-atmosphere code to the star’s effective temperature 
and bolometric magnitude, and then determining the star’s colors by ‘ob- 
serving’ the resulting theoretical spectrum in the computer (Tripicco & Bell 
1995). Since the spectra of many stars contain enormous numbers of spectral 
lines, this procedure requires both a sophisticated stellar-atmosphere code 
and a mass of reliable atomic and molecular data. The cooler and more 
metal-rich the star is, the greater the number of lines, and hence the more 
formidable the theoretical problem that must be solved if accurate colors are 
to be determined in this way. 


5.1.2 Features in the CM diagram 


Let us now indicate what, according to the theory of stellar structure, is 
happening in key areas of the CM diagram (§3.5). These areas are marked 
in Figure 5.1 and summarized in Table 5.1. 

The lives of stars are governed by the same law that drives star clusters 
and galaxies: let heat flow from the inside to the outside, so that the inside 
may become denser and hotter.$^4$ As the inside of a star becomes denser and 
hotter, progressively heavier nuclei form — the nuclei at the center change 
successively from those of H to He to a mixture of C and O to Si and so on 
eventually up to Fe if the star is sufficiently massive. The energy released 
in the formation of a given nuclear species can for a while fulfill the law’s 
requirement for an outward heat flux without further compression of the inte- 
rior material. When the raw material for this process is exhausted the period 
of stasis is followed by further central contraction. Just as traffic builds up 
near road-works on a highway, so each period of steady nuclear burning is 
associated with a concentration of stars in the CM diagram. Quantitatively, 
the number of stars at each burning stage varies roughly as $\tau$, the time it 
takes to exhaust the nuclear fuel burnt at that stage. 
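
The proportionality between star counts and phase durations can be made concrete with a toy calculation. The durations below are assumed round numbers for a star of roughly solar mass, chosen for illustration only; they are not values from Table 5.2. The sketch also assumes stars of a single mass formed at a steady rate, so that the number caught in each phase scales with the time spent there.

```python
def expected_fractions(durations):
    """Fraction of stars expected in each phase if counts scale with phase duration."""
    total = sum(durations.values())
    return {phase: tau / total for phase, tau in durations.items()}


# Illustrative durations in Gyr (assumed round numbers, not from Table 5.2).
durations_gyr = {"MS": 10.0, "SGB+RGB": 1.0, "HB": 0.1}

fractions = expected_fractions(durations_gyr)
for phase, frac in fractions.items():
    print(f"{phase:8s} {frac:.3f}")
```

On these assumptions the MS holds about a hundred times as many stars as the HB, which is the sense in which long-lived phases dominate the CM diagram.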

Of the concentrations of stars in the CM diagram that are associated 
with these static periods, the most important is the MS — see Figure 5.1 
and §3.5. Here stars are burning H to He in their cores. This is the longest 
period of a star’s life because it is a period of relative fuel-economy: more 
nuclear energy will be released after leaving the MS than on it, but after 
leaving the MS a star rapidly increases in luminosity with the result that it 
rather quickly exhausts its fuel — see Figure 5.4. The position occupied by 
a star on the MS depends on its mass: more massive stars lie up and to the 
left of less massive ones — see Figure 5.2. The locus of the MS depends on 
metallicity: the MS of a metal-poor population lies below and to the left of 
the MS of a metal-rich one. 

When a massive star leaves the MS it moves to the right in the CM dia- 
gram and not long afterwards dies in a supernova explosion. Running nearly 


4 In stellar dynamics, this law leads to the ‘gravithermal catastrophe’; see §8.2 of BT 
or Lynden-Bell & Wood (1968). 

Table 5.1 Features in the CM diagram 

Feature                         Physical significance          Remarks
------------------------------  -----------------------------  -------------------------------
Main sequence (MS)              Core H burning
Subgiant branch (SGB)           Transition from core to        Prominent in globular-cluster
                                shell H burning                CM diagrams
Red giant branch (RGB)          Shell burning of H             For lower-mass stars
                                                               terminated by He flash
Horizontal branch (HB)          Core He burning                Has characteristic luminosity;
                                                               color sensitive to metallicity
Red clump (RC)                  Stubby red HB formed by        Prominent in disk CM diagrams
                                more metal-rich stars
Asymptotic giant branch (AGB)   Shell He burning               Associated with significant and
                                                               increasing mass loss; stars
                                                               often irregular variables
Instability strip               He+ ionization zone gives      RR Lyrae and W Virginis stars
                                rise to regular variables      lie at intersection with HB;
                                                               Cepheids are massive stars
                                                               that lie in strip
White-dwarf sequence            Cooling electron-degenerate    Blue and faint
                                stars

Figure 5.1 The principal features in a CM diagram, plotted as $\log(L/L_\odot)$ 
against $\log(T_{\rm eff}/{\rm K})$. The MS and evolutionary tracks are for chemical 
composition Y = 0.28, Z = 0.02. [After Chiosi, Bertelli & Bressan (1992) 
from data published in Bressan et al. (1993)] 

vertically through the CM diagram is a strip within which stars pulsate. This 
instability strip is the subject of §5.1.10 below. The evolutionary tracks of 
higher-mass stars intersect the instability strip, and at this intersection are 
found Cepheid variables. These pulsating stars are discussed in detail in 
§5.1.10 below because they are important for the problem of establishing the 
cosmic distance scale (§7.3.1). 

After leaving the MS, stars of mass $M < 8\,M_\odot$ spend some years moving 
up the approximately vertical red-giant branch (RGB) — see Figure 5.1 and 
§3.5. In the CM diagrams of globular clusters, the base of the RGB is joined 
to the end of the MS by the subgiant branch (SGB) which is short and slopes 
from lower left to upper right in the CM diagram. [In the CM diagram of the 
solar-neighborhood (Figure 3.5) this region is rather sparsely populated and 
called the Hertzsprung gap; see §3.5.] The SGB is occupied by stars that 
are burning H to He in a shell rather than in their cores, but have not yet 
developed fully convective envelopes. The RGB is occupied by shell-burning 
stars that possess fully convective envelopes. 

An important concentration of stars is the horizontal branch (HB), 
on which stars burn He in their cores just as on the MS they burn H in 
their cores. The locus of stars of different masses that have just started 
core He-burning is called the zero-age horizontal branch (ZAHB) by 
analogy with the ZAMS. During a star’s time on the HB, its luminosity 
slowly increases — see the upper panel of Figure 5.5. Consequently the HB 
has a non-zero vertical extent with the ZAHB bounding it below. The stars 
of metal-rich clusters and the solar neighborhood congregate at the red end 
of the horizontal branch, so that CM diagrams for such stars contain HBs 
that resemble blobs rather than lines. The HB is then usually called the red 
clump (RC). 

The HBs of metal-poor clusters intersect the instability strip, and at 
this intersection are found RR Lyrae variables. Like Cepheid variables, 
RR Lyrae stars are important for the problem of establishing the cosmic 
distance scale (§7.3.1) and are described in §5.1.10 below. 

Once a star has run out of He in its core, it moves rather quickly away 
from the HB up the asymptotic giant branch (AGB), which runs parallel 
to the RGB, but slightly to the blue (Figure 5.3). Here a star burns He 
in a shell and possibly C in its core. As it moves up the AGB, a star 
becomes more and more subject to variability and mass loss. Probably all 
stars eject material in the form of a stellar wind, but the rate of mass loss 
is usually small early in a star’s life. As it gets older and more luminous, 
a star loses mass faster and faster, with the result that many stars return 
most of their mass to the interstellar medium (Chapter 8) before they die. 
As a star approaches the top of the AGB, its rate of mass loss increases 
and oscillations develop within the star. The cause of these oscillations is a 
kind of duet between a He-burning shell at small radii and a H-burning shell 
around it (Schwarzschild & Härm 1965). For 90% of the time the H-burning 
shell is the more important source of the star’s energy, but occasionally it is 

Figure 5.2 Evolutionary tracks for solar-metallicity stars (Y, Z) = (0.28, 0.02) with initial 
masses from $0.6\,M_\odot$ to $100\,M_\odot$. On each track several points are marked and numbered. 
Table 5.2 gives the time it takes a star to reach each of these points starting from point 
1. To avoid confusion tracks for $M < 2\,M_\odot$ terminate at the He flash; see Figure 5.3 
for the further tracks of these stars. All models assume convective overshoot. [From data 
published in Bressan et al. (1993)] 


extinguished by bursts of activity on the part of the He-burning shell. This 
duet generates a complex pattern of convection which may bring C nuclei 
from below the He-burning shell to the surface. In this way a carbon star is 
made out of an M giant. The movement of C nuclei through the He-burning 
shell into the star’s envelope is one example of a general phenomenon known 
as dredge-up. It seems that such mixing into the envelope of material that 
has been processed by nuclear reactions deep in the interior of a star is not 
at all uncommon. In §5.2.2 we shall see that dredge-up is of considerable 
importance for galactic astronomy. 

By the time a star has reached the HB it is thought to have lost about 
20% of its mass (§6.1.3). Observations indicate that the rate of mass loss 
increases markedly as a star approaches the top of the AGB, although there 
is as yet inadequate theoretical understanding of the causes of this increased 
mass loss. A likely consequence of this enhanced mass loss is the formation 
of OH/IR stars — see Table 3.5. Both carbon stars and OH/IR stars have 
come to play important roles in Galactic structure research, because they 
are readily identified objects of high luminosity and can therefore be studied 
throughout the Milky Way. Stars that lie near the top of the AGB usually 
have appreciable infrared luminosities, which are thought to derive from an 
envelope of dusty gas that has been thrown off by the star and is now drifting 
away from it in a roughly spherical configuration. 

Soon after reaching the top of the AGB a star will shed essentially all 
of its remaining H. Stripped of its blanketing layer of H, the star becomes 
extremely blue, making a violent lurch leftwards in the CM diagram. The 
intense ionizing radiation from the star may cause the ejected H envelope 
to fluoresce brightly as a planetary nebula. Gradually He burning ceases 
in the star and it starts to cool, moving down the white-dwarf cooling 
sequence at the extreme left of the CM diagram. Its mass now probably 
lies in the range 0.55-0.6 $M_\odot$ regardless of its initial mass, although some 
white dwarfs have masses as low as $0.4\,M_\odot$. Its density is now so great 
that it is sustained against gravitational collapse by the minimum motion 
required of its electrons if they are to satisfy the Pauli exclusion principle, 
just as an ordinary atom is sustained against electrostatic collapse by the 
minimum motion of its electrons. Now that the star is degenerate, it can 
finally evade the law that has hitherto required it to lose heat and contract. 


5.1.3 Characteristic initial masses 


Let us now return to the MS and note some key masses. Figures 5.2 and 5.3 
show how stars of solar metallicity and various initial masses move in the 
CM diagram when they move off the MS. Table 5.2 gives the times that 
elapse between the numbered points on the evolutionary tracks plotted in 
Figure 5.2, while Figures 5.4 and 5.5 plot the luminosities of some of these 
models versus time. 

The least massive stars have $M_{\rm H} \sim 0.08\,M_\odot$. Objects less massive than 

Box 5.1: Degenerate Objects 


Consider a box filled with $N$ non-interacting fermions. Since no two 
fermions can occupy the same state, $N$ different single-particle states 
must be occupied. At zero temperature, these are the $N$ lowest-lying 
energy eigenstates. If the linear scale $L$ of the box is changed, the 
momentum $p$ of each state scales as $p \propto h/L \propto \rho^{1/3}$, where $\rho$ is the density 
of the material in the box. The pressure $P$ exerted by the material in 
the box is proportional to the energy density, so $P \propto E/L^3 \propto E\rho$. If the 
particles are moving non-relativistically, $E \propto p^2 \propto \rho^{2/3}$ and we conclude 
that $P \propto \rho^{5/3}$. This is the relation that is satisfied during adiabatic 
compression of a gas that has ratio of principal specific heats $\gamma = 5/3$. 
If the fermions are moving relativistically, we have $E \propto p$ and similarly 
deduce $P \propto \rho^{4/3}$. Thus for a gas of relativistic fermions $\gamma$ takes the value 
$4/3$. In Problem 5.4 we will see that this value is critical for gravitational 
collapse. 

If one calculates the mass $M$ of a fully relativistic degenerate object 
as a function of the object’s central density, $\rho_0$, one finds that as $\rho_0 \to \infty$, 
$M$ tends to a finite value, $M_{\rm Chan}$, called the Chandrasekhar limiting 
mass. Numerically, $M_{\rm Chan} = 5.87\mu^{-2}\,M_\odot$, where $\mu$ is the number of 
atomic mass units per relativistic fermion (Weinberg 1972). For a He 
white dwarf, $\mu = 2$, so $M_{\rm Chan} = 1.47\,M_\odot$. 

In many stars, atoms are crushed together so tightly that the 
volume per atom is substantially less than the volume of an isolated atom, 
and most electrons are no longer bound to individual nuclei. If for some 
reason the star contracts, the wavefunctions of the electrons shrink, and 
their energies are driven upwards as calculated above. Hence our results 
are valid even though the electrons have non-negligible electrostatic 
interactions with the nuclei and thus do not conform to our simple picture 
of a non-interacting electron gas. Similarly, in a neutron star, the 
neutrons interact strongly with one another, but our results will be valid so 
long as their wavefunctions scale with the overall size of the star. 


$M_{\rm H}$ never get hot enough in their cores to burn H; they are supported by 
electron-degeneracy pressure and become black dwarfs. Jupiter is a low-mass 
($\sim 0.001\,M_\odot$) object of this class. Although they never burn H, they 
may burn light elements such as D and Li, and during this period they are 
known as brown dwarfs. 

The next significant mass on the MS is $M_{\rm conv} \simeq 1.1\,M_\odot$. Stars more 
massive than $M_{\rm conv}$ have convective cores, whereas the cores of less massive 
stars are entirely radiative. (Actually, the size of convective regions grows 
gradually, starting at $M_{\rm conv}$, and engulfs the core only at $\sim 1.5\,M_\odot$.) The 
cores of massive stars are convective because at their higher temperatures, 

Figure 5.4 Luminosity as a function of time (in Gyr) for stars evolving from 
the MS to the He flash. Curves are labelled by initial masses ($M_\odot$). 
[From data published in Bressan et al. (1993)] 

the CNO cycle [equation (5.3)] is a more important source of heat than the 
p-p chain [equation (5.1)], and the strong temperature-sensitivity of the CNO 
cycle concentrates energy production at the center of the star. 

A further important mass on the MS is $M_{\rm HeF} \sim 1.8$-$2.2\,M_\odot$ depending 
on the chemical composition. A star that starts out less massive than $M_{\rm HeF}$ 
experiences a helium flash. This is the explosive ignition of its helium 
core as the star reaches the top of the RGB (see Box 5.2). Stars that are 
initially more massive than $M_{\rm HeF}$ ignite helium quiescently because, unlike 
the cores of lower-mass stars, at the point of ignition their cores are not 
electron degenerate. From our point of view, the importance of $M_{\rm HeF}$ is 
that initially more massive stars do not burn their He on the HB or in 
the red clump. The luminosity of HB stars is almost independent of mass 
(Figure 5.3). The reason for this is that He ignites explosively when a core has 
accumulated a characteristic mass ($\sim 0.45\,M_\odot$) of He and the HB luminosity 
is largely determined by the mass of the He core. By contrast, at the point 
of He ignition, the mass of the He core of a star of initial mass $M_{\rm i} > M_{\rm HeF}$ 
increases with $M_{\rm i}$, with the consequence that the luminosity of the star 
during core He burning increases with $M_{\rm i}$. Hence stars with $M_{\rm i} > M_{\rm HeF}$ 
lie above the HB during core He burning by an amount that increases with 
$M_{\rm i}$. In Figure 5.2 the point of He ignition on tracks for $M/M_\odot < 20$ is 
marked by the digit 6 and for $M > 3\,M_\odot$ most He is burnt near the bottom 
of the track that runs first down from the 6 and then up to the tip of the 
AGB above the 6. Notice that a star of mass $\gtrsim 6\,M_\odot$ makes a significant 
horizontal excursion as it burns He in its core. This excursion is caused by 
the cessation of convection in the star’s outer layers. 

In stars with initial mass $M_i > M_{\rm up} \approx 8\,M_\odot$, C ignites quiescently in 
the core. In all but the most massive stars ($M \gtrsim 50\,M_\odot$) nuclear burning 
then proceeds until the core consists of iron-peak nuclei (see §5.2.1 below). 
Since these are the most tightly bound nuclei, further energy cannot be 
released by burning them to still more massive nuclei. Consequently, the core 
tends to contract, and as this contraction drives up the Fermi energy of the 
core’s electrons and heats the core, two processes lead to the core’s 
catastrophic gravitational collapse: the first process is the capture of electrons 
by nuclei and the second is the shattering of nuclei into $\alpha$-particles by 
energetic photons. Both processes diminish the ambient pressure by removing 
energy from the photon and electron fields, which sustain the core against 
the inward pull of gravity. This leads to a runaway contraction because the 
smaller the core gets, the higher the Fermi energy becomes, and the faster 
electron capture and the photo-dissociation of nuclei proceed. 

Figure 5.5 Luminosity as a function of time from the onset of core 
He burning. The top panel is for stars that experience the He flash 
and the lower panel is for more massive stars. For these stars the 
time axis has been scaled by the time $t_{\rm He}$ given in the last column 
of Table 5.2. [From data published in Bressan et al. (1993)] 

As the core implodes, some of the energy released may lead to a spec- 
tacular display as a supernova — see §5.2.3 below. Depending on the mass 




Table 5.2 Stellar-evolution times for Z = 0.02 

            Times (Myr) between points in Figure 5.2 
$M/M_\odot$     1-2      2-3      3-4      4-5      5-6    $t_{\rm He}$ 

    0.6       78245     4724     1591    175.3    17.70    143.7 
    1.0        8899    376.3     1995    178.3    907.5    133.0 
    1.5        2471    53.76     5.05    63.42    188.9    137.5 
    2.0        1030    20.26    1.700    13.82    44.75    166.7 
    3.0       347.4    6.785    0.492    2.199    2.558    107.1 
    4.0       171.1    3.119    0.254    0.551    0.683    30.14 
    6.0       67.92    1.057    0.070    0.088    0.147     5.94 
    9.0       29.45    0.414    0.023    0.020    0.041     1.94 
   15.0       12.37    0.136    0.008    0.008    0.011     0.82 
   20.0        8.46    0.085    0.005    0.006    0.006     0.61 
   40.0        4.43    0.138    0.007    0.541    0.032      - 
  100.0        1.27    1.079    0.619    0.415    0.023      - 

NOTES: The final column, $t_{\rm He}$, gives the time from point 6 until the 
star’s death. 

SOURCE: From data published in Bressan et al. (1993) 


of the core, the implosion leads to the formation of either a neutron star or 
a black hole. Neutron stars, like white dwarfs, cannot have masses in excess 
of some limiting value (Chandrasekhar 1931, Oppenheimer & Volkoff 1939). 
This value depends upon the equation of state of the star’s material, which is 
uncertain in the case of neutron stars. Observationally, neutron star masses 
cluster around $1.4\,M_\odot$ (see Figure 5.6), which suggests that the limiting 
mass is $\sim 1.4\,M_\odot$, and that a core that is significantly more massive will 
form a black hole when it implodes. 

Stars initially more massive than M,.. ~ 60 Mg become unstable be- 
fore forming iron-peak elements in their cores — at the point of instabil- 
ity, their cores consist largely of O. When the core temperature reaches 
Te = 2m,c?/k ~ 2 x 10’ K, the thermal density of positrons becomes signifi- 
cant and the pair-production instability sets in. The rapid increase with 
temperature $T$ in the number density of $e^-e^+$ pairs implies that the 
specific heat of the vacuum becomes unusually large, and the ratio of principal 
specific heats $\gamma$ in the star drops below the value $\gamma = 4/3$.⁵ This destabilizes 
the star (see Problem 5.4). What happens next is uncertain. Some calculations 
suggest that when the star’s mass is less than $\sim 120\,M_\odot$, a few violent 
pulsations occur in which episodes of O-burning temporarily reverse 
gravitational collapse. Finally the core collapses and the overlying envelope may 
be ejected (Woosley & Weaver 1986). Similar calculations suggest that when 


5 Recall that the principal specific heats always satisfy $c_p - c_v = R$, the gas constant. 
Hence increasing $c_v$ lowers $\gamma = c_p/c_v$. 

Box 5.2: Ignition in a Degenerate Object 


As Box 5.1 explains, at zero temperature, the $N$ electrons of a body 
occupy the $N$ lowest-lying quantum states. The energy of the highest-energy 
occupied state is called the body’s Fermi energy, $E_{\rm F}^{\rm e}$. An object 
whose temperature is significantly less than $E_{\rm F}^{\rm e}/k$ is said to be electron 
degenerate because its electrons are still largely confined to the 
lowest-lying $N$ quantum states. In such a body the thermal energy of the 
electrons is almost independent of temperature $T$ because there are few 
quantum states into which an electron could be excited by the provision 
of energy of order $kT$. 

Mestel (1952) pointed out that nuclear fuels are liable to ignite 
explosively in an electron-degenerate object. The pressure in such an 
object is provided by electrons, most of which have energies comparable 
to the electron Fermi energy $E_{\rm F}^{\rm e}$. When the object is heated, the 
energies of these electrons change appreciably only when the temperature 
approaches $T_{\rm F} = E_{\rm F}^{\rm e}/k$ and the object ceases to be completely 
degenerate. Since in a non-relativistic system, $E_{\rm F}$ is inversely proportional to 
particle mass, the nuclei become non-degenerate at temperatures 
thousands of times lower than $T_{\rm F}$. Hence over a large range of temperatures, 
the velocity distribution of the nuclei is Maxwellian while that of the 
electrons, which govern the pressure, is essentially independent of 
temperature. As nuclear reactions heat the object, the rate at which they 
occur increases because the number of fast nuclei that are capable of 
penetrating Coulomb barriers increases in accordance with the 
Maxwell-Boltzmann law. However, little extra pressure is generated, with the 
result that there is negligible expansion of the system and thus the 
reaction rate, which is proportional to the square of the particle density, 
is scarcely moderated by expansion. Hence the reaction rate increases 
almost exponentially, and an explosion ensues. 


$120 < (M_i/M_\odot) < 300$, O-burning completely disrupts the star, while an 
even more massive star collapses to a black hole without a single pulsation. 
Before they become unstable it is likely that all stars of mass $M_i > 60\,M_\odot$ 
will have largely ejected their H envelopes, with the result that in any 
subsequent supernova event H-lines will not be prominent. This will cause them 
to be classified as type I supernovae (see §5.2.3 below). 

Figure 5.7 The remnant-mass-initial-mass … $1 < M/M_\odot < 8$ the curve follows … 

A relationship that is of particular importance for galactic astronomy is 
the remnant-mass-initial-mass relation, $M_r(M_i)$, which gives the mass 
$M_r$ of the remnant to which a star of initial mass $M_i$ evolves. This relation 
is sketched in Figure 5.7. It is of interest only for $M_i \gtrsim 0.8\,M_\odot$ because no 
lower-mass stars have had time to complete their evolution. At low values 
of $M_i$, $M_r$ climbs slowly through the range of possible white-dwarf masses. 
At $M_i = M_{\rm up} \approx 8\,M_\odot$ the remnant switches from being a white dwarf to a 
neutron star, and since, as Figure 5.6 shows, the latter are observed to have 
masses that cluster tightly around $1.4\,M_\odot$, we adopt this remnant mass for 
values of $M_i$ up to $M_i = 60\,M_\odot$. Some less massive stars may form black 
holes, but stellar evolution calculations suggest that when they implode, the 
cores of these stars are not much more massive than $1.4\,M_\odot$. Significantly 
more massive black holes may be produced by stars of mass $M_i > 60\,M_\odot$: 
X-ray astronomy has revealed remnants with masses $\gtrsim 3\,M_\odot$. In Figure 5.7 
we arbitrarily assume that the mass of the black hole remnant rises linearly 
with $\log M_i$ from $1.4\,M_\odot$ at $M_i = 60\,M_\odot$ to $3\,M_\odot$ at $M_i = 90\,M_\odot$. 
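The high-mass part of the adopted relation is simple enough to write down explicitly. The sketch below encodes only what the text states: a constant 1.4 solar-mass remnant from 8 to 60 solar masses, then a rise that is linear in log M_i to 3 solar masses at 90 solar masses. The function name and the decision to raise an error outside that range are illustrative assumptions; the white-dwarf branch below 8 solar masses is not specified here and is deliberately left out.

```python
import math

M_UP = 8.0        # Msun: white-dwarf / neutron-star boundary quoted in the text
M_RAMP_LO = 60.0  # Msun: start of the assumed black-hole ramp
M_RAMP_HI = 90.0  # Msun: end of the ramp
M_NS = 1.4        # Msun: adopted neutron-star remnant mass
M_BH_TOP = 3.0    # Msun: remnant mass reached at 90 Msun


def remnant_mass(m_i):
    """Remnant mass (Msun) for initial mass m_i (Msun), high-mass branch only."""
    if m_i < M_UP:
        # The white-dwarf branch is not given in closed form in the text.
        raise ValueError("white-dwarf branch not covered by this sketch")
    if m_i <= M_RAMP_LO:
        return M_NS
    if m_i <= M_RAMP_HI:
        # Linear interpolation in log10(m_i) between the two end points.
        frac = math.log10(m_i / M_RAMP_LO) / math.log10(M_RAMP_HI / M_RAMP_LO)
        return M_NS + frac * (M_BH_TOP - M_NS)
    raise ValueError("relation not specified above 90 Msun")


if __name__ == "__main__":
    for m in (8.0, 25.0, 60.0, 75.0, 90.0):
        print(f"M_i = {m:5.1f} Msun -> M_r = {remnant_mass(m):.2f} Msun")
```

Because the ramp is linear in the logarithm of the initial mass, the halfway value of 2.2 solar masses is reached at the geometric mean of 60 and 90 solar masses, not at 75.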

The $M_r(M_i)$ relation plotted in Figure 5.7 is very approximate. Not 
only is it subject to the uncertainties at the high-mass end just discussed, 
but even at the low-mass end it is observationally clear that the very concept 
of a remnant-mass-initial-mass relation is incorrect in detail, because stars 
of identical initial mass may evolve to white dwarfs of differing mass for at 
least two reasons. First, differences in chemical composition and/or rotation 
may lead to differing amounts of mass loss on the RGB and AGB. Second, 
most stars are in binary systems, and a binary companion can either 
accelerate mass loss through a stellar wind or add material to the surface of a 
Box 5.3: The Hayashi Line 


Imagine a furnace that floats freely in space and is enveloped in a massive 
cloud of gas. You are in control of the fuel supply to the furnace. At the 
lowest setting, the heat it produces is radiatively conducted away through 
the gas cloud and radiated from its surface. Then at some critical setting, 
some part of the cloud becomes convective — in places cells of hot gas 
rise, just as hot air rises from a radiator, and are replaced by downward 
moving cells of cold gas. As you turn up the fuel supply more, the 
convectively stirred region grows steadily in radial extent. Eventually, 
the cloud becomes fully convective. 

Consider the location, at each stage in the experiment, of the cloud 
in a plot of luminosity $L$ versus surface temperature $T$. Initially there is 
not much heat to radiate, so both $L$ and $T$ are small. Then as $L$ rises, 
$T$ tends to rise also. But it turns out that once the cloud becomes fully 
convective, $T$ becomes very nearly independent of $L$; at this point the 
cloud swells sufficiently rapidly with increasing $L$ that the additional 
luminosity can be radiated from the surface without an appreciable change 
in temperature. Hayashi (1961) discovered this surprising behavior of 
fully convective stars, and their locus in the $(T, L)$ plane is called the 
Hayashi line in his honor. As Figure 5.10 below shows, the Hayashi lines 
of stars of different masses lie close to one another, with more massive 
stars on the left (at higher $T$). 


star. Notwithstanding these caveats, the general form of the curve shown in 
Figure 5.7 is probably sound and of considerable astrophysical importance. 
In particular, it makes clear that stars eventually return most of their mass 
to the interstellar medium. 


5.1.4 Bounding curves in the CM diagram 


The dashed lines in Figure 5.1 bound a region within which important pul- 
sating stars lie. The mechanism driving the pulsation of these stars will be 
discussed in §5.1.10 below. The most important stars within this instability 
strip are the RR Lyrae variables, which are fairly metal-poor HB stars, and 
the Cepheid variables, which are supergiants — see Table 3.5. The strip also 
contains W Virginis stars (Table 3.5), $\delta$ Scuti stars (pulsating MS stars) and 
pulsating white dwarfs. 

In the uppermost region of the CM diagram only variable stars are 
found (de Jager 1984), and the curve that forms the upper edge of the region 
occupied by non-variable stars is called the de Jager limit. This limit 
slopes downwards from left to right, so that the most luminous blue stars 
are about a factor of six brighter than the most luminous red stars. This is 
interesting because massive stars evolve from the MS at essentially constant 
luminosity. Hence the evolution of the most massive stars carries them across 
the de Jager limit. It is observed that the largest mass-loss rates occur in 
the vicinity of the de Jager limit (de Jager, Nieuwenhuijzen & van der Hucht 
1988). One possible explanation for the absence of stars to the right of the 
de Jager limit is that mass loss arrests the redward movement of these stars 
over this boundary. An alternative explanation is that the stars evolve so 
quickly once they reach the de Jager limit that we stand little chance of 
observing them in the region beyond the limit. 

Figure 5.8 Evolutionary tracks for stars with Z = 0.0004, which implies [Fe/H] = -1.7. 
These tracks should be compared to those of solar-metallicity stars shown in Figure 5.2; to 
facilitate this comparison the dotted curves show those tracks for $M = 1$ and $6\,M_\odot$. For 
clarity the tracks for $M = 1.5$, 1 and $0.6\,M_\odot$ terminate at the He flash; see Figure 5.9 for 
the subsequent evolution of these stars. Table 5.3 gives the times required to get between 
the marked points. [From data published in Fagotto et al. (1994)] 


To the right of the populated part of the CM diagram lies the forbidden 
region; this region is forbidden in the sense that an equilibrium star cannot 
lie there. The boundary between the forbidden and populated regions of 
the CM diagram is formed by the Hayashi line of the lowest-mass stars. 
(See Box 5.3 for the definition of a star’s Hayashi line, which is essentially 
coincident with the RGB on to which the star evolves from the MS.) 


5.1.5 Dependence of CM diagrams upon metallicity 


While the gross structure of the CM diagram is the same for stars of any 
chemical composition, the precise locations of the various branches do depend 
on the composition parameters (X,Y, Z) that were defined in §3.4. Figure 5.8 
plots evolutionary tracks for moderately metal-poor stars, for which appro- 
priate values are Y = 0.23 and Z = 0.0004 — as we shall see below (§5.1.6), 
while most He was produced in the Big Bang, significant quantities of He are 
produced alongside heavy elements, and it is appropriate to decrease both 
Y and Z together. 

Careful comparison of the tracks of Figure 5.8 with the ones for $Y = 
0.28$ and $Z = 0.02$ plotted in Figure 5.2 reveals that lowering $Z$ by 1.7 dex 
modifies individual stellar tracks in several important ways that we now 
discuss. To facilitate discussion, the tracks of solar-metallicity stars of mass 
$1\,M_\odot$ and $6\,M_\odot$ are shown as dotted curves in Figure 5.8. 

Lowering $Z$ causes individual stars to become both brighter and hotter. 
Above $4\,M_\odot$ the main effect of lowering $Z$ is to make stars hotter: on the 
ZAMS, a $0.6\,M_\odot$ star is brighter by a factor 2.3 and hotter by a factor 
1.26 at the lower metallicity, while at the lower metallicity a $4\,M_\odot$ star is 
brighter by a factor 1.3 and hotter by a factor 1.33. Although lowering $Z$ 
makes individual stars brighter, the overall effect of lowering $Z$ by 1.7 dex is 
to move the ZAMS down by just under 1 mag at a given color.⁶ In §3.3 we 
saw that extremely metal-poor MS stars are called subdwarfs. These stars 
owe their name to the fact that they lie below the ZAMS of the Hyades 
cluster, which is the standard for stars of approximately solar metallicity. 
Box 5.4 describes an important technique for identifying subdwarfs. 

Lower-metallicity stars make longer excursions towards higher 
temperatures and bluer colors during He burning. In Figure 5.2 only stars of $6\,M_\odot$ 
and above make such excursions, and these are of smaller extent than the 
corresponding excursions in Figure 5.8. Moreover, at $Z = 0.0004$ blueward 
excursions are significant for all stars with masses of $2\,M_\odot$ and above. These 
excursions are of particular importance because they are liable to carry the 
star into the instability strip, when it will become an easily identified variable 
star (§5.1.10). 

Table 5.3 lists the times required to evolve between the points marked in 
Figure 5.8. Since the luminosity of a star generally increases with decreasing 
metallicity, it evolves more quickly, and the times in Table 5.3 are generally 
shorter than those in Table 5.2. 

Comparison of Figures 5.2 and 5.8 shows that at low metallicity the 
RGB is much more nearly vertical in the CM diagram, and has a color that 
depends less strongly on the masses of the stars that lie on it, than is the 
case for solar metallicity. Observationally, these differences are apparent in 
Figure 6.3. 


6 At a given luminosity lowering $Z$ by 1.7 dex shifts the ZAMS to the left by $\sim 0.06$ 
in $\log T_{\rm eff}$, which corresponds to about $-0.32$ in $V - K$ color (Table 3.6). 

Box 5.4: Ultraviolet Excess 


An important way of identifying subdwarfs of unknown distance and 
therefore unknown location in the CM diagram is to measure the 
ultraviolet excess, $\delta(U - B)$, which is the difference between $U - B$ for a 
star and that of the Hyades MS at the star’s value of $B - V$. (Both 
$U - B$ and $B - V$ should have been corrected for any reddening.) At 
fixed metallicity, $\delta(U - B)$ is a decreasing function of $B - V$: it tends 
to be largest for blue stars because line blanketing occurs mostly in the 
blue region of the spectrum (§2.3.2). 

[Figure: correction factor plotted against $B - V$] 

In an attempt to define a color index that is an unambiguous 
metallicity indicator for any dwarf, Sandage (1969) defined $\delta_{0.6}$ to be 
$\delta(U - B)$ multiplied by the correction factor that is plotted at left: 
this correction factor is designed to make $\delta_{0.6}$ independent of $B - V$ at 
fixed metallicity. Laird, Carney & Latham (1988) found that the 
relationship between $\delta_{0.6}$ and [Fe/H] is 

$$\delta_{0.6} = -0.0776 + \left(0.01191 - 0.05353\,[{\rm Fe/H}]\right)^{1/2} \qquad (1)$$

and that the vertical offset $\delta M_V$ between the Hyades ZAMS and that 
formed by subdwarfs of given $\delta_{0.6}$ is 

$$\delta M_V = 0.862\left(-0.6888\,\delta_{0.6} + 53.14\,\delta_{0.6}^2 - 97.004\,\delta_{0.6}^3\right). \qquad (2)$$
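The two fitting formulae of Box 5.4 are easy to evaluate numerically. The following is a minimal sketch, assuming the coefficients as transcribed from equations (1) and (2) of the box; the function names, the inverse relation and the error handling are illustrative additions, not part of the original calibration.

```python
import math


def delta_06(fe_h):
    """Normalized ultraviolet excess for a given [Fe/H], Box 5.4 equation (1)."""
    arg = 0.01191 - 0.05353 * fe_h
    if arg < 0.0:
        raise ValueError("[Fe/H] lies outside the range where the fit is real")
    return -0.0776 + math.sqrt(arg)


def fe_h_from_delta_06(d06):
    """Algebraic inverse of equation (1); valid while d06 + 0.0776 >= 0."""
    return (0.01191 - (d06 + 0.0776) ** 2) / 0.05353


def delta_mv(d06):
    """Offset (mag) below the Hyades ZAMS for a given delta_0.6, equation (2)."""
    return 0.862 * (-0.6888 * d06 + 53.14 * d06 ** 2 - 97.004 * d06 ** 3)


if __name__ == "__main__":
    for fe_h in (0.0, -0.5, -1.0, -1.7):
        d = delta_06(fe_h)
        print(f"[Fe/H] = {fe_h:5.2f}  delta_0.6 = {d:6.3f}  dM_V = {delta_mv(d):5.2f} mag")
```

In practice one would go the other way: measure the excess, convert it to a metallicity with the inverse function, and then use the offset to place the star relative to the Hyades sequence.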


Figures 5.3 and 5.9 show the evolution of low-mass stars from the HB 
up the AGB. At low Z the AGB, like the RGB, runs more nearly vertically 
in the CM diagram and lies blueward of the AGB for solar-metallicity stars; 
a star half way up the AGB for Z = 0.0004 is hotter than a solar-metallicity 
star that is half way up its AGB by ~ 0.1 dex. 

Careful comparison of the locations of the bottoms of the tracks in 
Figures 5.3 and 5.9 reveals that for low Z the HB lies slightly lower than 
it does for solar metallicity. Calculations of Lee, Demarque & Zinn (1990) 
yield for Y = 0.23 


$$M_V({\rm HB}) = 0.17\,[{\rm Fe/H}] + 0.82, \qquad (5.4)$$


but this dependence is unfortunately still controversial (Buonanno, Corsi & 
Fusi Pecci 1989, Chaboyer, Demarque & Sarajedini 1996). 

Table 5.3 Stellar-evolution times for Z = 0.0004 

            Times (Myr) between points in Figure 5.8 
$M/M_\odot$     1-2      2-3      3-4      4-5      5-6    $t_{\rm He}$ 

    0.6       41939     2616      782      111    17.40    137.5 
    1.0        5095     1423      140     36.5    10.57    114.0 
    1.5        1516     38.7     30.1     17.1    93.31    112.2 
    2.0         662     17.2     7.17     4.69    8.291    158.2 
    3.0         253     6.38     1.72    0.931    0.657    43.67 
    4.0         139     3.38    0.759    0.317    0.176    18.61 
    6.0        64.6    1.120    0.229    0.070    0.047     5.91 
    9.0        31.6    0.431    0.069    0.017    0.016     2.21 
   15.0       14.31    0.160    0.028    0.004    0.006     0.89 
   20.0        9.87    0.095    0.040    0.030    0.597      - 
   40.0        5.02    0.035    0.021    0.016    0.367      - 
  100.0        2.90    0.148    0.012    0.001    0.258      - 

NOTES: The points are numbered according to the scheme used 
in Figure 5.2. The final column, $t_{\rm He}$, gives the time from point 6 until 
the star’s death. 

SOURCE: From data published in Fagotto et al. (1994) 

Figure 5.9 The full curves show the evolutionary tracks of low-metallicity stars 
($Z = 0.0004$) of mass 0.7, 1.1 and $1.5\,M_\odot$ from the ZAHB to the tip of the AGB. 

In Figure 5.9 it can be seen that on the HB $0.7\,M_\odot$ stars lie significantly 
to the left of $1.1\,M_\odot$ stars, which themselves lie slightly below $1.5\,M_\odot$ stars. 
In Figure 5.3, by contrast, the less massive a solar-metallicity star is, the 
further to the right it sits on the HB. For solar metallicity the total extent 
of the HB is small, however, so that a red clump at the bottom end of the 
AGB is all that can be seen of the HB in metal-rich clusters. 


5.1.6 The cosmic helium abundance 


Since the discovery of the cosmic background radiation by Penzias & Wilson 
(1965), there has been little doubt that most of the He in the Universe was 
synthesized in the first few minutes of the Big Bang.⁷ Hence even the oldest 
and most metal-poor stars should have a substantial He abundance, and we 
would very much like to measure this abundance since this would tell us 
how much He was made in the Big Bang. Unfortunately, the only old stars 
that are hot enough to excite spectral lines of He are evolved stars that have 
themselves synthesized additional He. Clearly, it is necessary to measure Y 
in an unevolved star. One approach is to observe hot, young stars in dwarf, 
metal-poor galaxies, and then to plot the values of Y that one obtains as 
a function of the metallicity Z in these stars: the primordial value of Y is 
obtained by extrapolating Y(Z) back to Z = 0. Another technique involves 
a similar extrapolation of the values of Y and Z that are measured for the 
interstellar media of metal-poor galaxies. These techniques lead to the value 
$Y = 0.228 \pm 0.005$ (Pagel et al. 1992), which does not differ significantly from 
the standard value $Y = 0.235$ that is assumed in many stellar-evolution 
calculations. 

Lowering $Y$ at fixed $Z$ raises the ZAMS. For astrophysically plausible 
values of $Y, Z$, Lebreton (1998) finds that $(\partial M_{\rm bol}/\partial Y)_{Z,T_{\rm eff}} \approx 3$. Hence, 
fixing $Z = 0.02$ and lowering $Y$ from 0.28 to 0.23 raises the ZAMS at 
constant $T_{\rm eff}$ by about 0.15 mag. The connection between $Y$, $Z$ and the location 
of the ZAMS provides a method for directly determining the He content of 
the oldest, most metal-poor stars. Cayrel (1968) analyzed several cool 
subdwarfs with accurately known distances, particularly Groombridge 1830, and 
concluded that the ZAMS of these stars lies about $0.75 \pm 0.3$ mag below the 
Hyades MS in the theoretical CM diagram. Eggen (1973) confirmed this 
conclusion by extending the photometry to much cooler subdwarfs and showed 
that, in a plot of $M_V$ versus $(R - I)$, the subdwarfs lie 0.75 mag below the 
Hyades MS. Eggen concluded that $Y \approx 0.23$ in the subdwarfs. This figure is 
to be compared to $Y = 0.28$ for stars of solar composition. In number ratios, 
$N({\rm He})/N({\rm H}) \approx 0.075$ for subdwarfs and 0.10 in solar-composition stars. 
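The conversion from mass fractions to number ratios, and the size of the ZAMS shift, can be checked in a few lines. This is a sketch that assumes a He nucleus weighs four times as much as a proton and takes X = 1 - Y - Z; the value Z = 0.0004 used for the subdwarfs is borrowed from §5.1.5 as an illustrative assumption, and the function names are hypothetical.

```python
def he_to_h_number_ratio(y, z):
    """N(He)/N(H) from mass fractions, taking m(He) = 4 m(H) and X = 1 - Y - Z."""
    x = 1.0 - y - z
    return y / (4.0 * x)


def zams_shift_mag(y_old, y_new, slope=3.0):
    """Change in M_bol of the ZAMS at fixed Z and T_eff for a change in Y.

    slope is the derivative dM_bol/dY quoted in the text (about 3).
    A negative result means the ZAMS becomes brighter, i.e. is raised.
    """
    return slope * (y_new - y_old)


if __name__ == "__main__":
    print(f"subdwarfs: {he_to_h_number_ratio(0.23, 0.0004):.3f}")
    print(f"solar:     {he_to_h_number_ratio(0.28, 0.02):.3f}")
    print(f"ZAMS shift for Y 0.28 -> 0.23: {zams_shift_mag(0.28, 0.23):+.2f} mag")
```

The number ratios come out close to the 0.075 and 0.10 quoted above, and the shift has magnitude 0.15 mag with the sign of a brightening.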


5.1.7 Simple numerical relations 


The theory of stellar structure relies heavily on the results of large-scale 
computations. We now describe a few numerical results that are simple 
enough to be remembered and powerful enough to be valuable when making 
order-of-magnitude estimates of the way in which a stellar system should 
evolve. 


7 In fact Gamow (1948) predicted the existence and temperature of the background 
radiation precisely from the consideration that most He should have been synthesized in 
the Big Bang. 

The models of Bressan et al. (1993) imply that on the ZAMS a star of 
mass $M$ has luminosity⁸ 

$$\frac{L}{L_\odot} \simeq \begin{cases} 81\,(M/M_\odot)^{2.14} & \text{for } M \gtrsim 20\,M_\odot,\\ 1.78\,(M/M_\odot)^{3.5} & \text{for } 2\,M_\odot < M \lesssim 20\,M_\odot,\\ 0.75\,(M/M_\odot)^{4.8} & \text{for } M \lesssim 2\,M_\odot. \end{cases} \qquad (5.5)$$

Figure 5.4 shows that as it sits on the MS a star’s luminosity steadily 
increases from this ZAMS value. 

The MS lifetime, $\tau_{\rm MS}$, of a star is fixed by the length of time that its 
luminosity can be supported by thermonuclear conversion of H to He. The 
fusion of four protons into one He nucleus releases an energy of 26.7 MeV, of 
which about 25 MeV is injected into the burning region rather than radiated 
in neutrinos.⁹ Thus the conversion of a mass $\Delta M$ of hydrogen releases 
$E = 0.0067\,\Delta M c^2$ of useful energy. If a fraction $\alpha$ of the total mass of a 
star can be converted, then $\tau_{\rm MS} = 0.0067\,\alpha M c^2/L$. Detailed computations 
show that, when about one-tenth of the stellar mass has been converted, the 
star evolves rapidly away from the MS. Hence, to order of magnitude, we 
find 

$$\tau_{\rm MS} = 10\left(\frac{M}{M_\odot}\right)\left(\frac{L}{L_\odot}\right)^{-1}\ \text{Gyr}. \qquad (5.6)$$

Stars of initial mass $M_i < M_{\rm HeF}$ have bolometric luminosity $L_{\rm HB} \sim 
50\,L_\odot$ while on the HB (Figure 5.5), and they remain there until they have 
burnt their He cores, which have mass $M_{\rm He} = 0.45\,M_\odot$. Typically half of 
the He will be converted to C and most of the other half to O. Converting 
a mass $\Delta M$ of He to half C and half O releases $7.2 \times 10^{-4}\,\Delta M c^2$ of energy. 
Hence the time these stars spend on the HB is of order (cf. Figure 5.5) 

$$\tau_{\rm HB} = \frac{7.2 \times 10^{-4} \times 0.45\,M_\odot c^2}{50\,L_\odot} \approx 0.1\ \text{Gyr}. \qquad (5.7)$$

A star of initial mass $M_i > M_{\rm up}$ converts $\sim 1.4\,M_\odot$ to Fe before 
exploding as a supernova. Conversion of a mass $\Delta M$ of H to Fe releases 
about $0.0085\,\Delta M c^2$ of useful energy. This can sustain the star at luminosity 
$L = L_3 \times 10^3\,L_\odot$ for a time 

$$\tau_{\rm SG} = \frac{0.0085 \times 1.4\,M_\odot c^2}{1000\,L_3\,L_\odot} \approx 0.18\,L_3^{-1}\ \text{Gyr}. \qquad (5.8)$$

8 Tout et al. (1996) give more elaborate and accurate analytic fits to the function 
$L(M, Z)$. 
9 The fraction of energy carried off by neutrinos depends in detail on the temperature 
and density in the burning region. 
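The estimates in equations (5.5) to (5.8) all have the same form: an energy yield per unit rest mass, times a fuel mass, divided by a luminosity. The sketch below evaluates them under stated assumptions: the physical constants are rounded standard values rather than numbers from the text, the exponents 2.14, 3.5 and 4.8 are this sketch's reading of equation (5.5), and the function names are hypothetical.

```python
M_SUN = 1.989e30   # kg, rounded solar mass (assumed value)
L_SUN = 3.828e26   # W, rounded solar luminosity (assumed value)
C = 2.998e8        # m/s, speed of light
GYR = 3.156e16     # s in one Gyr


def zams_luminosity(m):
    """ZAMS luminosity in L_sun for mass m in M_sun, after equation (5.5)."""
    if m >= 20.0:
        return 81.0 * m ** 2.14
    if m > 2.0:
        return 1.78 * m ** 3.5
    return 0.75 * m ** 4.8


def burn_time_gyr(yield_fraction, fuel_mass, luminosity):
    """Time (Gyr) for which yield_fraction * fuel_mass * c^2 sustains luminosity.

    fuel_mass is in M_sun and luminosity in L_sun.
    """
    energy = yield_fraction * fuel_mass * M_SUN * C ** 2
    return energy / (luminosity * L_SUN) / GYR


def ms_lifetime_gyr(m, burnt_fraction=0.1):
    """Order-of-magnitude MS lifetime at the ZAMS luminosity, as in (5.6)."""
    return burn_time_gyr(0.0067, burnt_fraction * m, zams_luminosity(m))


if __name__ == "__main__":
    print(f"coefficient of (5.6): {burn_time_gyr(0.0067, 0.1, 1.0):.1f} Gyr")
    print(f"HB lifetime (5.7):    {burn_time_gyr(7.2e-4, 0.45, 50.0):.2f} Gyr")
    print(f"SG lifetime (5.8):    {burn_time_gyr(0.0085, 1.4, 1000.0):.2f} Gyr")
    for m in (1.0, 2.0, 10.0, 40.0):
        print(f"M = {m:5.1f} Msun  tau_MS ~ {ms_lifetime_gyr(m):.3g} Gyr")
```

Using the ZAMS luminosity throughout overestimates the lifetime somewhat, because the luminosity rises while the star is on the MS.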
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More precise results are given in Table 5.2. 

Since most of the support of a white dwarf derives from electron- 
degeneracy pressure, which is independent of temperature, the radius of a 
white dwarf is approximately constant as it cools. Hence the white-dwarf 
cooling line in the CM diagram is given by 


$$L \propto T_{\rm eff}^4 \quad\text{or}\quad M_{\rm bol} = -10\log T_{\rm eff} + \text{const}. \qquad (5.9)$$


These constant-radius lines run roughly parallel to the MS. The observed 
positions of white dwarfs in the CM diagram — see Figure 6.17 — are in good 
agreement with theory (Cool, Piotto & King 1996). 
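The slope of the cooling line follows directly from the Stefan-Boltzmann law. The short derivation below is a sketch that assumes a strictly constant radius $R$ and the usual definition of bolometric magnitude; $\sigma$ is the Stefan-Boltzmann constant.

```latex
\begin{align}
L &= 4\pi R^2 \sigma T_{\rm eff}^4 \;\propto\; T_{\rm eff}^4
   \qquad (R = \text{const}),\\
M_{\rm bol} &= -2.5\log_{10} L + \text{const}
   = -2.5 \times 4 \log_{10} T_{\rm eff} + \text{const}
   = -10\log_{10} T_{\rm eff} + \text{const}.
\end{align}
```

On this relation a white dwarf that cools by a factor of ten in $T_{\rm eff}$ fades by 10 mag.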


5.1.8 Star formation 


The earliest stages in the life of a star lie beyond the scope of the conventional 
theory of stellar structure, which assumes that the star is at all times in 
hydrostatic equilibrium.¹⁰ This is one of the reasons why the theory of star 
formation is much less well developed than that of quasi-static stars. Other 
factors that make it hard to develop a convincing theory of star formation 
include the complex nature of the interstellar medium from which stars form 
(Chapter 8), which makes it hard to know what initial conditions to use, 
and the likely importance of magnetic fields for the formation process, which 
makes it essential to treat the problem as one in magnetohydrodynamics and 
to calculate accurately the degree of residual ionization at each point in 
a collapsing cloud. Moreover, stars are observed to form as members of 
binaries, which are likely themselves to be members of an association or a 
cluster, and a star’s interactions with its neighbors are likely to be crucial for 
its early development. 

For all these reasons we cannot present an authoritative account of the 
formation of stars. Nonetheless, given the importance of star formation for 
galactic astronomy it would be inappropriate to pass over the topic in com- 
plete silence. We therefore now sketch in broad outline how stars probably 
form, and draw attention to the key physical processes involved. 

Clusters of young stars are invariably associated with dense interstellar 
clouds (see §6.2), and, as observations of external galaxies show (see §5.1.8), 
they are often also associated with spiral arms. From this coincidence we 
conclude that stars form from dense interstellar gas, probably after the gas 
has been compressed in a large-scale shock front. 

The internal dynamics of dense interstellar clouds are not well 
understood, although spectacular advances are currently being made in the 
observational study of these complex systems. Gravity pulls them together, but 
some combination of turbulent energy and magnetic energy resists gravity 
with surprising effectiveness. At the densest points within clouds the 
ionization level of the material, which is governed by the fluxes of ionizing radiation 
and cosmic-ray particles (see §8.1.4), can become small, and the coupling 
between the magnetic field and the gas weakens. This decoupling allows the 
magnetic energy, which plays a large role in supporting the material against 
gravitational collapse, to dissipate. As magnetic energy is dissipated, the 
material becomes more dense, the ionization level falls still further and the 
magnetic energy ebbs away even faster. A protostar has formed. 


10 In the theory of pulsating stars, hydrodynamic effects are taken into account, but 
the configurations considered are near-equilibrium ones. 

Figure 5.10 Left panel: the evolution of stars massive enough to burn H on the MS before 
they reach the MS. Each curve is labeled with the object’s mass in solar units. At first 
these objects are fully convective and descend almost vertically along the Hayashi track. 
Stars with masses $M > 0.3\,M_\odot$ eventually cease to be fully convective and move sharply 
leftwards towards the ZAMS. The positions reached by stars after 1, 10, and 100 Myr from 
birth are marked with dots. Right panel: the equivalent figure for low-mass objects. At 
first the descent of these objects is slowed by the combustion of Li and D. Once an object 
has exhausted its D, it moves more rapidly towards lower temperatures and luminosities. 
[From data published in D’Antona & Mazzitelli (1994)] 

Imagine a protostar in essentially free fall. As the protostar collapses, 
gravitational energy is released and must be radiated away. Because the 
material is initially nearly transparent to infrared radiation, radiative energy 
loss is efficient, the luminosity rises rapidly, and the temperature remains 
quite low (10 K to 20 K). Eventually the material becomes opaque in a dense 
core; this opacity traps the radiation and allows the core to heat. As the 
core heats, pressure builds and arrests the collapse. The core continues to 
heat until its temperature rises to about 2000 K, at which point molecular 
hydrogen dissociates into atomic hydrogen. This dissociation provides an 
energy sink and triggers another free-fall collapse of the core, which continues 
to heat until the hydrogen not only dissociates but also ionizes. The newly 
freed electrons trap radiation rather efficiently. Pressure balance can again 
be achieved, and core collapse halts. At this point the luminosity of the star 
is sufficiently large for it to be fully convective (see Box 5.3). Hence in the 
CM diagram it lies upon its Hayashi line (see Figure 5.10). 

Once a star has reached its Hayashi line, it can achieve a state of 
hydrostatic equilibrium. It then moves down the Hayashi line on a much longer 
timescale; this downward motion is further slowed by nuclear burning of 
such light isotopes as deuterium and lithium. In stars more massive than 
$M \sim 0.3\,M_\odot$, convection eventually peters out in the core, and, as the size 
of the radiative core grows, the track turns sharply to the left (see 
Figure 5.10). Subsequent evolution of these more massive objects occurs with 
the star contracting at nearly constant luminosity and ever-increasing 
effective temperature. Ultimately, the temperature in the core rises to the point 
at which thermonuclear fusion of hydrogen into He becomes possible, and 
the star settles on to the MS. The pre-MS evolution of a star proceeds rather 
rapidly: for a $1\,M_\odot$ object it takes $\sim 75$ Myr. 

Stars with masses in the interval $(0.08\,M_\odot, 0.3\,M_\odot)$ settle on to the 
MS while still fully convective, so they continue to move vertically in the 
CM diagram until they reach the ZAMS. Stars with masses $M < 0.08\,M_\odot$ 
run out of fuel once their D has been burnt, and then slide down and to the 
right in the CM diagram as Figure 5.10 shows. 

Many young stars are observed to be surrounded by differentially ro- 
tating disks of gas and dust. In fact, the solar system is presumed to have 
formed from such a disk, and T-Tauri stars (Table 3.5) possess disks. Gravi- 
tational energy released as the material of the disk accretes on to the central 
star can make a significant contribution to the luminosity of such systems. 
Some of this energy is channeled into the formation of jets of material that 
is ejected along the rotation axis of the disk. The origin and astrophysical 
significance of these jets are not at all well understood, but jets seem to be 
a generic by-product of accretion (see, for example, §4.6). 


5.1.9 The initial mass function 


A fundamental quantity for studies of galaxy formation and evolution is the 
initial mass function (IMF), $\xi(\mathcal{M})$, that specifies the distribution in mass 
of a freshly formed stellar population. Specifically, just after a burst of star 
formation, let there be 

$$ dN = N_0\,\xi(\mathcal{M})\,d\mathcal{M} \qquad (5.10) $$

stars with masses in $(\mathcal{M}, \mathcal{M}+d\mathcal{M})$. The normalizing constant $N_0$ depends 
on both the magnitude of the burst and the way that $\xi$ is normalized. Since 
very large numbers of inconspicuous low-mass stars are likely to be formed, it 
makes more sense to normalize $\xi$ in terms of the total mass of the new-born 
stars rather than their total number. Therefore we normalize $\xi$ such that 

$$ \int d\mathcal{M}\,\mathcal{M}\,\xi(\mathcal{M}) = \mathcal{M}_\odot . \qquad (5.11) $$

With this normalization, $N_0$ is the number of solar masses contained in the 
star-burst. 

We have no a priori reason to suppose that $\xi$ is a universal function that 
applies to all starbursts. However, the available data are consistent with $\xi$ 
having been the same at $\mathcal{M} \lesssim \mathcal{M}_\odot$ for starbursts that have occurred under 
remarkably different circumstances over the entire age of the Galaxy. 

The procedure for determining $\xi$ is as follows. First we determine the 
luminosity function, $\Phi(M)$, for MS stars in a cluster or the solar neighborhood 
(see §3.6). Then we correct this luminosity function for the effects of 
stellar evolution. If the population under study is coeval (i.e., all its stars 
were born at the same time), there is no correction to make. If, by contrast, 
the star-formation rate within the population has been roughly constant, as 
is thought to be the case in the solar neighborhood, we estimate the initial 
luminosity function $\Phi_0(M)$ to be 

$$ \Phi_0(M) = \Phi(M) \times \begin{cases} t/\tau_{\rm MS}(M) & \text{for } \tau_{\rm MS}(M) < t, \\ 1 & \text{otherwise.} \end{cases} \qquad (5.12) $$

Here $t$ is the time since the population started to form and $\tau_{\rm MS}(M)$ is the MS 
lifetime of a star of absolute magnitude $M$. The factor $t/\tau_{\rm MS}(M)$ corrects 
for the fact that the only stars of magnitude $M$ that we see are those that 
formed in the last fraction $\tau_{\rm MS}(M)/t$ of the population’s life.¹¹ Once the 
initial luminosity function $\Phi_0(M)$ has been determined, we have 

$$ \xi(\mathcal{M}) = \frac{dM}{d\mathcal{M}}\,\Phi_0[M(\mathcal{M})], \qquad (5.13) $$

where $M(\mathcal{M})$ specifies the relationship between mass and absolute magnitude 
for stars of the appropriate type. 
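
The two steps just described, the evolutionary correction of equation (5.12) and the change of variable of equation (5.13), can be sketched numerically. The following is a minimal illustration, not a reduction pipeline: the input arrays are hypothetical toy values, the derivative is estimated by simple finite differences, and the absolute value of $dM/d\mathcal{M}$ is taken (an assumption of this sketch) because magnitude decreases as mass increases.

```python
def evolution_correction(tau_ms, t):
    """Correction factor of equation (5.12) for a constant star-formation rate.

    tau_ms: MS lifetime of stars of a given absolute magnitude.
    t: time since the population started to form (same units as tau_ms).
    """
    return t / tau_ms if tau_ms < t else 1.0


def initial_luminosity_function(phi, tau_ms, t):
    """Apply equation (5.12) bin by bin to an observed luminosity function."""
    return [p * evolution_correction(tau, t) for p, tau in zip(phi, tau_ms)]


def imf_from_luminosity_function(masses, mags, phi0):
    """Equation (5.13), with dM/d(mass) estimated by finite differences.

    The absolute value of the slope is used (an assumption of this sketch),
    since absolute magnitude falls as mass rises and xi counts stars.
    """
    n = len(masses)
    xi = []
    for i in range(n):
        lo, hi = max(i - 1, 0), min(i + 1, n - 1)
        slope = (mags[hi] - mags[lo]) / (masses[hi] - masses[lo])
        xi.append(abs(slope) * phi0[i])
    return xi


# Hypothetical toy inputs, for illustration only.
masses = [0.5, 1.0, 2.0]     # solar masses
mags = [9.0, 5.0, 1.5]       # absolute magnitude at each mass
phi = [30.0, 10.0, 1.0]      # observed luminosity function (arbitrary units)
tau_ms = [60.0, 10.0, 1.0]   # MS lifetimes in Gyr
t = 10.0                     # age of the population in Gyr

phi0 = initial_luminosity_function(phi, tau_ms, t)
xi = imf_from_luminosity_function(masses, mags, phi0)
print(phi0, xi)
```

Only the brightest bin, whose MS lifetime is shorter than the age of the population, is boosted by the correction; the fainter bins are left unchanged.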

The function $M(\mathcal{M})$ can in principle be determined from either theory or 
observation. Theoretically, we obtain $M(\mathcal{M})$ by coupling stellar-atmosphere 
models to models of MS stars of various masses and appropriate metallicities. 
This procedure leads to reliable results for most stellar masses, but 
becomes difficult and unreliable for masses $< 0.5\,\mathcal{M}_\odot$ (see D’Antona & 
Mazzitelli 1994, 1996 for details). One problem 
is that low-mass stars are slow to settle on to the MS, so one may need to 
include models that lie significantly above the MS. A more serious problem 
is that the atmospheres of cool stars have extremely rich spectra that deviate 
significantly from that of a black body; Bessell (1991) shows sample spectra. 
Simulating the complexities of these spectra, which depend not only on the 
details of chemical abundances but also on the abundances of molecules such 
as TiO and H$_2$O, is a highly complex business. Finally, for $\mathcal{M} < 0.3\,\mathcal{M}_\odot$ the 
star is fully convective, and this adds to the complexity of the modeling. 

11 See Problem 5.8 for the correction factor appropriate to an exponentially declining 
star-formation rate. 

Figure 5.11 The data points show the masses of MS binary components 
with the given absolute magnitudes $M_V$. The full curve 
was derived by a complex procedure that takes into account both 
these data and the solar-neighborhood luminosity function shown 
in Figure 3.12 (Kroupa, Tout & Gilmore 1993). The dashed curve 
shows the $\mathcal{M}(M_V)$ relation for metal-weak stars. [After Kroupa, 
Tout & Gilmore (1993) with additional data from Henry & McCarthy 
(1993) and Brewer et al. (1993)] 

In view of these complexities in the theoretical determination of $M(\mathcal{M})$, 
this relation is often obtained empirically from binary stars. Figure 5.11 
shows the empirical data, which are sparse and subject to significant obser- 
vational uncertainty. 

These uncertainties are unfortunate since we see from equation (5.13) 
that $\xi$ and $\Phi_0$ are related by the derivative of the $M(\mathcal{M})$ relation. One 
cannot hope to constrain this derivative strongly with data of the quality 
shown in Figure 5.11, especially as theory predicts that the $M(\mathcal{M})$ relation 
should reflect complex physics and therefore be itself complex. Kroupa, Tout 
& Gilmore (1990) argue that the natural procedure in these circumstances 
is to assume that $\xi(\mathcal{M})$ is a simple function and to choose $M(\mathcal{M})$ such that 
(i) it is compatible with the empirical data shown in Figure 5.11, and (ii) it 
reproduces $\Phi_0(M)$, wiggles and all. The full curve in Figure 5.11 shows the 
$M_V(\mathcal{M})$ relation that emerges from this procedure (Kroupa, Tout & Gilmore 
1993). 

Figure 5.12 The IMF from Scalo (1986). For masses $\mathcal{M} > \mathcal{M}_\odot$ 
three sets of points are shown, each set being for a different assumption 
about the ratio of the current rate of star formation to 
its average over the lifetime of the solar neighborhood. The curve 
defined by the points that are based on the assumption of a constant 
star-formation rate (squares) can be approximated by the 
three power-law segments defined in equation (5.16). 


Why should $\xi(\mathcal{M})$ be a simple function and $\Phi_0(M)$ a complicated one? 
The argument is that star formation is a chaotic process that proceeds 
through densities and temperatures that vary by orders of magnitude. It 
is unlikely that such a scrambling process bears the imprint of any particular 
length or energy scale, so $\xi(\mathcal{M})$ should be rather a featureless function. 
$\Phi_0(M)$ is, in effect, this same function reflected in the distorting mirror of 
the function $M_V(\mathcal{M})$. As we have seen, this last function is rich in structure, 
and we expect this structure to be reflected in $\Phi_0(M)$. 

The prototypical simple function is a power law, so $\xi$ is frequently assumed 
to be a simple power of $\mathcal{M}$. In a classic study Salpeter (1955) concluded 
that the then-available evidence pointed to $\xi$ having the form 

$$ \xi(\mathcal{M}) \propto \mathcal{M}^{-2.35} \quad \text{(Salpeter IMF)}. \qquad (5.14) $$

Since for a power law, 

$$ \xi \propto \mathcal{M}^{-\alpha}, \qquad (5.15) $$

equation (5.11) predicts that the total mass involved in a starburst diverges 
at low masses for $\alpha > 2$ and at high masses for $\alpha < 2$, it is not surprising 
that recent studies imply that $\xi$ tends to steepen with increasing mass, so 
that the effective value of $\alpha$ is less than 2 at small masses and greater than 
2 at large masses. Figure 5.12 shows the IMF derived by Scalo (1986). For 
masses $\mathcal{M} \gtrsim 0.2\,\mathcal{M}_\odot$, Scalo’s data can be adequately fitted by three power-law 
segments: 


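$$ \xi(\mathcal{M}) \propto \begin{cases} \mathcal{M}^{-2.45} & \text{for } \mathcal{M} > 10\,\mathcal{M}_\odot, \\ \mathcal{M}^{-3.27} & \text{for } 1\,\mathcal{M}_\odot < \mathcal{M} < 10\,\mathcal{M}_\odot, \\ \mathcal{M}^{-1.83} & \text{for } 0.2\,\mathcal{M}_\odot < \mathcal{M} < 1\,\mathcal{M}_\odot \end{cases} \quad \text{(Scalo IMF)}. \qquad (5.16) $$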


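Kroupa, Tout & Gilmore (1993) likewise advocated a three-power form for 
$\xi$, but one that falls off much more steeply at the high-mass end: 

$$ \xi(\mathcal{M}) \propto \begin{cases} \mathcal{M}^{-4.5} & \text{for } \mathcal{M} > 1\,\mathcal{M}_\odot, \\ \mathcal{M}^{-2.2} & \text{for } 0.5\,\mathcal{M}_\odot < \mathcal{M} < 1\,\mathcal{M}_\odot, \\ \mathcal{M}^{-1.2} & \text{for } \mathcal{M} < 0.5\,\mathcal{M}_\odot. \end{cases} \qquad (5.17) $$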


In §8.2.8 we shall encounter evidence that the average IMF within external 
galaxies has $\alpha \lesssim 2.5$ even at the highest masses. This result is not necessarily 
incompatible with equation (5.17), however, because the solar neighborhood 
could be deficient in massive stars relative to the Galaxy as a whole. 
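
The mass normalization of equation (5.11) and the divergence argument attached to equation (5.15) can be checked with a few lines of arithmetic. The sketch below integrates a single power law analytically. The Salpeter exponent 2.35 comes from equation (5.14), while the mass limits of 0.1 and 100 solar masses and the 8 solar-mass threshold are assumptions chosen only for illustration.

```python
import math


def mass_integral(alpha, m_lo, m_hi):
    """Integral of m * m**(-alpha) dm between m_lo and m_hi (solar masses)."""
    if math.isclose(alpha, 2.0):
        return math.log(m_hi / m_lo)
    p = 2.0 - alpha
    return (m_hi**p - m_lo**p) / p


def number_integral(alpha, m_lo, m_hi):
    """Integral of m**(-alpha) dm between m_lo and m_hi (solar masses)."""
    if math.isclose(alpha, 1.0):
        return math.log(m_hi / m_lo)
    p = 1.0 - alpha
    return (m_hi**p - m_lo**p) / p


ALPHA = 2.35               # Salpeter exponent, equation (5.14)
M_LO, M_HI = 0.1, 100.0    # assumed mass limits in solar masses

total_mass = mass_integral(ALPHA, M_LO, M_HI)

# With xi normalized as in equation (5.11), this is the number of stars
# formed per solar mass of star formation.
stars_per_solar_mass = number_integral(ALPHA, M_LO, M_HI) / total_mass

# Fraction of the mass locked in stars above an assumed 8 solar masses.
massive_fraction = mass_integral(ALPHA, 8.0, M_HI) / total_mass

# For alpha > 2 the total mass keeps growing as the lower limit shrinks.
growth_low = mass_integral(ALPHA, 1e-4, M_HI) / total_mass
# For alpha < 2 it keeps growing as the upper limit is raised instead.
growth_high = mass_integral(1.5, M_LO, 1e6) / mass_integral(1.5, M_LO, M_HI)

print(stars_per_solar_mass, massive_fraction, growth_low, growth_high)
```

The last two ratios illustrate why a single power law cannot hold at all masses: the integral is dominated by whichever limit the exponent leaves unprotected.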

Note that $\xi(\mathcal{M})$ continues to increase with declining mass down to the 
smallest masses for which reliable data are available; the peak in $\Phi(M)$ 
at faint magnitudes seen in Figure 3.12 arises because the luminosity of a 
MS star of mass $\mathcal{M}$ falls precipitately as the minimum mass for hydrogen 
burning, $\mathcal{M} \sim 0.08\,\mathcal{M}_\odot$, is approached, with the result that the stars with 
masses in a small range near $0.1\,\mathcal{M}_\odot$ are strung out over a huge interval 
in absolute magnitude $M$. Unfortunately, neither observation nor theory is 
able to determine with any accuracy the function $M(\mathcal{M})$ that describes this 
distribution, so it is currently impossible to make any definitive statement 
about the form of $\xi(\mathcal{M})$ for $\mathcal{M} \lesssim 0.2\,\mathcal{M}_\odot$. 


5.1.10 Pulsating stars 


All stars are to a greater or lesser degree variable because stars tend to 
oscillate, and these oscillations give rise to variations in luminosity. The am- 
plitude of a star’s oscillations is usually very small, so that the associated 
variations in luminosity are tiny — solar-type stars vary in luminosity by of 
order a micro-magnitude. The study of the small-amplitude stellar oscilla- 
tions, or asteroseismology, has become an important branch of astronomy 
because the oscillations involve the entire star, so from them one can glean 
information about the deep stellar interior. However, we focus here on a 
minority of stars that oscillate with much larger amplitudes, since the study 
of these stars has had a profound influence on our understanding of Galactic 
structure and cosmology. Readers who wish to learn about asteroseismology 
are referred to Brown & Gilliland (1994). 

Like a bell, a star is capable of oscillating at a number of discrete 
frequencies, the frequencies of its normal modes. The normal modes of a star 
fall into two broad classes, p-modes and g-modes. For our purposes it suffices 
to focus on spherical p-modes. A p-mode involves the star’s atmosphere 
breathing in and out, and is associated with substantial pressure fluctuations, 
which is the origin of the ‘p’ in the name of this class of modes.¹² When a 
star oscillates in its fundamental p-mode, all points in the atmosphere move 
in and out together and there is no point at which the pressure fluctuations 
vanish throughout an oscillation. By contrast, when the star oscillates in its 
first overtone mode, some points in the atmosphere move inwards while 
others are moving outwards, and there is a radius within the star at which 
the pressure fluctuations vanish throughout an oscillation. Very crudely, the 
fundamental p-mode is a radially propagating sound wave that is confined 
within the star with half its wavelength equal to the stellar diameter, while 
the first overtone mode is a similar sound wave that has three halves of its 
wavelength equal to the diameter of the star.¹³ The light curves of stars that 
pulsate in the first overtone mode tend to be rather sinusoidal, while those 
of fundamental-mode pulsators are frequently of larger amplitude and highly 
asymmetrical in form. 

Whereas low-amplitude stellar oscillations such as those observed in 
the Sun are excited by the random pressure and velocity fluctuations that 
are associated with convection, most large-amplitude pulsations of stars are 
excited by an instability, the $\kappa$-mechanism, which was identified by Eddington. 
The physics underlying this mechanism is that the opacity, $\kappa$, of 
material can increase rather than decrease when the material is heated. As 
the following argument shows, this behavior enables the atmosphere of a star 
to function as a heat engine. 

At one point in the cycle a layer of material loses support against the 
star’s gravity and falls downwards. This downward motion tends to compress 
the layer, with the result that it heats up and becomes more opaque to 
radiation. Since radiation now diffuses more slowly through the layer, heat 
builds up below it. In consequence of this build-up of heat, the pressure rises 
below the layer and pushes the layer back outwards. As it moves upwards, 
the layer expands, cools, and becomes more transparent to radiation. This 
increased transparency allows energy to escape freely from below the layer, 
with the result that the layer loses pressure support. The layer now falls back 
downwards and the cycle repeats. In effect, the stellar envelope behaves like 
a steam engine, with radiation playing the role of steam, the layer the role of 
the piston, and the anomalous behavior of the opacity the role of the valve. 

12 The ‘g’ of g-mode stands for ‘gravity’. 
13 The pressure fluctuations must vanish at the stellar edge and be maximal at the 
centre, where there must be zero displacement. 

Using an order-of-magnitude analysis, which in essence argues that the 
pulsation period should be of the order of the time required for a sound wave 
to move through the star (see Problem 5.6), one can show that a relationship 
of the form 

$$ P \left( \frac{\bar\rho}{\bar\rho_\odot} \right)^{1/2} = Q \qquad (5.18) $$

exists. Here, $P$ is the period, $\bar\rho$ denotes the mean density of the star, and 
$Q$, the pulsation constant, is a number of order $(G\bar\rho_\odot)^{-1/2} \simeq 1\,$hr. A 
simple analysis shows that at constant effective temperature, $T_{\rm eff}$, the mean 
density of a star decreases with increasing luminosity (see Problem 5.7). 
Hence, if we apply equation (5.18) to stars of similar $T_{\rm eff}$, we predict that 
they should obey a period-luminosity relation (PL relation) in the sense 
that more luminous stars should have longer pulsation periods. Henrietta 
Leavitt (1912) discovered that the Cepheids obey just such a relation. This 
was a major breakthrough because if the PL relation could be calibrated 
properly (see Box 5.5) one could deduce the distance to any Cepheid simply 
by determining its period, inferring its luminosity from the PL relation and 
comparing this with its apparent brightness. 
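
The order of magnitude quoted for the pulsation constant, and the periods that equation (5.18) then implies, are easy to verify. The sketch below does so with standard values of the physical constants. The example star of 5 solar masses and 50 solar radii is a hypothetical supergiant chosen for illustration, and since $Q$ is only "a number of order" $(G\bar\rho_\odot)^{-1/2}$, the result should be read as an order-of-magnitude estimate.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30   # solar mass, kg
R_SUN = 6.957e8    # solar radius, m


def solar_mean_density():
    """Mean density of the Sun in kg m^-3."""
    return M_SUN / (4.0 / 3.0 * math.pi * R_SUN**3)


def pulsation_constant_hours():
    """Order-of-magnitude estimate Q = (G * rho_sun)**(-1/2), in hours."""
    return 1.0 / math.sqrt(G * solar_mean_density()) / 3600.0


def pulsation_period_hours(mass_msun, radius_rsun, q_hours=None):
    """Equation (5.18) solved for P, with the mean density in solar units."""
    if q_hours is None:
        q_hours = pulsation_constant_hours()
    density_ratio = mass_msun / radius_rsun**3
    return q_hours / math.sqrt(density_ratio)


q = pulsation_constant_hours()
# Hypothetical supergiant: 5 solar masses, 50 solar radii.
period_days = pulsation_period_hours(5.0, 50.0) / 24.0
print(q, period_days)
```

For these assumed parameters the period comes out at several days, the same order as the shortest classical Cepheid periods, while a star of solar mean density would pulsate with a period of about an hour.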

As Figure 5.1 shows, there is an entire strip of the CM diagram, the 
instability strip, within which stars pulsate. The $\kappa$ mechanism drives pulsations 
in stars that lie within this strip because in these stars the ionization of 
He$^+$ to He$^{++}$ causes $\kappa$ to increase with temperature at radii that are strategically 
placed for the excitation of the fundamental and/or first overtone 
modes. Stars in the instability strip play a central role in astronomy because 
their very distinctive light variations allow them to be identified unambiguously 
even in a crowded field of stars, and their absolute magnitudes can be 
determined from their periods. A great deal of information regarding both 
the physical principles and applications of pulsating stars may be found in 
Cacciari & Clementini (1990) and Gautschy & Saio (1995, 1996). Here we 
summarize the properties of the most important stars that lie within this 
instability strip. We work from the top of the strip downwards. 


Classical Cepheid variables  Stars with $\mathcal{M} \gtrsim 3\,\mathcal{M}_\odot$ make excursions to 
the left in the CM diagram after reaching the RGB (see Figure 5.2). More 
massive stars move further to the left, and a star with $\mathcal{M} \gtrsim 5\,\mathcal{M}_\odot$ is liable 
to pass into the classical instability strip on such an excursion and become a 
classical Cepheid variable. The lines in Figure 5.13 indicate the extent of the 
instability strip, as predicted by the models of Chiosi, Wood & Capitanio 
(1993). Notice that the strip is predicted to become wider with brightening 
absolute magnitude $M_V$. The dots in Figure 5.13 give the location in the 
CM diagram of the 98 Galactic Cepheids with well determined reddening. 
All fall in or near the theoretically predicted instability strip. 

Strictly, the sobriquet ‘classical’ is necessary to distinguish classical 
Cepheids from W Virginis stars, which are often called ‘Population II Cepheids’ 
and are discussed below.¹⁴ We shall often drop the ‘classical’ on the 
understanding that a plain Cepheid variable is a classical Cepheid. Since 
Cepheids are massive, they are young stars and tend to lie in spiral arms 
within the Galactic plane (§10.4.5). 

14 Classical Cepheids are also called $\delta$ Cephei stars. 

Box 5.5: How the Universe Doubled in Size During 1952 

The Cepheid PL relation discovered by Leavitt (1912) was a linear correlation 
between log P and the apparent magnitudes of Cepheids in the 
Small Magellanic Cloud. Before this relation could be exploited for distance 
estimation, it had to be expressed in terms of absolute magnitudes 
by determining the zero-point. This problem has an interesting history 
(see Struve & Zebergs 1962, Chapter 15). 

No Cepheid was close enough to the Sun to have a measurable 
trigonometric parallax, so for the calibration Shapley used the statistical 
parallaxes (see §2.2.4) of Cepheids. Unfortunately, interstellar absorption, 
which is always significant along lines of sight to Cepheids, was 
ignored, so the estimated Cepheid luminosities were much too small. 
This problem was compounded by the assumption that Cepheids have 
the same properties as W Virginis stars in globular clusters. With this 
assumption it was found that the RR Lyrae stars and Cepheids all fell 
along a single apparently well-defined period-luminosity law. This harmony 
lulled astronomers into a false sense of security about the accuracy 
of the results. 

The problem was not clearly resolved until 1952 when Baade discovered 
that 200-inch telescope photographs of M31 failed to reveal RR 
Lyrae variables in the globular clusters at the magnitude predicted by 
the then-adopted distance scale, but instead showed only the brightest 
cluster giants. Inasmuch as the absolute magnitudes of the RR Lyrae 
stars had been determined independently from statistical parallaxes and 
cluster CM diagrams, he realized that the error lay in the absolute magnitudes 
of the Cepheids, which had been set about 1.5 mag too faint. 
Making this revision doubled the estimated distance to, and the size 
of, M31 (making it comparable in size to our Galaxy), and the revision 
brought its globular-cluster luminosities into line with those in our Galaxy. 
Furthermore, it doubled the estimated size of the Universe, because 
the extragalactic distance scale had been calibrated using Cepheids (see 
Chapter 7). 
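
The doubling described in Box 5.5 follows directly from the definition of the distance modulus. As a short worked step, neglecting absorption and writing $\Delta M = M_{\rm new} - M_{\rm old} = -1.5$ for the revision of the Cepheid zero-point at fixed apparent magnitude $m$:

```latex
m - M = 5\log_{10}\!\left(\frac{d}{10\,\mathrm{pc}}\right)
\quad\Longrightarrow\quad
\frac{d_{\rm new}}{d_{\rm old}} = 10^{-\Delta M/5} = 10^{1.5/5} = 10^{0.3} \simeq 2.0 .
```

A zero-point that is 1.5 mag too faint therefore underestimates every Cepheid-based distance by a factor of about two.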

Classical Cepheid light curves are strictly periodic. In the B band they 
usually show an abrupt rise by $\sim 1$ mag (in about 20% of the period) followed 
by a slower decline (over about 50% of the period). At infrared wavelengths 
their light curves show smaller-amplitude and more nearly sinusoidal variations. 
At maximum light (phase = 0.0), the star has its highest temperature, 
earliest spectral type, and greatest outward radial velocity; at minimum light 
(phase $\simeq$ 0.75), it has its lowest temperature, latest spectral type, and greatest 
inward velocity. The star achieves its maximum radius on the descending 
part of the light curve (phase $\simeq$ 0.4) and its minimum radius shortly after 
the next light-rise begins. The absolute magnitudes of Cepheids lie in the 
range $-6 \lesssim M_V \lesssim -2$ and their periods $P$ range from several days to some 
hundreds of days. 

Figure 5.13 The Cepheid instability strip in the CM diagram. The points 
represent the positions of individual Galactic Cepheids from Fernie 
(1990). The lines indicate the extent of the instability strip as calculated 
from stellar pulsation models (Chiosi, Wood & Capitanio 1993). Each line 
marks the locus of Cepheids with a … 

In many Cepheids both the fundamental and the first overtone mode 
are excited. Two independent mass estimates can be made for these stars: 
stellar-evolution theory provides an estimate from the star’s location within 
the CM diagram, while pulsation theory provides an independent estimate 
from the frequencies at which the star oscillates. For many years these two 
estimates tended to conflict, but the introduction of more accurate opacities 
in the early 1990s resolved these discrepancies. 

As we saw above, to a first approximation Cepheids satisfy a period-luminosity 
(PL) relation that reflects the variation with mass $\mathcal{M}$ of the frequency 
of the fundamental mode. Since the detailed structure of a star 
depends on its metallicity, it is evident that the PL relation should depend 
on metallicity. Moreover, since the color of a star of given mass varies with 
both temperature and metallicity, it follows that in a more precise treat- 
ment the periods, luminosities and colors of Cepheids should obey a period- 
luminosity-color, or PLC, relation. The non-zero slope of lines of constant 
P in Figure 5.13 quantifies this expectation that P should be a function of 
color as well as luminosity. 

The empirical determination of the PLC relation is problematic, how- 
ever, because Cepheids occur in regions of significant interstellar absorption. 
Hence non-negligible reddening corrections to colors are required, and it is 
difficult to make these with sufficient accuracy. Moreover, the full PLC relation 
proves to be a non-linear function of $P$, $M_V$ and $(B-V)_0$ (see Caldwell 
& Coulson 1986 for details). 

On account of the complexity of the PLC relation, recent work has concentrated 
on PL relations at infrared wavelengths. Observations of Cepheids 
at longer wavelengths have several advantages. The two most important of 
these are: 

1. In the IR reddening corrections are small, and their uncertainties are 
   negligible. 
2. The PL relation is both tighter and steeper in the IR than in B. Hence 
   observations in the K band lead to very much smaller errors in L for a 
   given error in P than do observations in the B band. 


Since Cepheid light curves have smaller-amplitude variations in the IR than 
at optical wavelengths, IR observations typically lead to larger errors in 
P. On the other hand, the smallness of the fluctuations in an IR band 
such as K reduces the uncertainty in the estimate of the period-average of 
the apparent magnitude that one obtains from a small number of randomly 
phased measurements. Optimal results are obtained by measuring in B or 
V as well as in K so that the period can be accurately determined in B or 
V while the apparent magnitude is determined in K. A detailed discussion 
of the relative benefits of optical and infrared observations of Cepheids can 
be found in §3 of Jacoby et al. (1992). 

Feast & Walker (1987) have collated the results from a large number of 
PL determinations, and they find a best estimate for the V-band absolute 
magnitude of a Cepheid of period P to be given by 


$$ \langle M_V \rangle = -2.78 \log(P/10\,{\rm d}) - 4.13 \quad \text{with scatter} \sim 0.3\,{\rm mag}, \qquad (5.19) $$


where angled brackets again indicate the magnitude obtained by averaging 
the flux over a period. From a study of Cepheids in the Magellanic Clouds, 
Madore & Freedman (1991) find these PL relations: 


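$$ \begin{aligned}
\langle M_B \rangle &= -(2.43 \pm 0.14)\log(P/10\,{\rm d}) - 3.50 \pm 0.06 && \text{with scatter} \sim 0.36\,{\rm mag} \\
\langle M_V \rangle &= -(2.76 \pm 0.11)\log(P/10\,{\rm d}) - 4.16 \pm 0.05 && \text{with scatter} \sim 0.27\,{\rm mag} \\
\langle M_R \rangle &= -(2.94 \pm 0.09)\log(P/10\,{\rm d}) - 4.52 \pm 0.04 && \text{with scatter} \sim 0.22\,{\rm mag} \\
\langle M_I \rangle &= -(3.06 \pm 0.07)\log(P/10\,{\rm d}) - 4.87 \pm 0.03 && \text{with scatter} \sim 0.18\,{\rm mag} \\
\langle M_K \rangle &= -(3.42 \pm 0.09)\log(P/10\,{\rm d}) - 5.70 \pm 0.04 && \text{with scatter} \sim 0.13\,{\rm mag}.
\end{aligned} \qquad (5.20) $$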
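
To show how relations of this kind are used, the sketch below turns a period and a period-averaged apparent magnitude into a distance, using the central values of the coefficients in equation (5.20) and ignoring their quoted uncertainties. The function names, the example apparent magnitude, and the treatment of absorption as a single number supplied by the user are assumptions of this illustration.

```python
import math

# Band: (slope, zero point, scatter in mag), central values of equation (5.20).
PL_COEFFS = {
    "B": (-2.43, -3.50, 0.36),
    "V": (-2.76, -4.16, 0.27),
    "R": (-2.94, -4.52, 0.22),
    "I": (-3.06, -4.87, 0.18),
    "K": (-3.42, -5.70, 0.13),
}


def cepheid_abs_mag(period_days, band="V"):
    """Period-averaged absolute magnitude from the PL relation in one band."""
    slope, zero_point, _ = PL_COEFFS[band]
    return slope * math.log10(period_days / 10.0) + zero_point


def distance_pc(apparent_mag, absolute_mag, absorption=0.0):
    """Distance from the distance modulus m - A - M = 5 log10(d / 10 pc)."""
    return 10.0 ** ((apparent_mag - absorption - absolute_mag + 5.0) / 5.0)


def fractional_distance_error(sigma_mag):
    """First-order fractional distance error for a magnitude error sigma_mag."""
    return math.log(10.0) / 5.0 * sigma_mag


# Hypothetical Cepheid: P = 10 d, <V> = 10.84, no absorption.
m_v = cepheid_abs_mag(10.0, "V")
d = distance_pc(10.84, m_v)
err_v = fractional_distance_error(PL_COEFFS["V"][2])
err_k = fractional_distance_error(PL_COEFFS["K"][2])
print(m_v, d, err_v, err_k)
```

The scatter of each relation sets a floor on the distance error for a single star: roughly 12% in V against 6% in K, which is one way of seeing the advantage of the infrared.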


Mira variables  As they approach the top of the AGB, many stars become 
Mira variables (see Table 3.5). Although Miras lie a little to the red of 
the instability strip, their long-period ($\gtrsim 80\,$d), large-amplitude pulsations 
are thought to be driven by the $\kappa$ mechanism within zones of H and He$^+$ 
ionization just as Cepheid pulsations are. However, their pulsations are 
not well understood because their atmospheres are highly convective, and 
there is no definitive theory of pulsations in the presence of convection. In 
particular, it is unclear whether it is the fundamental or first overtone which 
is the excited mode in these stars (Gautschy & Saio 1996). 

Figure 5.14 The distribution with respect to apparent magnitude of RR 
Lyrae variables in a field just south of the Galactic center. The main 
peak is associated with RR Lyraes in the Galactic bulge, and the subsidiary 
peak at fainter magnitudes is associated with stars in the Sagittarius 
Dwarf Galaxy, which lies behind the Galactic center. [After Alard 
(1996) from data kindly supplied by … 

The periods of Miras increase with luminosity; Feast et al. (1989) have 
determined the PL relation for Miras in the LMC. For a distance to the LMC 
of 50 kpc they find 


$$ \langle M_K \rangle = -(3.57 \pm 0.16) \log(P/1\,{\rm d}) + 1.21 \pm 0.39. \qquad (5.21) $$
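
Because this zero-point rests on an adopted LMC distance of 50 kpc, it shifts if a different distance is preferred. The sketch below evaluates the central values of equation (5.21) and applies the corresponding shift of $-5\log(d/50\,{\rm kpc})$ magnitudes. The function name and the alternative distance of 55 kpc are hypothetical, chosen only to show the size of the effect.

```python
import math

D_LMC_ADOPTED_KPC = 50.0  # LMC distance assumed in equation (5.21)


def mira_abs_mag_k(period_days, d_lmc_kpc=D_LMC_ADOPTED_KPC):
    """K-band absolute magnitude of a Mira from equation (5.21).

    If a different LMC distance is supplied, the zero point is shifted so
    that the apparent magnitudes of the LMC Miras stay unchanged.
    """
    m_k = -3.57 * math.log10(period_days) + 1.21
    return m_k - 5.0 * math.log10(d_lmc_kpc / D_LMC_ADOPTED_KPC)


m_k_300 = mira_abs_mag_k(300.0)
# Hypothetical alternative LMC distance of 55 kpc.
shift = mira_abs_mag_k(300.0, d_lmc_kpc=55.0) - m_k_300
print(m_k_300, shift)
```

A 10% increase in the adopted LMC distance makes every Mira about 0.2 mag more luminous, which is comparable to the uncertainties quoted on the coefficients themselves.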


Within the Galaxy, the periods of Miras are correlated with their kinematics 
in the sense that longer-period Miras are more strongly confined to the plane 
(Jura & Kleinmann 1992). Since hundreds of millions of years are required 
for a star to change its orbit significantly, and Miras certainly do not live so 
long, this correlation implies that any change in the period of a given Mira 
during its lifetime must be small. 


W Virginis stars As a low-mass, metal-poor star evolves away from the 
HB on the exhaustion of He in its core, it may enter the instability strip 
and become a Population II Cepheid. The prototype object of this class 
is W Virginis, so Population II Cepheids are frequently called W Virginis 
stars — we shall use this name to avoid confusion with the astronomically 
more important classical Cepheids. Since they are halo stars (see §10.5), 
they are found at high Galactic latitudes and in globular clusters, and they 
are concentrated toward the Galactic center. 

These stars are physically much more closely related to RR Lyrae stars 
than to classical Cepheids. However, their periods, $0.8\,{\rm d} \lesssim P \lesssim 30\,{\rm d}$, are 
similar to those of classical Cepheids. They do not satisfy as tight a PL 
relation as do the classical Cepheids. Gautschy & Saio (1996) discuss W 
Virginis stars in some detail and give many references to the literature. 


RR Lyrae stars  As we noted in §5.1.2, RR Lyraes are HB stars that 
happen to lie in the instability strip; their spectral types are centered around 
A. They are always old, relatively metal-poor stars with periods $P \sim 0.5\,$d. 
Their current masses are thought to be $\sim 0.8\,\mathcal{M}_\odot$, while their original MS 
masses must have been $\sim 1\,\mathcal{M}_\odot$. 

RR Lyrae stars with periods $\gtrsim 0.5\,$d and $\Delta S \gtrsim 5$ [see equation (5.22) 
below] are commonly found in globular clusters, in the Galactic halo, and 
concentrated toward the Galactic center. Hence they are halo stars. RR 
Lyrae stars with periods $< 0.4\,$d and $\Delta S < 5$ are concentrated to the Galactic 
plane and are old disk stars. 

Although RR Lyrae stars are much fainter than Cepheids, in regions of 
low interstellar absorption (‘Baade’s windows’ — see §10.2) they can be seen 
all the way to the Galactic center. In fact, surveys designed to identify grav- 
itational microlensing events (§2.4) have recently detected large numbers of 
RR Lyraes along with many other types of variable star (Alcock et al. 1995a). 
The numbers of RR Lyraes detected in windows on the Galactic center peak 
at a definite apparent magnitude and then fall off — see Figure 5.14. These 
observations can be used to estimate the Sun’s distance from the Galactic 
center (see §10.6). Moreover, because RR Lyrae stars are so readily identi- 
fied, they can be sampled in a given field to a high level of completeness, and 
hence they can be used in stellar-density analyses of the spheroidal compo- 
nent of our Galaxy (see §10.5.2). Finally, RR Lyrae stars are typically only 
2 to 3mag fainter than the brightest stars in globular clusters. Thus they 
are relatively easy to observe, in addition to being easily identified. Because 
they have well-defined absolute magnitudes, they provide good distance in- 
dicators for clusters, and they allow us to avoid having to observe down to 
the cluster MS (which is 5 mag fainter). 

RR Lyrae stars are classified into Bailey types a, b, c, and d according 
to the shapes of their light curves. Classes a and b have been amalgamated 
into one class, RR$_{ab}$. These stars have asymmetric (Cepheid-like) 
light curves with periods $P_{ab} \gtrsim 0.4\,$d and an average period $\langle P_{ab} \rangle = 0.55\,$d. 
In general, these stars have large amplitude luminosity variations, 
$0.5\,{\rm mag} \lesssim \Delta m < 1.5\,{\rm mag}$, and pulsate in the fundamental mode. The RR$_c$ variables 
have symmetric (practically sinusoidal) light curves with smaller amplitudes, 
$\Delta m < 0.5\,$mag, and shorter periods, $P_c < 0.4\,$d (average $\langle P_c \rangle \simeq 0.3\,$d). These 
stars pulsate in the first-overtone mode. In RR$_d$ variables both the fundamental 
and the first overtone modes are excited. A substantial majority of 
RR Lyrae stars are RR$_{ab}$ with the rest divided about equally between RR$_c$ 
and RR$_d$. The RR$_{ab}$ and most RR$_d$ variables are halo stars, while the RR$_c$ 
are old disk stars (see also §10.5). 

Because most RR Lyrae stars are metal poor, their metallic line spectral 
types (classified by the standard criteria) tend to be earlier than the spectral 
types appropriate to their hydrogen lines. Preston (1959) introduced the 
parameter 

$$ \Delta S = 10\,[\text{spectral type (H lines)} - \text{spectral type (Ca II K lines)}], \qquad (5.22) $$

where the spectral types are measured at minimum light in tenths of a spectral 
class. This parameter correlates closely with the metal content of a star; 
$\Delta S = 0$ for stars of near solar abundances and rises to $\Delta S \sim 10$ to 12 for the 
most extreme metal-poor stars. $\Delta S$ has been calibrated in terms of [Fe/H] 
by means of curve-of-growth analyses of field RR Lyrae stars (Butler 1975). 
A very tight correlation, 

$$ [{\rm Fe/H}] \simeq -0.16\,\Delta S - 0.23, \qquad (5.23) $$

is found. This relation can be applied to cluster variables to estimate metal 
abundances in globular clusters. 

There are essentially five independent methods of estimating $M_V$ for RR 
Lyraes: (1) from the statistical parallaxes (see §2.2.4) of field RR Lyrae stars 
(Layden et al. 1996); (2) by the calibration of globular-cluster CM diagrams 
by MS fitting; (3) by combining the apparent magnitudes of RR Lyrae stars 
in the Magellanic Clouds with the known distance to the Clouds (§7.4.1); (4) 
the Baade—Wesselink method that is discussed in §7.2.1; and (5) by matching 
observations to the predictions of complete stellar models. Each method 
has advantages and disadvantages; the statistical parallaxes suffer from any 
errors present in the proper motions and from the intrinsic spread produced 
by variations in metal content among the stars. The problems with MS fitting 
of cluster CM diagrams are discussed in §6.1.2. Clearly, calibration from RR 
Lyraes in the Clouds is reliable only if (i) the RR Lyraes in the Clouds are 
identical to their counterparts in our Galaxy, and (ii) the distance to the 
Clouds can be reliably determined without reference to Galactic RR Lyraes. 
Both the Baade—Wesselink method and the use of complete stellar models 
can be compromised by errors in stellar-structure calculations, including poor 
values for important opacities. 

RR Lyrae stars have absolute magnitudes in the range $0.5 \lesssim M_V \lesssim 1$, 
with a population-weighted mean value of $\sim 0.6$. The observed absolute 
magnitudes correlate quite closely with metallicity [Fe/H]. Jones et al. (1992) 
find 

$$ \langle M_V \rangle = (0.16 \pm 0.03)\,[{\rm Fe/H}] + 1.02 \pm 0.03, \qquad (5.24) $$


where angled brackets again indicate the magnitude obtained by averaging 
the flux over a period. Thus, if the metallicity of an RR Lyrae can be de- 
termined from the star’s spectrum, the star’s absolute magnitude can be 
predicted to within ~ 0.15 mag, and its distance then follows from its appar- 
ent magnitude to better than 10%. 
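
The chain of reasoning above, from spectral index to metallicity to absolute magnitude to distance, can be sketched numerically. The following is only a minimal illustration: it uses the central values of equations (5.23) and (5.24), assumes negligible interstellar extinction, and the function names and input values are hypothetical.

```python
import math

def feh_from_delta_s(delta_s):
    """[Fe/H] from the Delta-S index, central values of equation (5.23)."""
    return -0.16 * delta_s - 0.23

def mean_mv_from_feh(feh):
    """Flux-averaged absolute V magnitude, central values of equation (5.24)."""
    return 0.16 * feh + 1.02

def distance_pc(apparent_mag, absolute_mag):
    """Distance in parsecs from the distance modulus (extinction ignored: an assumption)."""
    return 10.0 ** ((apparent_mag - absolute_mag + 5.0) / 5.0)

def fractional_distance_error(sigma_mag):
    """Fractional distance error implied by a magnitude error sigma_mag."""
    return math.log(10.0) / 5.0 * sigma_mag

feh = feh_from_delta_s(8.0)            # hypothetical metal-poor star
m_v = mean_mv_from_feh(feh)
print(f"[Fe/H] = {feh:.2f}, <M_V> = {m_v:.2f}")
print(f"d = {distance_pc(15.5, m_v):.0f} pc for an assumed <m_V> = 15.5")
print(f"0.15 mag -> {100 * fractional_distance_error(0.15):.1f}% in distance")
```

Since the fractional distance error is (ln 10 / 5) times the magnitude error, an uncertainty of 0.15 mag corresponds to roughly 7% in distance, consistent with the figure of better than 10% quoted above.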

As with Cepheids, the dependence of the PL relation on metallicity be- 
comes less important at infrared wavelengths. Longmore et al. (1990) showed 
that the K-band magnitudes of RR Lyraes in a given globular cluster are 
extremely tightly correlated with P. Moreover, the form of this correlation 
was the same in all the clusters studied, independent of their metallicities. 
This suggests that RR Lyraes satisfy a universal K-band PL relation.
Calibration of the relation using the Baade-Wesselink method (§7.2.1) yields
(Jones et al. 1992) 


$$\langle M_K \rangle = -(2.3 \pm 0.2)\log(P/1\,\mathrm{d}) - 0.88 \pm 0.06. \tag{5.25}$$

This relationship offers a very useful distance measure: the pulsation period
is straightforward to measure, and the observed K-band flux is almost
unaffected by obscuration. However, even with current sensitive infrared 
detectors, RR Lyrae stars are hard to detect in external galaxies, and they 
have yet to be used as distance estimators beyond the Local Group. 

Equations (5.24) and (5.25) show that the luminosities of RR Lyraes are
correlated with both period and metallicity. These two-variable correlations
are projections of the full three-variable correlation (Jones et al. 1992)


$$\langle M_K \rangle = -(2.0 \pm 0.3)\log(P/1\,\mathrm{d}) + (0.06 \pm 0.04)[\mathrm{Fe/H}] - 0.7 \pm 0.1. \tag{5.26}$$
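
As a rough check that the K-band relations (5.25) and (5.26) agree with one another, they can be evaluated side by side. This sketch uses only the central coefficients and ignores the quoted uncertainties; the period and metallicity are hypothetical values chosen to resemble a typical cluster variable.

```python
import math

def mk_from_period(period_days):
    """<M_K> from the period alone, central values of equation (5.25)."""
    return -2.3 * math.log10(period_days) - 0.88

def mk_from_period_and_feh(period_days, feh):
    """<M_K> from period and metallicity, central values of equation (5.26)."""
    return -2.0 * math.log10(period_days) + 0.06 * feh - 0.7

period, feh = 0.5, -1.5   # hypothetical star: P in days, [Fe/H] in dex
print(f"(5.25): <M_K> = {mk_from_period(period):.3f}")
print(f"(5.26): <M_K> = {mk_from_period_and_feh(period, feh):.3f}")
```

For these inputs the two predictions differ by far less than the quoted uncertainties, which illustrates how weak the metallicity dependence is in the K band.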


5.2 Synthesis of the chemical elements 


One of the most notable triumphs of theoretical astrophysics has been the 
provision of a convincing explanation of the origin of the chemical elements. 
Hydrogen is primordial, and the majority of the He was synthesized during 
the first three minutes of the Big Bang, but nearly everything else has been 
manufactured by stars. Hence the chemical compositions of stars and the 
interstellar medium provide important information about how many stars 
of different types have been active during the life of the Milky Way and 
other galaxies. In this section we review our present understanding about 
the amounts of heavy elements that stars of different types return to the 
ISM. It is helpful to start by recalling some basic nuclear physics. 


5.2.1 Basic nuclear physics 


An atomic nucleus may be considered to be a bound state of a certain num- 
ber Z of protons and a certain number N of neutrons. Hence each nucleus 
occupies a certain point in the (Z, N) plane, and each element is associated
with a column of fixed Z in this plane. Since it is sometimes useful to distinguish
between points in the (Z, N) plane and a nucleus as a physical entity,
we shall refer to a point in the (Z, N) plane as a nuclide, reserving ‘nucleus’
for a physical object. The atomic number (more commonly called the mass
number) of a nuclide is A = Z + N, and
one conventionally specifies a particular nuclide by preceding the chemical
symbol associated with the Zth element in the periodic table of the elements
with a superscript A: for example, $^{16}$O has A = 16 and Z = 8 and therefore
lies at the point (8, 8) in the (Z, N) plane.

Only a few of the points in the (Z, N) plane are associated with bound 
nuclear states, and these lie in a band, the stability band, that runs roughly 
parallel to, but centered just above, the line N = Z. Many of these bound
states are liable to spontaneous radioactive decay into other states; Figure 5.15
shows the diagonal motions within the (Z, N) plane of a nucleus
that experiences either $\beta^{\pm}$-decay (emission of an electron or positron) or
$\alpha$-decay (emission of an $\alpha$-particle; i.e., a He nucleus).

Figure 5.15 Motion in the (Z, N) plane due to radioactive decay.

The binding energy per proton or neutron of nuclides, $\epsilon$, varies systematically
within the (Z, N) plane. As one moves across the stability band
along a line A = Z + N = constant, the absolute value of $\epsilon$ first increases
and then decreases. Thus if we imagine each nuclide to lie a distance $|\epsilon|$
below the (Z, N) plane, the stability band takes the form of a valley, and
hence is often called the stability valley. The bottom of this valley slopes
from each end towards the middle. The deepest point in the valley, $^{56}$Fe,
lies 8.79 MeV below H at the valley’s bottom left end, and 1.21 MeV below
$^{238}$U at the valley’s top right end. Stars derive much of the energy that they
radiate by shifting their protons and neutrons down the valley from H and
He towards Fe.
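
The shape of the valley can be illustrated with the semi-empirical (liquid-drop) mass formula. This formula is not used in the text and is brought in here purely as an assumption; the coefficients are one commonly quoted set, and published fits differ slightly. The sketch shows that along a line of constant A the binding energy per nucleon rises to a maximum and then falls.

```python
import math

# Illustrative liquid-drop coefficients in MeV (an assumption; fits vary).
A_VOL, A_SURF, A_COUL, A_ASYM, A_PAIR = 15.75, 17.8, 0.711, 23.7, 11.18

def binding_energy_per_nucleon(z, n):
    """Approximate binding energy per nucleon (MeV) of the nuclide (Z, N)."""
    a = z + n
    pairing = 0.0
    if a % 2 == 0:
        # Even-even nuclei gain binding energy, odd-odd nuclei lose it.
        sign = 1.0 if z % 2 == 0 else -1.0
        pairing = sign * A_PAIR / math.sqrt(a)
    total = (A_VOL * a
             - A_SURF * a ** (2.0 / 3.0)
             - A_COUL * z * (z - 1) / a ** (1.0 / 3.0)
             - A_ASYM * (a - 2 * z) ** 2 / a
             + pairing)
    return total / a

def most_bound_z(a):
    """Proton number that maximizes the binding energy at fixed A."""
    return max(range(1, a), key=lambda z: binding_energy_per_nucleon(z, a - z))

for z in range(23, 30):
    print(z, 56 - z, round(binding_energy_per_nucleon(z, 56 - z), 3))
print("most bound Z at A = 56:", most_bound_z(56))
```

With these coefficients the maximum at A = 56 falls at Z = 26, that is at Fe, with a value close to the 8.79 MeV quoted above; the exact numbers depend on the adopted coefficients.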

It proves useful to identify a number of groups of nuclides within the 
(Z, N) plane. Since it is generally only possible to distinguish different iso- 
topes of the same element within the solar system, elements are frequently 
assigned to the same group as their commonest isotope. 

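$^{20}$Ne, $^{24}$Mg, $^{28}$Si, $^{32}$S, $^{36}$Ar and $^{40}$Ca comprise the $\alpha$ nuclides. They owe
their name to the fact that they can be formed by adding 1, 2, 3, ... $\alpha$ particles
to $^{16}$O.¹⁵ These nuclides form both during C- and O-burning by reactions
such as

$$\begin{aligned}
{}^{12}\mathrm{C} + {}^{12}\mathrm{C} &\to {}^{20}\mathrm{Ne} + {}^{4}\mathrm{He} \\
{}^{16}\mathrm{O} + {}^{16}\mathrm{O} &\to {}^{28}\mathrm{Si} + {}^{4}\mathrm{He}
\end{aligned} \tag{5.27}$$

and through the capture of $\alpha$ particles in reactions such as

$${}^{20}\mathrm{Ne} + {}^{4}\mathrm{He} \to {}^{24}\mathrm{Mg} + \gamma. \tag{5.28}$$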


They are primary nuclides in the sense that they can be made in a star that
starts out with nothing but hydrogen and He, and, moreover, the amount
produced in any given star is essentially independent of the star’s initial
metallicity. By contrast, a secondary nuclide is one that requires for its
synthesis the presence of a primary nuclide that was synthesized in an earlier
generation of stars.


15 Sometimes the next stable nuclide in this sequence, $^{48}$Ti, is considered to be an $\alpha$
nuclide, but by our definition below it belongs to the iron-peak group.

The very direct routes of equations (5.27) and (5.28) to the formation of 
the $\alpha$ nuclides are reflected in the abundances of these nuclides: they are all
more abundant than their immediate neighbors in the (Z, N) plane. More- 
over, the abundances of these nuclides decrease smoothly with increasing 
atomic number. 

$^{23}$Na and $^{27}$Al are the sole stable isotopes of these two odd-Z elements.
They are related to the $\alpha$ nuclides inasmuch as they are products of C and
Ne burning in hot stars — see equation (5.29) — and if this process were entirely 
responsible for their synthesis, they would be primary elements. However, 
they are also synthesized when He is burned in the presence of $^{18}$O, so they
are not primary elements. 

The iron peak nuclides are those with atomic number in the range 
40 < A < 65 — this range corresponds to the elements Sc, Ti, V, Cr, Mn, 
Fe, Co, Ni and Cu. The floor of the stability valley is here rather flat, with 
the consequence that the most stable isotopes of the elements that lie in the 
iron peak have similar binding energies. The iron-peak nuclides are primary 
nuclides that form late in the evolution of a star when the core is exceedingly 
hot. In fact, the temperature is usually so great that nuclei are constantly 
forming and then splitting apart, with the result that the system is able to 
come into thermal equilibrium. This fact greatly simplifies the calculation 
of the relative abundances of the various nuclides, since the probability of 
each possible configuration is simply proportional to the Boltzmann factor 
$e^{-E/kT}$, where E is the configuration’s energy. The flatness of the valley
floor around the iron peak has the consequence that in thermal equilibrium 
many different nuclides are present in significant numbers. 

The s-process nuclides lie along the valley floor up towards higher A 
from the iron peak. The ‘s’ stands for ‘slow’ for a reason that will emerge 
shortly. Abundant s-process nuclides include $^{88}$Sr, $^{89}$Y, $^{90}$Zr, $^{138}$Ba, $^{139}$La,
$^{140}$Ce, $^{141}$Pr, $^{208}$Pb and $^{209}$Bi.

Certain reactions that take place during nuclear burning release free 
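neutrons. For example, during C-burning a small fraction of colliding C
nuclei undergo the reaction


$$2\,{}^{12}\mathrm{C} \to {}^{23}\mathrm{Mg} + n, \tag{5.29}$$

after which the $^{23}$Mg $\beta^{+}$-decays to $^{23}$Na.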


Because it experiences no Coulomb repulsion on approaching a nucleus, the 
free neutron will quickly be absorbed by a nucleus, to form a heavier isotope 
of the same element. For example, it could convert $^{56}$Fe to $^{57}$Fe, which is
also stable. Subsequent absorption of free neutrons might convert the $^{57}$Fe
nucleus to $^{58}$Fe and then to $^{59}$Fe. Now $^{59}$Fe is unstable (half-life 45 d) and
soon $\beta^{-}$-decays to $^{59}$Co. Absorption of yet another neutron now produces
$^{60}$Co, which is unstable (half-life 5.3 yr) and $\beta^{-}$-decays to $^{60}$Ni. Proceeding
in this way our original $^{56}$Fe nucleus moves ever further up the periodic table.
At any given time within the nuclear-burning regions of a star there will be
nuclei at all different points in this chain of transmutations, and since all
nuclei are exposed to the same flux of neutrons, the time any given stable
nucleus spends at a given point in the chain will be inversely proportional
to its current cross-section for the absorption of a neutron. Hence when
the neutron-generating reactions cease, and with them motion up the chain
comes to a halt, the number of nuclei at each point in the chain is inversely
proportional to the local neutron-absorption cross-section. Figure 5.16 shows
the abundances of the s-process elements before and after multiplication by
their neutron-absorption cross-sections. The post-multiplication values are
not actually independent of A as our naive analysis suggested would be
the case, but they do decline smoothly with increasing A, which the raw
abundances do not.

Figure 5.16 Lower panel: the solar abundances (on an arbitrary
scale) of 34 pure s-process nuclides versus atomic number. Upper
panel: each abundance multiplied by the corresponding neutron-
absorption cross section. [From data published in Käppeler, Beer
& Wisshak (1989)]
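
The naive argument that abundance times cross-section should be constant along the s-process chain can be checked with a toy model. The sketch below is an illustration only: the cross-sections are hypothetical numbers in arbitrary units, the seed abundance is held fixed to mimic a large reservoir of iron-peak nuclei, and every $\beta$-decay is assumed to be instantaneous.

```python
def evolve_chain(sigma, seed_abundance=1.0, exposure=50.0, steps=20000):
    """Euler-integrate dN_i/dtau = sigma[i-1]*N[i-1] - sigma[i]*N[i].

    tau is the time-integrated neutron flux. The seed N[0] is held fixed,
    standing in for a large reservoir of iron-peak nuclei (an assumption).
    """
    abundance = [seed_abundance] + [0.0] * (len(sigma) - 1)
    dtau = exposure / steps
    for _ in range(steps):
        flow = [s * n for s, n in zip(sigma, abundance)]
        for i in range(1, len(abundance)):
            abundance[i] += dtau * (flow[i - 1] - flow[i])
    return abundance

sigma = [1.0, 4.0, 0.5, 2.0]   # hypothetical cross-sections, arbitrary units
abundance = evolve_chain(sigma)
for s, n in zip(sigma, abundance):
    print(f"sigma = {s:4.1f}  N = {n:6.3f}  sigma*N = {s * n:5.3f}")
```

Once the flow along the chain is steady, the product of cross-section and abundance is the same at every link, so nuclides with small cross-sections pile up. A real chain is exposed for a finite time, which is presumably one reason why the products plotted in Figure 5.16 decline with A rather than staying constant.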

The s-process produces secondary elements in the sense that it requires 
iron-peak elements to be present in a C- or O-burning zone. Since a star 
synthesizes iron-peak elements only at the very end of its life, after it has 
largely ceased burning C and O, these iron-peak elements must have been 
present in the star at its birth, having been synthesized in some other star. 
Naively one would expect the abundances of secondary elements to be pro- 
portional roughly to the square of the abundance of primary elements. For 
example, suppose the rate for formation of primary elements is constant in
time:

$$\frac{\mathrm{d}a_{\rm prim}}{\mathrm{d}t} = c_1. \tag{5.30}$$

Then given that the rate of formation of secondary elements is given by

$$\frac{\mathrm{d}a_{\rm sec}}{\mathrm{d}t} = c_2\,a_{\rm prim}, \tag{5.31}$$

where $c_2$ is a second constant, integration of equations (5.30) and (5.31) with
respect to time reveals that

$$a_{\rm sec}(t) = \tfrac{1}{2}\,c_1 c_2\,t^2 = \frac{c_2}{2c_1}\,a_{\rm prim}^2(t). \tag{5.32}$$

In §§8.2.6 and 10.4 we shall see that the observed slope of the mean relation 
between the logarithmic abundances of secondary and primary elements is 
significantly smaller than two. The likely reason is that the density of ‘neu- 
tron poisons’ increases with metallicity - a neutron poison is a nuclide 
such as $^{3}$He or $^{14}$N that readily absorbs neutrons. The increasing density
of neutron poisons diminishes the neutron flux, and the rate of formation 
of s-process nuclides is obviously proportional to the product of the neutron 
flux and the density of heavy seed nuclei. 

The r-process nuclides lie on the neutron-rich side of the valley floor 
just above the s-process nuclides; abundant representatives include $^{80}$Se,
$^{81}$Br, $^{84}$Kr, $^{128,130}$Te, $^{127}$I, $^{192}$Os, $^{193}$Ir and $^{196,198}$Pt. The ‘r’ stands for ‘rapid’
because these nuclides, like the s-process nuclides, form by neutron capture 
but by rapid rather than slow neutron capture. The key issue is this: does 
the first unstable nuclide that is encountered on moving vertically up the 
(Z, N) plane from the bottom of the stability valley have time to $\beta$-decay
to a stable nuclide of higher Z before a further neutron is absorbed? If 
the answer is ‘yes’, the nucleus shuffles along the chain of s-process nuclides, 
which lie near the bottom of the stability valley. If the answer to the question 
is ‘no’, the nucleus is driven up the side of the valley, where nuclides are 
more unstable, until the local half-life to $\beta$-decay becomes comparable to
the mean time between neutron captures, and then jogs along the length of 
the valley. When the neutron-generating reactions cease, each nucleus slides 
down towards the valley floor by $\beta$-decay, coming to rest at an r-process
nuclide. Notice that the abundances of these final nuclides are not expected 
to be inversely proportional to their neutron-capture cross-sections. 
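
The key question posed above, whether $\beta$-decay or the next neutron capture comes first, amounts to comparing two timescales. The sketch below estimates the mean time between captures as $1/(n\sigma v)$ and compares it with the $\beta$-decay mean life. The cross-section, neutron speed and neutron densities are order-of-magnitude assumptions chosen only for illustration; none of them is taken from the text.

```python
import math

def capture_time_s(neutron_density_cm3, cross_section_cm2, speed_cm_s):
    """Mean time between neutron captures, tau = 1 / (n sigma v)."""
    return 1.0 / (neutron_density_cm3 * cross_section_cm2 * speed_cm_s)

def capture_regime(beta_half_life_s, neutron_density_cm3,
                   cross_section_cm2=1e-25, speed_cm_s=3e8):
    """Return 's' if beta-decay is expected to win the race, 'r' otherwise."""
    beta_mean_life = beta_half_life_s / math.log(2.0)
    tau_n = capture_time_s(neutron_density_cm3, cross_section_cm2, speed_cm_s)
    return "s" if beta_mean_life < tau_n else "r"

FE59_HALF_LIFE_S = 45 * 86400.0      # the 45 d half-life quoted in the text

print(capture_regime(FE59_HALF_LIFE_S, 1e8))    # assumed low neutron density
print(capture_regime(FE59_HALF_LIFE_S, 1e22))   # assumed very high neutron density
```

With the low assumed density the capture time is of order years and the nucleus decays first, so it follows the s-process path; with the high density it captures many neutrons before it can decay.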

The distinction between r- and s-process nuclides is not a sharp one be- 
cause many nuclides can be synthesized by both processes. The distinction 
is sharpest when several stable nuclides lie on the neutron-rich side of the 
valley of stability and are isolated from the floor itself by unstable nuclides. 
Figure 5.17 illustrates this phenomenon by showing the valley in the region 
of Sb, where $^{124}$Sn and $^{128}$Te form such islands and respectively ensure that
$^{124}$Te and $^{128}$Xe are pure s-process nuclides. These isolated nuclides can be
synthesized only by the r-process, while those on the valley floor are
synthesized only by the s-process. Nearly all elements heavier than Fe have
isotopes that can be synthesized by the s-process, but the s-process
sometimes leads to small abundances of these isotopes because they have large
neutron-absorption cross-sections.

Figure 5.17 The (Z, N) plane in the neighborhood of Sb. Stable
nuclides have shaded squares. An ‘r’ indicates that a nuclide can
be formed only by the r-process, while an ‘s’ indicates a nuclide
that can be formed only by the s-process. Stable nuclides that can
be formed by either process are marked ‘rs’.

It is at this stage unclear whether elements produced by the r-process 
are primary or secondary, because we have not established at what stage in 
a star’s evolution material might be exposed to the large flux of neutrons 
that is required for the r-process. 

The discussion above indicates that four processes contribute to the 
production of metals: (i) straightforward He-, C- and O-burning produces
not only $^{12}$C and $^{16}$O but also the $\alpha$ nuclides; (ii) the iron-peak nuclides are
formed when nuclei achieve approximate thermal equilibrium in the hot 
cores of evolved stars; (iii) s-process elements are produced by slow neutron- 
irradiation of heavy nuclides that were synthesized in an earlier generation 
of stars; (iv) r-process elements are produced by rapid neutron-irradiation 
of heavy nuclides that may or may not have been synthesized in an earlier 
generation of stars. Now we have to understand (a) at what stages of stellar 
evolution the r-process occurs, and (b) how metals, once formed, are injected 
into the ISM. 


5.2.2 Metal production at $M_{\rm i} < M_{\rm up}$


We saw in §5.1.3 that the remnants of stars are less massive, often much less 
massive, than their progenitor stars. Our interest focuses on the chemical 
composition of the material that a star returns to the ISM rather than locks
up in its remnant. Stars of mass $M_{\rm i} < M_{\rm up}$ form white dwarfs, and these
must contain most of the C, O and $\alpha$ elements that the star synthesizes.
Thus, in a naive picture, these stars enrich the ISM only with He. However, 
the phenomenon of carbon stars (see §5.1.2) makes it clear that this picture 
is over-simplified. At points in their evolutions, stars are liable to dredge the 
products of nuclear processing up from their interiors into their envelopes, 
and this material will pass to the ISM when the envelope is ejected. 

Spectroscopic analyses of planetary nebulae in the Milky Way and the 
Magellanic Clouds yield estimates of the abundances of He, N, O, and some- 
times C in the ejecta. To determine from these data the mass of each element 
that was synthesized in the precursor star, we clearly need to know the differ- 
ence between abundances of the ejecta and those characteristic of the ISM in 
the general vicinity of the planetaries. Unfortunately this difference is small, 
and correspondingly hard to determine. The difference is largest and easiest 
to measure in systems with a metal-poor ISM, such as the SMC. Monk, Bar- 
low & Clegg (1988) find that in the SMC [N/H] is larger in planetaries than 
in HII regions by ~ 1 dex; in LMC and Galactic planetaries this difference 
falls to ~ 0.8 and ~ 0.4, respectively. The enhancements in the abundances 
of other elements are smaller and uncertain. For example, the enhancement 
in [O/H] for planetaries is ~ 0.3 in the SMC and unmeasurably small in more 
metal-rich systems. The enhanced production of N to which these observa- 
tions point is interesting because much of the N that these stars produce is 
thought to be made when C nuclei absorb free neutrons. The neutrons that 
are responsible for this transmutation of C into N will at the same time be 
forming s-process elements. For this reason planetary nebulae are believed 
to be an important source of s-process elements. 
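
Because the enhancements above are quoted in dex, that is as differences of base-10 logarithms, it can help to convert them to linear factors. The values below are those quoted in the text; the helper name is hypothetical.

```python
def dex_to_factor(dex):
    """Linear abundance ratio corresponding to a logarithmic difference in dex."""
    return 10.0 ** dex

# [N/H] enhancement of planetaries over H II regions, as quoted in the text
for system, dex in [("SMC", 1.0), ("LMC", 0.8), ("Galaxy", 0.4)]:
    print(f"{system}: N enhanced by a factor {dex_to_factor(dex):.1f}")
```

The N enhancement therefore falls from a factor of ten in the SMC to a factor of about two and a half in the Galaxy.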

In summary, heavy-element production by stars with $M_{\rm i} < M_{\rm up}$ probably
plays an important role in the chemical evolution of galaxies, especially
early on when the ISM is relatively metal-poor. Unfortunately, the sheer 
quantity of gas that these stars eject makes it difficult to determine obser- 
vationally the rate at which these stars pump freshly made metals into the 
ISM. This difficulty is unfortunate because models of stellar evolution cannot 
reliably predict the mass of heavy elements that is typically carried into a 
stellar envelope and then ejected. 


5.2.3 Supernovae 


Supernovae are believed to be the prime sources of both iron-peak and r- 
process nuclei. They are classified according to their spectra near maximum 
brightness — see Harkness & Wheeler (1990). The spectra of type II su- 
pernovae contain hydrogen lines while those of type I supernovae do not. 
Type I supernovae whose spectra show absorption due to Si II are called type
Ia supernovae, while all other type I supernovae are classified as type Ib 
supernovae. The spectra of type Ib supernovae usually show He lines and,
as we shall see, they are probably physically more closely related to type
II supernovae than to type Ia supernovae.¹⁶ In §7.3.3 we shall see that the
peak luminosities of type Ia supernovae are quite tightly bunched around
$L_B = 9.6 \times 10^{9}\,L_\odot$. Type II supernovae have peak luminosities that span
a broader range: $0.4 \times 10^{9} \lesssim L_B/L_\odot \lesssim 4 \times 10^{9}$ (Tammann & Schröder
1990).
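
The spectroscopic classification described above is a short decision tree, which can be written out as follows. This is a sketch only: the function and argument names are hypothetical, and the optional `split_ic` flag implements the stricter convention, mentioned in the footnote, in which type I supernovae showing neither Si nor He lines are called type Ic.

```python
def classify_supernova(has_hydrogen, has_silicon, has_helium, split_ic=False):
    """Classify a supernova from the lines in its spectrum near maximum light."""
    if has_hydrogen:
        return "II"
    if has_silicon:
        return "Ia"
    if split_ic and not has_helium:
        return "Ic"
    return "Ib"

print(classify_supernova(True, False, False))
print(classify_supernova(False, True, False))
print(classify_supernova(False, False, True))
print(classify_supernova(False, False, False, split_ic=True))
```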

Type II supernovae do not occur in early-type galaxies, while type Ia 
supernovae occur in all types of galaxy. Moreover, in spiral galaxies type 
II supernovae, like OB stars, tend to occur in spiral arms, while type Ia 
supernovae show no such preference. Since spiral arms are delineated by 
short-lived massive stars and early-type galaxies do not appear to contain 
such objects, these observations suggest that the progenitors of type II
supernovae are massive stars, while those of type Ia supernovae are not. Since
a distinction has been drawn between type Ia and type Ib supernovae only 
since the mid 1980s, the statistics on the occurrence of type Ib supernovae 
are poor. However, the well observed members of the Ib class have occurred 
in spiral galaxies near spiral arms just as type II supernovae do. In the Milky 
Way the supernova rate is thought to be of order one per 40 years, and only 
~ 15% of supernovae are of type Ia (Tammann, Löffler & Schröder 1994).

The physical interpretation of the different kinds of supernova is thought 
to be the following (Woosley 1990). Both type Ib and type II supernovae 
occur when a high-mass star suffers core collapse — for this reason we shall re- 
fer to them collectively as core-collapse supernovae. Type Ib supernovae 
probably attend the deaths of the most massive stars, which have lost their 
hydrogen envelopes, either to a stellar wind or a binary companion, before 
suffering core collapse. Hence at the moment of core collapse, these objects 
have He rather than hydrogen envelopes, and thus fail to display hydrogen 
lines. Type Ia supernovae are thought to result from the thermonuclear ex- 
plosion of a C/O white dwarf. This explosion is probably triggered by the 
accretion of material from a binary companion. 


Metal production by core-collapse supernovae It is relatively simple 
to calculate the structure of a massive star at the moment of core collapse. 
A much harder task is to determine (i) how much of the star will be blown 
away by the imploding core, and (ii) what changes will occur in its chemical 
composition as it is blown away. What makes the problem extremely difficult 
is that 99% of the energy released emerges in a vast blast of neutrinos. It 
is extremely hard to determine accurately which tiny (~ 1%) fraction of 
the energy of implosion is transferred to the stellar envelope, and thus to 
determine whether the envelope is ejected and a supernova produced. In 
fact, some implosions may not produce readily detectable electromagnetic
radiation and thus go undetected.


16 Some authors actually require type Ib supernovae to show He lines, and classify type
I supernovae that show neither Si nor He lines as type Ic supernovae.




Table 5.4 Chemical compositions of stars outside the mass-cut at core collapse
(all masses in solar masses)

$M_\alpha$   $M_{\rm i}$   He     C       O        Ne      Mg      Si+Ca   Fe

2.7          10            0.90   0.048   0.004    0       0       0       0.02
3            12            1.52   0.048   0.004    0       0       0       0.03
4            15            2.04   0.192   0.167    0.040   0.066   0.101   0.05
6            20            2.58   0.288   0.774    0.426   0.254   0.265   0.07
8            25            3.10   0.568   1.660    0.767   0.270   0.339   0.10
12           31            3.59   1.220   3.840    1.040   0.311   0.424   0.14
16           39            4.06   1.620   6.160    1.420   0.405   0.979   0.19
24           55            5.02   2.300   11.700   1.980   0.624   0.934   0.24
32           85            8.62   2.050   16.600   2.350   0.970   1.190   0.30

SOURCE: Adapted from Arnett (1991)


In view of the difficulty of modeling the violent ejection of a massive 
star’s envelope, in the 1970s theorists developed the following rough-and- 
ready treatment of mass ejection in massive supernovae. One defines a radius 
within the pre-collapse star such that everything outside this radius is ejected 
and everything interior to it falls in to form the remnant. The shock wave 
that is responsible for ejecting the material that lies just outside this mass- 
cut induces a violent burst of nuclear reactions in the departing material. In 
particular, a substantial mass of Si is burnt to iron-peak nuclides, including 
the radioactive nuclide $^{56}$Ni. During this nuclear flash, iron-peak elements
are subject to intense neutron bombardment, and many are converted to 
r-process elements. Further out, layers of material that had long ago been 
processed by the star to C, O, $\alpha$ nuclides and s-process nuclides are ejected by
the shock. Hence core-collapse supernovae contribute every kind of nuclide
to the ISM: iron-peak, H, He, C, O, $\alpha$, s- and r-process nuclides.

Table 5.4 lists for several stars the masses (in solar masses) of the various 
elements that they are expected to eject when their cores implode. These 
numbers have been obtained by calculating the composition of each star 
immediately prior to core collapse and choosing a mass-cut. Each element 
is approximately confined to a spherical shell, with the lightest element on 
the outside and the heaviest in a shell around the core. The first column in 
Table 5.4 gives the mass of the He core at the onset of He-burning — this is 
effectively what determines the subsequent evolution of the star. The second 
column gives the initial mass that would lead to such a He core in the absence 
of mass loss; the actual initial mass will be larger, probably by a significant 
factor in the cases of the most massive stars listed. The subsequent columns 
give the masses of the various elements that lie outside the adopted mass 
cut. 
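
The entries of Table 5.4 can be combined to give the total mass of heavy elements ejected by each model star. The sketch below simply transcribes the table and sums the metal columns; the helper names are hypothetical, and the ratio to the second column is only indicative because that column ignores mass loss.

```python
# Rows of Table 5.4: (M_alpha, M_i, He, C, O, Ne, Mg, Si+Ca, Fe), solar masses.
TABLE_5_4 = [
    (2.7, 10, 0.90, 0.048, 0.004, 0.0, 0.0, 0.0, 0.02),
    (3, 12, 1.52, 0.048, 0.004, 0.0, 0.0, 0.0, 0.03),
    (4, 15, 2.04, 0.192, 0.167, 0.040, 0.066, 0.101, 0.05),
    (6, 20, 2.58, 0.288, 0.774, 0.426, 0.254, 0.265, 0.07),
    (8, 25, 3.10, 0.568, 1.660, 0.767, 0.270, 0.339, 0.10),
    (12, 31, 3.59, 1.220, 3.840, 1.040, 0.311, 0.424, 0.14),
    (16, 39, 4.06, 1.620, 6.160, 1.420, 0.405, 0.979, 0.19),
    (24, 55, 5.02, 2.300, 11.700, 1.980, 0.624, 0.934, 0.24),
    (32, 85, 8.62, 2.050, 16.600, 2.350, 0.970, 1.190, 0.30),
]

def metal_mass(row):
    """Total ejected mass of elements heavier than He (columns C to Fe)."""
    return sum(row[3:])

def metal_fraction(row):
    """Ejected metal mass relative to the nominal initial mass M_i."""
    return metal_mass(row) / row[1]

for row in TABLE_5_4:
    print(f"M_i = {row[1]:3d}: metals = {metal_mass(row):6.3f}, "
          f"fraction = {metal_fraction(row):.3f}, O/Fe = {row[4] / row[8]:5.1f}")
```

The summed metal mass rises steeply with initial mass, and oxygen dominates the ejecta of the most massive models.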

The fundamental correctness of this picture of the working of massive 
supernovae was confirmed by studies of the type II supernova 1987A, which
was observed to explode in the LMC on 1987 February 23. Since it was so
close to us (~ 50 kpc) it could be observed in incomparably greater detail 
than any other supernova has been, or indeed is likely to be for decades, 
perhaps centuries, to come. In particular, the spectacular burst of neutrinos 
that carries away most of the energy released by a supernova was observed. 
Such a burst had long been predicted by theory but never before been de- 
tected, and, because it lasted only ~ 10s, we could determine the precise 
origin in time of all other observations. Arnett et al. (1989) and McCray 
(1993) review this and many other aspects of SN 1987A. 

Models of SN 1987A suggest that in its progenitor star He ignited when
the He core contained $6\,M_\odot$. If the progenitor has been correctly identified
on archival plates, it was a blue rather than a red giant (MK type B3 I),
presumably because it had suffered significant mass loss prior to the explosion.
From Table 5.4 it follows that its initial mass exceeded $20\,M_\odot$. Models of
neutrino emission by the collapsed core suggest that the latter had a mass
$\sim 1.6\,M_\odot$. Of particular importance is evidence that from about 100 d after
the explosion, the supernova’s light emission was largely powered by the
radioactive decay of $0.069\,M_\odot$ of $^{56}$Ni (mean-life 8.8 d) to first $^{56}$Co and then
$^{56}$Fe. Specifically, in the interval 100-300 d, the shape of the light curve is
a declining exponential with a time-constant that perfectly fits the 111.3 d
mean-life of $^{56}$Co, and $0.069\,M_\odot$ of $^{56}$Ni is required to generate the observed
luminosity. (The mass cut upon which Table 5.4 is based has been chosen
to yield this quantity of Ni, and hence, after its complete decay, of Fe.) After
300 d the light curve declined faster than exponentially because the ejecta
had become too rarefied to degrade into optical photons all the $\gamma$ rays that
carry much of the energy released by the decaying Co. Indeed, from about
175 d the 847 keV and 1240 keV $\gamma$-ray lines of $^{56}$Co were directly detected.
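
The exponential tail of the light curve follows from the two-step decay chain. The sketch below evaluates the standard two-step (Bateman) solution for the number of $^{56}$Co nuclei, using the mean lives quoted above, and converts the decline in the decay rate into magnitudes per 100 d. It assumes, as an idealization, that the luminosity is proportional to the instantaneous Co decay rate, that is, that all the decay energy is trapped and re-radiated promptly.

```python
import math

TAU_NI = 8.8     # mean life of 56Ni in days, as quoted in the text
TAU_CO = 111.3   # mean life of 56Co in days, as quoted in the text

def co_decay_rate(t_days, n_ni_initial=1.0):
    """56Co decays per day at time t, per initial 56Ni nucleus."""
    n_co = (n_ni_initial * TAU_CO / (TAU_CO - TAU_NI)
            * (math.exp(-t_days / TAU_CO) - math.exp(-t_days / TAU_NI)))
    return n_co / TAU_CO

def decline_mag_per_100d(t1, t2):
    """Mean decline rate between t1 and t2, in magnitudes per 100 days."""
    delta_mag = 2.5 * math.log10(co_decay_rate(t1) / co_decay_rate(t2))
    return delta_mag * 100.0 / (t2 - t1)

print(f"{decline_mag_per_100d(100.0, 300.0):.3f} mag per 100 d")
```

By 100 d essentially all of the Ni has decayed, so the rate falls as $e^{-t/111.3\,{\rm d}}$, or just under one magnitude per 100 d.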

The observed spectra of SN 1987A imply a total kinetic energy
$\sim 1.5 \times 10^{44}$ J in the ejecta. P-Cygni profiles of various spectral lines (Figure 3.4)
indicated that the outermost layers of the star’s envelope were ejected at
$\gtrsim 20\,000$ km s$^{-1}$. This material was soon slowed to $\sim 2500$ km s$^{-1}$, however.
The structure of the envelope was made highly complex by the development 
of hydrodynamical instabilities soon after the passage of the blast wave. 
The complexity of the envelope has defeated efforts to obtain from observed 
spectra accurate estimates of the amounts of the various metals that were 
thrown off by SN 1987A — see McCray (1993) for a review of this topic. 


Metal production by type Ia supernovae The progenitors of type Ia 
supernovae are believed to be C/O white dwarfs that by accretion from a
binary companion achieve a mass $1.38\,M_\odot$, just barely smaller than
Chandrasekhar’s limiting mass for a C/O white dwarf. The electrons near the
center of such a star are highly degenerate, so that the star’s nuclear fuel is 
liable to ignite explosively — see Box 5.2. The details of how this ignition takes 
place are not fully understood because the event proves hard to model with 
sufficient accuracy, but there is a consensus that a wave of burning sweeps 
through the star in which about half of the star’s C and O are converted to
iron-group elements in about a second. The predicted light-curve and mix of
nuclides produced by this event depend sensitively on the speed with which
the wave sweeps through the star. The best agreement with observation is
achieved when the wave is a deflagration wave, which propagates subsonically,
at a speed close to but below the sound speed; see Woosley (1990) for
a discussion of the dynamics of type Ia supernovae.

The explosion of a C/O white dwarf releases only 1% of the energy that
is released in a type II supernova, but a large fraction of it is channeled
into the kinetic energy of the ejecta. Consequently, the mechanical energy
of a type Ia supernova is, to within the uncertainties, identical with the
mechanical energy of a core-collapse supernova: $\sim 10^{44}$ J. Since this energy is
concentrated in $1.4\,M_\odot$ rather than over $10\,M_\odot$ of ejecta, the characteristic
ejection velocity is larger: $(10\text{-}20) \times 10^{3}$ km s$^{-1}$. The star leaves no remnant,
being completely disrupted by the event. The entire UV and optical display
of a type Ia supernova derives from radioactive decay, overwhelmingly of $^{56}$Ni
to $^{56}$Fe. A type Ia supernova produces roughly 5-10 times as much $^{56}$Fe as
SN 1987A did: $\sim 0.7\,M_\odot$, of which $\sim 0.45\,M_\odot$ forms by the decay of $^{56}$Ni.
Unfortunately, there are significant uncertainties in most of these numbers.
Clearly, type Ia supernovae are significant sources of iron-peak elements.
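
The statement that the same mechanical energy shared among less ejecta implies faster ejecta can be checked with the relation $E = \frac{1}{2}Mv^2$. The sketch below computes the mass-weighted rms ejection speed; the solar mass in kilograms is a standard value rather than one taken from the text, and the function name is hypothetical.

```python
import math

M_SUN_KG = 1.989e30   # solar mass in kg (standard value, not from the text)

def characteristic_velocity_km_s(kinetic_energy_j, ejecta_mass_msun):
    """Rms ejection speed from E = (1/2) M v^2, in km/s."""
    mass_kg = ejecta_mass_msun * M_SUN_KG
    return math.sqrt(2.0 * kinetic_energy_j / mass_kg) / 1.0e3

v_ia = characteristic_velocity_km_s(1.0e44, 1.4)    # type Ia
v_cc = characteristic_velocity_km_s(1.0e44, 10.0)   # core collapse, 10 solar masses
print(f"type Ia: {v_ia:.0f} km/s, core collapse: {v_cc:.0f} km/s")
```

The rms speed for the type Ia case is somewhat below $10^{4}$ km s$^{-1}$, the same order as the range quoted above, and about 2.7 times the corresponding speed for 10 solar masses of ejecta.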


5.3 Models of chemical enrichment 


We now study models of the enrichment of stellar systems that are valuable 
for the interpretation of both observations of external galaxies (§8.2.6) and of 
stars within the Milky Way (§10.4). The three models that we consider differ 
from one another in the nature of their interaction with their surroundings: 
the first model is a completely isolated system; the second model vents metal- 
rich gas while the third model accretes metal-free gas from its surroundings. 


5.3.1 The closed-box model 


The closed-box model was introduced by Talbot & Arnett (1971) as the sim- 
plest possible model for the chemical evolution of a portion of a galaxy such 
as the solar neighborhood. It focuses on a narrow annulus of galactocentric 
radius, and assumes that in the period under study no material either en- 
ters or leaves this region. Initially, the material is entirely gaseous and free 
of heavy elements. As time goes on, stars are formed from the interstellar 
gas and massive stars return hydrogen, helium, and heavy elements to the 
interstellar medium. We assume that turbulent motions keep the gas well 
stirred and therefore homogeneous. The supply of interstellar gas is gradu- 
ally consumed, and the remaining gas becomes steadily more polluted with 
heavy elements. 

Suppose that at any time there is a mass $M_h$ of heavy elements in the
interstellar gas, which itself has mass $M_g$. Then the metallicity of the
interstellar gas is

$$Z \equiv \frac{M_h}{M_g} \qquad (Z_\odot \simeq 0.02). \tag{5.33}$$

We suppose that at this time the stars have total mass $M_s$ and consider the
effect of forming a mass $\delta' M_s$ of new stars. For simplicity we adopt the
instantaneous recycling approximation. In this approximation we neglect
the delay between the formation of a generation of stars and the ejection of
heavy elements by that generation’s massive stars. Since the progenitors of
core-collapse supernovae have lifetimes that are very much shorter than the
Hubble time, this approximation is reasonable. It is, however, not valid as
regards the contributions to metal enrichment of either type Ia supernovae
or of stars of a few solar masses. The theory can be refined to take into
account the time delay between star formation and the ejection into the ISM
of heavy elements; see Pagel (1997).

Let the mass of the stars in $\delta' M_s$ that remain after the massive stars
have died be $\delta M_s$, and let the mass of the heavy elements produced by this
stellar generation be $p\,\delta M_s$, where p is the yield of that generation of stars.
Then the total change in the heavy element content of the interstellar gas
that arises from these new stars is

$$\delta M_h = p\,\delta M_s - Z\,\delta M_s = (p - Z)\,\delta M_s. \tag{5.34}$$


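The metallicity of the interstellar gas changes by an amount

$$\delta Z = \delta\!\left(\frac{M_h}{M_g}\right) = \frac{\delta M_h}{M_g} - \frac{M_h}{M_g^2}\,\delta M_g = \frac{1}{M_g}\left(\delta M_h - Z\,\delta M_g\right). \tag{5.35}$$

By conservation of mass $\delta M_s = -\delta M_g$, so, combining equations (5.34) and
(5.35), we obtain

$$\delta Z = -p\,\frac{\delta M_g}{M_g}. \tag{5.36}$$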


If the yield $p$ of each generation of stars is the same, we may integrate
equation (5.36) to obtain the metallicity at time $t$ as

$$Z(t) = -p \ln\left[\frac{M_g(t)}{M_g(0)}\right]. \tag{5.37}$$


Here we have employed our assumption that the gas is initially free of metals. 
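
As a numerical check of equation (5.37), the following Python sketch steps equation (5.36) forward from a metal-free start and compares the result with the closed-form logarithm. The yield p = 0.01 and the function names are illustrative assumptions, not values taken from the text.

```python
import math

def closed_box_metallicity(gas_fraction, p):
    """Equation (5.37): Z = -p ln[M_g(t)/M_g(0)]."""
    return -p * math.log(gas_fraction)

def integrate_closed_box(final_gas_fraction, p, steps=100_000):
    """Step equation (5.36), dZ = -p dM_g/M_g, starting from metal-free gas."""
    m_gas, z = 1.0, 0.0                        # gas mass in units of M_g(0)
    d_m = (final_gas_fraction - 1.0) / steps   # negative: gas is consumed
    for _ in range(steps):
        z += -p * d_m / (m_gas + 0.5 * d_m)    # midpoint rule for dM_g/M_g
        m_gas += d_m
    return z

p = 0.01  # illustrative yield only
for x in (0.5, 0.2, 0.1):
    print(x, closed_box_metallicity(x, p), integrate_closed_box(x, p))
```

The two columns agree closely, and plotting Z against ln x would give a straight line of slope -p.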

Since $M_g(0)$ is the total mass of the annulus, equation (5.37) makes
a clear prediction: if we plot the observed metallicity $Z$ at different points
within galactic disks against the logarithm of the fraction of the local mass
density that is in gaseous form, we will obtain a straight line. The absolute
value of the slope of this straight line will be the yield $p$. In §8.2.6 we shall
review observational evidence for a relationship of this type.

In addition to predicting a relationship between the metallicity of the
ISM and the fraction of the system’s mass that is still in the ISM, the
closed-box model also predicts the metallicity-distribution of stars. Specifically, by
equation (5.37), the mass of the stars that have metallicity less than $Z(t)$ is

$$M_s[<Z(t)] = M_s(t) = M_g(0) - M_g(t) = M_g(0)\left[1 - e^{-Z(t)/p}\right]. \tag{5.38}$$

This equation predicts that stars will be widely distributed in metallicity. 
In particular, a significant fraction of all stars will have metallicities smaller 
than, say, a third of the metallicity of currently-forming stars. Quantitatively, 
at time $t$ the fraction of stars with $Z < \alpha Z(t)$ is

$$\frac{M_s[<\alpha Z(t)]}{M_s[<Z(t)]} = \frac{1 - e^{-\alpha Z/p}}{1 - e^{-Z/p}} = \frac{1 - x^{\alpha}}{1 - x}, \tag{5.39}$$

where equation (5.37) for $Z/p$ has been used and $x \equiv M_g(t)/M_g(0)$ is the
remaining gas fraction. A reasonable estimate for $x$ in the solar neighborhood
is $x \simeq 0.1$. When this is substituted into equation (5.39) one finds
$M_s(\tfrac{1}{3}Z) = 0.51M_s(Z)$. That is, if the solar neighborhood were a closed box and had
started from zero initial metallicity, fully a half of all stars would have less
than a third of the metallicity of the most metal-rich stars. In Chapter 10
we shall see that there are in reality many fewer metal-poor stars near the
Sun.
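
The last expression in equation (5.39) is easy to tabulate. The short Python sketch below does so for a few gas fractions; the function name and the choice of sample values are ours. For x = 0.1 and α = 1/3 the expression evaluates to about 0.6, and to about 0.5 for x ≈ 0.2, so the conclusion that roughly half or more of all stars would be metal-poor holds across this range.

```python
def metal_poor_fraction(alpha, x):
    """Equation (5.39): fraction of stars with Z < alpha * Z(t) at gas fraction x."""
    return (1.0 - x**alpha) / (1.0 - x)

for x in (0.05, 0.1, 0.2):
    print(x, round(metal_poor_fraction(1.0 / 3.0, x), 3))
```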

Unfortunately, to determine $M_s(Z)$ observationally we have to be able
to resolve individual stars, and this requirement in practice confines our
knowledge of $M_s(Z)$ to the stellar populations of the Milky Way.


5.3.2 The leaky-box model 


In the Milky Way stars form in molecular cloud complexes, and these systems 
are observed to be by no means closed boxes. For reasons that are inade- 
quately understood, only a small fraction of the gas in a molecular cloud has 
been turned into stars by the time stars formed early in the cloud’s life begin 
to explode as core-collapse supernovae. These supernovae, together with fast 
winds from massive stars that have not yet completed their lives, drive strong 
shocks into the cloud’s remaining gas. The cumulative effect of these shocks 
is to blast the remaining gas out of the cloud. It is likely that something 
similar happened on a larger scale during the formation of globular clusters 
and elliptical galaxies. Hence we now investigate the chemical evolution of a 
leaky box; that is, one from which gas is driven by stars that form within
the box.

We suppose that supernovae drive gas out of the box at a rate that is
proportional to the star-formation rate:

$$\frac{dM_t}{dt} = -c\,\frac{dM_s}{dt}, \tag{5.40}$$

where $M_t$ is the mass of stars and gas in the box and $c$ is a constant.
Integrating this equation we find that $M_t(t) = M_t(0) - cM_s(t)$, and thus
by conservation of mass we have for the gas mass at any time,

$$M_g(t) = M_t(t) - M_s(t) = M_t(0) - (1 + c)M_s(t). \tag{5.41}$$


An analysis similar to that which leads to equation (5.36) shows that if the
gas expelled had the same metallicity as the rest of the interstellar medium,

$$\frac{dZ}{dM_s} = \frac{p}{M_g} = \frac{p}{M_t(0) - (1 + c)M_s}. \tag{5.42}$$

Integrating this differential equation subject to the initial condition
$M_s(Z = 0) = 0$ yields

$$M_s(<Z) = \frac{M_t(0)}{1 + c}\left[1 - \exp\left(-\frac{(1 + c)Z}{p}\right)\right]. \tag{5.43}$$

Comparing this expression with equation (5.38), we see that the only effect
of the steady outflow of interstellar gas is to reduce the effective yield to
$p/(1 + c)$.

Hartwick (1976) formulated the leaky-box model to reproduce the ob- 
served metallicity distribution within the Milky Way’s halo. We shall discuss 
these results in §10.7 and here concentrate on an entirely different appli- 
cation of the model. In §4.3.4 we saw that the line-strengths of elliptical 
galaxies are tightly correlated with their central velocity dispersions (see 
Figure 4.42). Consider the mean stellar metallicity that is predicted by the 
leaky-box model. In this model star formation ceased when the gas was
exhausted. By equation (5.41) this occurred when $M_s = M_t(0)/(1 + c)$. Thus
the present mean stellar metallicity should be

$$\bar{Z}_s = \frac{1 + c}{M_t(0)} \int_0^\infty dZ\, Z\, \frac{dM_s}{dZ}. \tag{5.44}$$

When we use equation (5.43) to evaluate the integral in this equation, we
obtain

$$\bar{Z}_s = \frac{p}{1 + c}. \tag{5.45}$$

Figure 5.18 Full curve: the cooling function $\Lambda$ of a hydrogen-He
mixture in collisional equilibrium with 10% He by number. Dashed
curve: the same quantity for gas with solar abundances. The radiative
energy loss is $Q = \Lambda n_e^2$ W m$^{-3}$, where $n_e$ is the electron
density. [From data kindly supplied by R. Edgar]


If the system turns all its gas into stars, $c = 0$ and $\bar{Z}_s = p$, but if supernovae
are able to drive most of the original gas out of the system, $c \gg 1$ and
$\bar{Z}_s \simeq p/c$. Thus the effective yield is expected to decline in step with the
ease with which the hot, metal-rich component of the interstellar medium
could flow out of the galaxy.
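
The result that the mean stellar metallicity of a leaky box is p/(1 + c) can be checked numerically. The Python sketch below integrates the gas metallicity over the stellar mass formed, using the integral of equation (5.42), and compares the mass-weighted mean with p/(1 + c). The yield p = 0.01, the unit initial mass and the function names are illustrative assumptions.

```python
import math

def gas_metallicity(m_s, p, c, m_t0=1.0):
    """Integral of equation (5.42): gas metallicity once a stellar mass m_s has formed."""
    return -(p / (1.0 + c)) * math.log(1.0 - (1.0 + c) * m_s / m_t0)

def mean_stellar_metallicity(p, c, n=100_000, m_t0=1.0):
    """Midpoint-rule estimate of the mass-weighted mean metallicity of the stars."""
    m_final = m_t0 / (1.0 + c)       # stellar mass when the gas is exhausted
    d_m = m_final / n
    total = sum(gas_metallicity((i + 0.5) * d_m, p, c, m_t0) for i in range(n))
    return total * d_m / m_final

p = 0.01  # illustrative yield only
for c in (0.0, 1.0, 9.0):
    print(c, mean_stellar_metallicity(p, c), p / (1.0 + c))
```

Each star is assumed to inherit the gas metallicity at the moment it forms, which is the same assumption that underlies equation (5.43).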

Figure 5.18 shows as functions of temperature the cooling rates of both
primordial gas (full curve) and of gas that contains solar abundances of
heavy elements (dashed curve).$^{17}$ In both cases the cooling rates rise from
very small values at $T < 1000$ K to peaks at $T \simeq 2 \times 10^4$ K that are associated
with the ionization of hydrogen. A second peak at $T \simeq 10^5$ K is associated
with the ionization of He. Beyond this peak the cooling rate of primordial gas
declines steadily to a shallow minimum near $T = 2 \times 10^6$ K. The cooling rate
of gas that contains heavy elements shows several subsidiary peaks beyond
that associated with He ionization, but nonetheless declines fairly steadily
from $T \simeq 3 \times 10^5$ K to $T \simeq 2.5 \times 10^7$ K. The significance of these cooling
rates for the present investigation is the following.

Equilibrium between heating and cooling processes is unlikely to be
established at temperatures for which the relevant cooling curve falls with
rising $T$. Indeed, at such temperatures, any small excess of heating over
cooling would cause the temperature to rise and the cooling rate to fall,
thus exacerbating the original small disequilibrium. Moreover, in the region
$10^4\,{\rm K} < T \lesssim 10^6$ K, cooling is so effective that, in the absence of strong
heating, gas in this region generally cools to $T < 10^4$ K. Thus, under equilibrium
conditions we expect the temperature of any given parcel of gas to satisfy
either $T < 10^4$ K or $T > 3 \times 10^6$ K.

17 The cooling rates in Figure 5.18 assume that photons created by the gas can escape
freely and that the gas is the only source of radiation.

In general we expect any given stellar system to contain both cool
($T < 10^4$ K) and hot ($T \gtrsim 3 \times 10^6$ K) parcels of gas; that is, its ISM will consist of
two phases, one hot and one cold. The one-dimensional velocity dispersion
that corresponds to temperature $T$ is

$$\sigma_{\rm gas} \equiv \left(\frac{kT}{\mu m_p}\right)^{1/2} \simeq 117\left(\frac{T}{10^6\,{\rm K}}\right)^{1/2} {\rm km\,s^{-1}}. \tag{5.46}$$

Hence the velocity dispersion associated with the hot component of the ISM
is $\sigma_h \simeq 200$ km s$^{-1}$. For comparison, stars at the center of a globular cluster
or dwarf spheroidal galaxy have dispersion $\sigma_s \simeq 5$ to $10$ km s$^{-1}$ (see §6.1.11),
while at the center of a giant elliptical galaxy they have $\sigma_s > 200$ km s$^{-1}$
(cf. Figure 4.42). Hence in a globular cluster or a dwarf spheroidal galaxy,
$\sigma_h > \sigma_s$ and the hot phase of the ISM is likely to flow freely out of the
system, with the consequence that the system experiences little self-enrichment.
By contrast, in a giant elliptical galaxy $\sigma_h \simeq \sigma_s$ and we expect the system’s
gravitational field to be able to confine the hot component of the ISM
approximately as well as it confines the system’s stars,$^{18}$ with the consequence
that giant elliptical galaxies should experience substantial self-enrichment.
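
The thermal velocity dispersion quoted for the hot phase can be reproduced with a few lines of Python. This is a sketch under a stated assumption: the mean molecular weight μ = 0.6, appropriate to fully ionized gas of roughly cosmic composition, is our choice and is not specified in the text.

```python
import math

K_B = 1.380649e-23     # Boltzmann constant, J/K
M_P = 1.67262192e-27   # proton mass, kg

def sigma_gas_kms(temperature_k, mu=0.6):
    """One-dimensional thermal velocity dispersion (kT / mu m_p)^(1/2), in km/s."""
    return math.sqrt(K_B * temperature_k / (mu * M_P)) / 1.0e3

for t in (1.0e4, 1.0e6, 3.0e6):
    print(f"T = {t:.0e} K   sigma = {sigma_gas_kms(t):.0f} km/s")
```

With this μ, gas at 3 × 10^6 K has a dispersion close to 200 km/s, while gas at 10^4 K has a dispersion of order 10 km/s, comparable to the stellar dispersions of globular clusters and dwarf spheroidals.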

To obtain a quantitative relation between $\sigma_s$ and $Z$ we estimate the rate
of ejection of gas from a giant elliptical galaxy by equating the rate at which
the ejected gas gains potential energy to the rate of injection of energy into
the ISM by supernovae. Thus with equation (5.40) we write

$$\Phi\,\frac{dM_t}{dt} \simeq c f \sigma_s^2\,\frac{dM_s}{dt} = \epsilon_{\rm SN}\,\frac{dM_s}{dt}. \tag{5.47}$$

Here $\Phi \simeq -f\sigma_s^2$ is the typical gravitational potential from which the ejected
gas is lifted, $f$ is a number of order a few, and $\epsilon_{\rm SN}$ is the energy injected
into the ISM per unit mass converted into stars. For the IMF characteristic
of the solar neighborhood (see §5.1.9), one core-collapse supernova explodes
per $\sim 200\,M_\odot$ of fresh star formation. Given that each supernova injects
$\sim 10^{44}$ J, $\epsilon_{\rm SN} \simeq 10^{44}\,{\rm J}/200\,M_\odot \simeq 2.5 \times 10^{11}$ J kg$^{-1}$. Inserting this estimate
into equation (5.47) we find


$$c \simeq \frac{\epsilon_{\rm SN}}{f\sigma_s^2}. \tag{5.48}$$

This equation suggests that $c$ will increase from a value smaller than unity
for giant elliptical galaxies to one of order a few for dwarf ellipticals. Since
we have assumed that none of the mechanical energy output of supernovae is
radiated rather than used to lift gas from the galaxies, our estimate of $c$
is probably on the high side. (But see Problem 5.12 for an indication that
cooling cannot be very effective.) Hence it is probably sensible to treat the
factor 0.6/f in (5.48) as a free parameter to be fitted to the data and to
focus on the predicted dependence $c \propto \sigma_s^{-2}$. Inserting this dependence into
equation (5.45) for the final mean metal-abundance of a leaky box’s stars,
we obtain the prediction

$$\bar{Z}_s = \frac{p}{1 + (\sigma_z/\sigma_s)^2}, \tag{5.49}$$

where $\sigma_z$ is a parameter to be determined from the observations. While
Figure 4.42 shows this prediction to be qualitatively correct, quantitative
confrontation of theory and observation must await determination of the
relationship $\bar{Z}_s({\rm Mg}_2)$ between the mean metallicity of a stellar population
and its Mg$_2$ line-strength index. At the moment all one can say is that there
is no indication in Figure 4.42 that the observed Mg$_2(\sigma_s)$ relation flattens
at $\sigma_s \gtrsim \sigma_z$ as equation (5.49) predicts the corresponding relation for $\bar{Z}_s(\sigma_s)$
does.

18 In §8.3 we shall encounter direct observational evidence for this conjecture.
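
To see the flattening that the leaky-box prediction implies, one can write $c = (\sigma_z/\sigma_s)^2$ in equation (5.45) and tabulate the mean metallicity in units of the yield. In the Python sketch below the value σ_z = 150 km/s is purely illustrative, since the text leaves this parameter to be fitted to the observations.

```python
def mean_metallicity(sigma_s, sigma_z, p=1.0):
    """Mean stellar metallicity p / (1 + c) with c = (sigma_z / sigma_s)**2."""
    c = (sigma_z / sigma_s) ** 2
    return p / (1.0 + c)

sigma_z = 150.0  # km/s, hypothetical value for illustration
for sigma_s in (10.0, 50.0, 150.0, 300.0, 600.0):
    print(sigma_s, round(mean_metallicity(sigma_s, sigma_z), 3))
```

The metallicity rises as $\sigma_s^2$ for $\sigma_s \ll \sigma_z$, reaches half the yield at $\sigma_s = \sigma_z$, and saturates at the yield for larger dispersions.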

Clearly, the analysis just given should apply with minor modifications
to the radial line-strength gradients in elliptical galaxies that were discussed
in §4.3.2. Differentiating equation (5.45) we find

$$\frac{d\ln \bar{Z}_s}{d\ln r} = -\frac{r}{1 + c}\,\frac{dc}{dr}. \tag{5.50}$$

Now from equation (5.47) we have that

$$c = -\frac{\epsilon_{\rm SN}}{\Phi(r)}, \tag{5.51}$$

and on differentiating this and substituting the result into (5.50) we conclude
that

$$\frac{d\ln \bar{Z}_s}{d\ln r} = \frac{c}{1 + c}\,\frac{v_c^2}{\Phi} = \frac{\epsilon_{\rm SN}}{\epsilon_{\rm SN} - \Phi}\,\frac{v_c^2}{\Phi}, \tag{5.52}$$

where $v_c = \sqrt{r\,d\Phi/dr}$ is the circular speed. For plausible values of $\Phi$ and
$\epsilon_{\rm SN}$, equation (5.52) predicts gradients of the same order ($-0.2$) as those
observed [see equation (4.21)]. Unfortunately, the appropriate values of $\Phi$
and $\epsilon_{\rm SN}$ are uncertain, because the extent of dark halos around ellipticals is
unclear, as is the importance of radiative cooling for the early dynamics of
the hot phase of an elliptical’s ISM.


5.3.3 The accreting-box model


In §8.4 we shall encounter evidence that galaxies such as our own are
accreting gas. We now investigate an important consequence of such accretion,
namely a tendency to make metal-poor stars much rarer than they would
otherwise be.

Suppose that initially the total mass of gas and dust in our box is very 
small, and imagine that during some initial period the chemical evolution 
proceeds according to the standard closed-box model. At the end of an 
appropriate period, 90% of the gas will have been converted to stars and, 
as we saw at the end of §5.3.1, roughly half of all stars will be more metal- 
poor than a third of the current interstellar abundance. Now feed metal-free 
gas into the box at precisely the rate at which the box turns gas into stars. 
Imagine a small additional mass $\delta M$ of primordial gas joining the box. In a
steady state, an exactly equal mass becomes locked up in stellar remnants,
and a mass $p\,\delta M$ of freshly manufactured heavy elements is returned to the
interstellar medium. Thus the overall effect on the interstellar gas is to
remove mass $\delta M$ at metallicity $Z$ and to return the same mass at metallicity
$p$. Consequently, if we continue to feed the box gas at exactly the rate at
which it is locking material up in stars, the metallicity of the interstellar 
medium will eventually settle to the value Z = p. After a sufficiently long 
time, most of the stars in the box will have metallicity Z ~ p, and the 
fraction of low-metallicity stars will be negligible. 
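
The approach to Z = p described above can be followed step by step. The Python sketch below holds the gas mass fixed, adds a small mass of metal-free gas at each step, locks an equal mass of gas at the current metallicity into stars, and returns the corresponding mass of fresh heavy elements. The yield p = 0.01, the unit gas mass and the function name are illustrative assumptions; the starting metallicity is the closed-box value after 90% of the gas has been consumed.

```python
import math

def feed_box(z_start, p, m_gas, accreted_mass, steps=20_000):
    """Evolve the gas metallicity while accretion exactly balances star formation."""
    m_heavy = z_start * m_gas
    d_m = accreted_mass / steps
    for _ in range(steps):
        z = m_heavy / m_gas
        # metals returned by the new stars minus metals locked up in them
        m_heavy += p * d_m - z * d_m
    return m_heavy / m_gas

p = 0.01                   # illustrative yield only
z0 = p * math.log(10.0)    # closed-box metallicity at gas fraction 0.1
for n_gas_masses in (1.0, 3.0, 10.0):
    print(n_gas_masses, feed_box(z0, p, 1.0, n_gas_masses) / p)
```

The printed ratio Z/p relaxes exponentially towards unity once a few gas masses have been processed.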

This qualitative picture can be formulated mathematically. If the total
mass of material $M_t$ in the box varies, equations (5.34) and (5.35) remain
valid. However, we now have

$$\delta M_t = \delta M_s + \delta M_g. \tag{5.53}$$

If we eliminate $\delta M_h$ and $\delta M_s$ between equations (5.34) and (5.35) with the
aid of (5.53), we obtain

$$\delta Z = \frac{1}{M_g}\left[(p - Z)\,\delta M_t - p\,\delta M_g\right]. \tag{5.54}$$

When we divide this equation by $\delta M_t$, we obtain a differential equation for
the evolution of $Z$:

$$\frac{dZ}{dM_t} = \frac{1}{M_g}\left[p - Z - p\,\frac{dM_g}{dM_t}\right]. \tag{5.55}$$

A change of variable enables us to write this equation in a particularly simple
form:

$$\frac{dZ}{du} + Z = p\left(1 - \frac{d\ln M_g}{du}\right), \quad\text{where}\quad u \equiv \int_0^{M_t} \frac{dM_t'}{M_g}. \tag{5.56}$$

The general solution is

$$Z = p\left(1 - Ce^{-u} - e^{-u}\int_0^u du'\, e^{u'}\,\frac{d\ln M_g}{du'}\right). \tag{5.57}$$

The simplest solution describes the case in which the gas mass $M_g$ is
constant and the initial metallicity of the gas is zero. Then

$$Z = p\left[1 - \exp\left(1 - \frac{M_t}{M_g}\right)\right] \qquad (M_g = {\rm constant}). \tag{5.58}$$

Thus in this model, $Z \simeq p$ once $M_t \gg M_g$. Clearly, the mass contained
in stars that are more metal-poor than $Z$ is $M_s(<Z) = M_t(Z) - M_g$. By
equation (5.58) this is predicted to be

$$M_s(<Z) = -M_g \ln\left(1 - \frac{Z}{p}\right) \qquad (M_g = {\rm constant}). \tag{5.59}$$

The yield $p$ can be determined from the current gas fraction $M_g/M_t(t_0)$ and
the current metallicity $Z(t_0) \simeq Z_\odot$ of the interstellar gas through equation
(5.58). For example, in the solar neighborhood, $M_t(t_0) \simeq 10M_g$ and $Z(t_0) \simeq
Z_\odot$, so by equation (5.58) we have $p \simeq Z_\odot$. Equation (5.59) then yields

$$M_s(<0.25Z_\odot) \simeq 0.3M_g \simeq 0.03M_t \qquad (M_g = {\rm constant}). \tag{5.60}$$

Thus this simple model predicts that about 3% of all dwarf stars should be
more metal poor than $0.25Z_\odot$, which is only slightly in excess of the fraction
of such stars found in the solar neighborhood (see §10.7.2).
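
The arithmetic behind the 3% estimate can be laid out explicitly. The Python sketch below inverts the constant-gas-mass solution for the yield and then evaluates the mass of stars below a quarter of solar metallicity; the function names are ours, and the inputs (total mass ten times the gas mass, present gas metallicity equal to Z_⊙ = 0.02) are those quoted in the text.

```python
import math

def yield_from_state(z_now, mt_over_mg):
    """Invert Z = p [1 - exp(1 - M_t/M_g)] for the yield p."""
    return z_now / (1.0 - math.exp(1.0 - mt_over_mg))

def stars_below(z, p, m_gas=1.0):
    """Stellar mass with metallicity below z: -M_g ln(1 - z/p)."""
    return -m_gas * math.log(1.0 - z / p)

Z_SUN = 0.02
p = yield_from_state(Z_SUN, 10.0)
m_poor = stars_below(0.25 * Z_SUN, p)   # in units of the gas mass M_g
print(p / Z_SUN, m_poor, m_poor / 10.0)
```

The three printed numbers are the yield in solar units, the metal-poor stellar mass in units of the gas mass, and the same mass as a fraction of the total mass.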

Various modifications of this simple accretion model are possible.
Suppose, for example, that only a fraction $1 - q$ of the infalling gas is locked
up in star formation, so that $\delta M_g = q\,\delta M_t$. Then one can show (see
Problem 5.13) that the number of low-metallicity stars is reduced below the value
given in equation (5.60).


5.4 Evolution of stellar populations 


Within a galaxy there are stars of every mass and frequently stars of widely 
differing ages. In this section we use the theory of stellar evolution to predict 
the observable properties of such stellar agglomerates. Even with the HST, 
colors and spectra can be obtained for individual stars only if they lie in the 
nearest galaxies; for most galaxies we can only analyze samples of radiation 
to which thousands or millions of stars make significant contributions. The 
question therefore arises, what can we learn from the colors or the spectrum 
of such integrated light about the nature of the stars in a galaxy? Presumably 
we will not be able to reconstruct the galaxy’s CM diagram from a single 
spectrum, but we might hope to estimate the mean metallicity of the system 
and perhaps constrain its age. Although a great deal of effort has gone 
into this problem over two decades, it is still not solved. Nevertheless, solid 
progress has been made, and several important results are available.

The simplest case is that in which all the system’s stars formed at the
same time. Not only can we derive some interesting analytical results for this 
case of coeval stars, but, as we shall see in Chapter 6, it describes rather accu- 
rately the status of globular clusters. Moreover, we can regard a population 
with any history of star formation as a superposition of coeval populations 
of different ages, so once we know how a coeval population evolves it is a 
straightforward matter to determine the evolution of any other population. 


5.4.1 Analytical results 


We start by deriving some approximate analytical results that will tell us
what to expect from detailed numerical models. From Table 5.2 we see that
stars that are more massive than $1.25\,M_\odot$ live less than 5 Gyr. Thus any
coeval stellar population older than $\sim 5$ Gyr will contain only stars of initial
mass $M_i \lesssim 1.25\,M_\odot$. Table 5.2 and Figure 5.2 show that these low-mass
stars emit most of their light during the 1 Gyr or so that they are on the
RGB, HB and AGB. In view of this result, it is not surprising that most of
the integrated light from an old coeval population comes from evolved stars
(see Problem 5.9). Some important conclusions follow easily from this fact.

The first conclusion depends on the fact that the effective temperatures
of the RGB, HB and AGB change only slowly with the masses of the stars
on them for $M_i \lesssim 2\,M_\odot$. Consequently, the intrinsic colors of an old coeval
population should not have evolved strongly since the epoch, about 1.5 Gyr
after its formation, when all stars with $M_i \gtrsim 2\,M_\odot$ had died.

By contrast, the luminosity of a coeval population will still be evolving
(Tinsley 1972). To see this, let there be $dN(M)$ stars in the population with
mass in the range $M$ to $M + dM$, and let the stars that turn off the MS at age
$t$ be of mass $M_{\rm GB}(t)$. Table 5.2 shows that for $t \gtrsim 1.5$ Gyr, $M_{\rm GB} \lesssim 1.5\,M_\odot$,
and that the time $t_{\rm GB}$ that the star spends on the giant branch is much less
than $t$. Hence if stars of mass $M$ emit total energy $E_{\rm GB}(M)$ when they are
on the giant branch, the luminosity of the population may be estimated as

$$L \simeq E_{\rm GB}(M_{\rm GB})\,\left.\frac{dN}{dM}\right|_{M_{\rm GB}} \left|\frac{dM_{\rm GB}}{dt}\right|. \tag{5.61}$$

For the masses of interest, the MS lifetime of a star, $\tau_{\rm MS}$, is related to its
mass by

$$\tau_{\rm MS} \simeq 10\left(\frac{M}{M_\odot}\right)^{-2.5} {\rm Gyr}, \quad\text{which implies}\quad M_{\rm GB} \simeq \left(\frac{t}{10\,{\rm Gyr}}\right)^{-0.4} M_\odot, \tag{5.62}$$

so that

$$\left|\frac{dM_{\rm GB}}{dt}\right| \simeq 0.04\left(\frac{M_{\rm GB}}{M_\odot}\right)^{3.5} \frac{M_\odot}{\rm Gyr}. \tag{5.63}$$

We may fit the IMF (§5.1.9) $\xi = dN/dM$ in the neighborhood of $M_{\rm GB}$ by
a power law

$$\frac{dN}{dM} \simeq K\left(\frac{M}{M_\odot}\right)^{-\alpha}. \tag{5.64}$$

In the solar neighborhood $\alpha \simeq 2.5$ for $M \lesssim M_\odot$. Substituting equations
(5.63) and (5.64) into equation (5.61), we find

$$L \simeq \frac{K M_\odot E_{\rm GB}(M_{\rm GB})}{25\,{\rm Gyr}}\left(\frac{M_{\rm GB}}{M_\odot}\right)^{3.5 - \alpha}. \tag{5.65}$$

Differentiating this expression yields

$$\frac{d\ln L}{d\ln t} = \left[\frac{d\ln E_{\rm GB}}{d\ln M_{\rm GB}} + 3.5 - \alpha\right]\frac{d\ln M_{\rm GB}}{d\ln t} = 0.4\alpha - \left(1.4 + 0.4\,\frac{d\ln E_{\rm GB}}{d\ln M_{\rm GB}}\right). \tag{5.66}$$

$E_{\rm GB}$ probably depends only weakly on $M_{\rm GB}$ ($0 < d\ln E_{\rm GB}/d\ln M_{\rm GB} < 1$),
so unless $\alpha > 3.5$, the luminosity of an old coeval population is expected
to be a decreasing function of time. In fact, detailed models by Tinsley
& Gunn (1976) yield $d\ln L/d\ln t \simeq 0.3\alpha - 1.6$, in good agreement with
equation (5.66). Thus we can reliably predict the luminosity evolution of a
coeval population if we can accurately determine $\alpha$.
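
The comparison between the analytic slope and the Tinsley & Gunn fit can be made concrete with a short Python sketch. It assumes the main-sequence lifetime scaling $\tau_{\rm MS} \simeq 10\,(M/M_\odot)^{-2.5}$ Gyr, which is the scaling implied by the factor 25 Gyr in equation (5.65); the function names and the sample values of α are ours.

```python
def turnoff_mass(t_gyr):
    """Turnoff mass in solar masses, from tau_MS = 10 (M/Msun)^-2.5 Gyr."""
    return (t_gyr / 10.0) ** -0.4

def dlnL_dlnt(alpha, dlnE_dlnM=0.0):
    """Analytic slope of equation (5.66)."""
    return 0.4 * alpha - (1.4 + 0.4 * dlnE_dlnM)

def tinsley_gunn_slope(alpha):
    """Fit to detailed models quoted in the text: 0.3 alpha - 1.6."""
    return 0.3 * alpha - 1.6

print("turnoff mass at 5 and 10 Gyr:", turnoff_mass(5.0), turnoff_mass(10.0))
for alpha in (2.0, 2.35, 3.0):
    print(alpha, dlnL_dlnt(alpha, 0.0), dlnL_dlnt(alpha, 1.0), tinsley_gunn_slope(alpha))
```

For slopes near the Salpeter value the numerical fit lies close to the analytic result evaluated at the upper end of the allowed range of $d\ln E_{\rm GB}/d\ln M_{\rm GB}$.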

Ostriker & Thuan (1979) pointed out that the mass of heavy elements
that is produced by a coeval population is related in a simple way to the
present luminosity $L$ of the population and the slope of the IMF $\alpha$ defined by
equation (5.64). Most of the present luminosity of the population comes from
stars whose original masses lay in a narrow range around the characteristic
mass $M_{\rm GB}$ of stars on the present giant branch, while the heavy elements
generated in these systems were all contributed by stars more massive than
some mass $M_Z > M_{\rm GB}$. Suppose these massive stars return a fraction $f$ of
their original mass to the interstellar medium in the form of heavy elements.
Then the mass of heavy elements produced is

$$M_h = f\int_{M_Z}^{\infty} M\,\frac{dN}{dM}\,dM. \tag{5.67}$$

If we take the IMF, $\xi = dN/dM$, to be of the form of equation (5.64) for
$M < M_{\rm max}$ and zero for $M > M_{\rm max}$, we have

$$M_h = fKM_\odot^2 \int_{M_Z/M_\odot}^{M_{\rm max}/M_\odot} \left(\frac{M}{M_\odot}\right)^{1-\alpha} d\!\left(\frac{M}{M_\odot}\right) = \frac{fKM_\odot^2}{\alpha - 2}\left[\left(\frac{M_\odot}{M_Z}\right)^{\alpha-2} - \left(\frac{M_\odot}{M_{\rm max}}\right)^{\alpha-2}\right]. \tag{5.68}$$

If $\alpha > 2$ and $M_{\rm max} \gg M_Z$, we may neglect the second term in the square
bracket of (5.68). Then, dividing (5.68) by equation (5.65), we obtain the
mass of heavy elements produced per unit present luminosity as

$$\frac{M_h}{L} = \frac{fM_\odot}{\alpha - 2}\,\frac{25\,{\rm Gyr}}{E_{\rm GB}(M_{\rm GB})}\left(\frac{M_Z}{M_\odot}\right)^{2-\alpha}\left(\frac{M_{\rm GB}}{M_\odot}\right)^{\alpha - 3.5}. \tag{5.69}$$

Inserting into this equation reasonable values of the parameters $f \simeq 0.1$,
$\alpha \simeq 2.4$, $M_{\rm GB} \simeq 0.85\,M_\odot$ and $E_{\rm GB} \simeq 2.9 \times 10^{10}\,L_\odot$ yr, we have

$$\frac{M_h}{L} \simeq 0.26\left(\frac{M_Z}{M_\odot}\right)^{2-\alpha} M_\odot L_\odot^{-1}. \tag{5.70}$$

It is less clear what value we should adopt for $M_Z$: if we are interested
in elements primarily produced in core-collapse supernovae, $M_Z = 8\,M_\odot$
is appropriate; if we are concerned with the abundances of elements such
as C, N, and O, a smaller value $M_Z \simeq 2\,M_\odot$ is appropriate. Fortunately,
for $\alpha \simeq 2.4$ equation (5.70) is not sensitive to $M_Z$: with $M_Z = 8\,M_\odot$ we
find $(M_h/L) = 0.11\,M_\odot L_\odot^{-1}$, and with $M_Z = 2\,M_\odot$ we find $(M_h/L) =
0.20\,M_\odot L_\odot^{-1}$.
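
The numerical coefficients quoted above can be reproduced directly from equation (5.69). In the Python sketch below the default parameters are the values given in the text, with the giant-branch energy taken as 2.9 × 10^10 L_⊙ yr, the power of ten that reproduces the coefficient 0.26; the function name is ours.

```python
def metals_per_luminosity(m_z, f=0.1, alpha=2.4, m_gb=0.85, e_gb=2.9e10):
    """Equation (5.69) in Msun/Lsun; masses in Msun, e_gb in Lsun yr."""
    prefactor = f * 25.0e9 / (e_gb * (alpha - 2.0))   # 25 Gyr expressed in yr
    return prefactor * m_z ** (2.0 - alpha) * m_gb ** (alpha - 3.5)

for m_z in (1.0, 2.0, 8.0):
    print(m_z, round(metals_per_luminosity(m_z), 3))
```

Because the exponent 2 - α is only -0.4, a factor of four in $M_Z$ changes the result by less than a factor of two.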


5.4.2 Numerical models of population evolution 


A tremendous quantity of observational data for stellar systems of different 
types, observed at different stages in their evolution, is now available. Ade- 
quate exploitation of these data requires much more precise predictions than 
the simple analytical results that were derived in the last subsection. Since 
Beatrice Tinsley’s pioneering work in the 1970s, several groups have worked 
hard on numerical models of population evolution that use the best current 
knowledge of stellar evolution to predict the observable properties of a coeval 
population. We now take a look at the predictions of these models and their 
limitations. We start by considering predictions for broad-band colors. 

In §5.1.1 we described how apparent magnitudes and broad-band colors 
can be assigned to model stars. The same techniques can also be used to as- 
sign spectral indices (§3.4). Given these data, it is a straightforward matter 
to derive integrated magnitudes, colors and spectral indices by adding to- 
gether the contributions of all the model stars in a theoretical CM diagram. 
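
Adding the contributions of the model stars means summing fluxes, not magnitudes. The Python sketch below shows the bookkeeping for a population binned in the CM diagram; the three bins, their star counts and their absolute magnitudes are hypothetical numbers chosen only to illustrate the calculation.

```python
import math

def integrated_magnitude(counts, mags):
    """Magnitude of the summed light of counts[i] stars of magnitude mags[i]."""
    flux = sum(n * 10.0 ** (-0.4 * m) for n, m in zip(counts, mags))
    return -2.5 * math.log10(flux)

# Hypothetical population: main-sequence, subgiant and giant bins.
counts = [1000, 20, 1]
m_b = [6.3, 1.8, 0.4]    # absolute B magnitudes per star in each bin
m_v = [5.6, 0.9, -1.0]   # absolute V magnitudes per star in each bin

total_b = integrated_magnitude(counts, m_b)
total_v = integrated_magnitude(counts, m_v)
print("M_B =", total_b, " M_V =", total_v, " B-V =", total_b - total_v)
```

The integrated color is the difference of the two integrated magnitudes, so it is weighted towards whichever bins dominate the light in each band.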

Charlot, Worthey & Bressan (1996) give a clear insight into the strengths
and limitations of model populations through detailed comparison of results
obtained with three independent modeling codes. They find that $B - V$
colors are typically uncertain by $\sim 0.03$ mag, while $V - K$ colors are uncertain
by 0.1 to 0.14 mag. They find that the prime causes of these uncertainties
are differences in the assumptions made regarding the evolution of stars
once they leave the MS. Smaller but significant causes of uncertainty are
differences in the way colors are assigned to stars of specified position in the
theoretical CM diagram.

Figure 5.19 The evolution of $B - V$, $V - K$, and $\Upsilon_V$ with time of a coeval population
of solar metallicity that has a Salpeter IMF ($\alpha = 2.35$). [From data published in Charlot,
Worthey & Bressan (1996)]

Figure 5.19 shows the predictions of Charlot, Worthey & Bressan (1996)
for the evolution of $B - V$, $V - K$ and the mass-to-light ratio $\Upsilon_V$ for a coeval
population of solar metallicity. The population becomes steadily redder, but,
in accordance with our analytical approximations, the colors change fairly
slowly after $\sim 5$ Gyr. Meanwhile, $\Upsilon_V$ increases at a roughly constant rate,
just as our analytic work predicted.$^{19}$

The full curve in Figure 5.20 shows the evolution within the $(V - K, B - V)$
plane of the model population whose colors and mass-to-light ratio are
plotted in Figure 5.19. Also shown in this figure are the colors of 12
elliptical galaxies from the study of Peletier (1989); for each galaxy a filled
triangle marks the central color, while an open triangle shows the color at
$r \simeq 0.75R_e$. The open triangles are associated with substantial errors; the
cross at the bottom right of the figure shows the typical error in the locations
of these triangles. The error in the location of any full triangle should be
smaller and correlated with the error in the location of the associated open
triangle. When these errors and those in the location of the full curve are
borne in mind, it is apparent that the galaxy data are consistent with the
models. Moreover, there is a strong suggestion that the galaxies have a
non-negligible spread along the full curve. Does this imply that there is a spread
of several billion years in the ages of these galaxies’ stellar populations?

19 The absolute value of $\Upsilon_V$ in Figure 5.19 is determined by the arbitrary truncation
of the IMF $\xi(M) \propto M^{-2.35}$ at $M = 0.15\,M_\odot$.

Figure 5.20 The curve with squares shows the evolution of a coeval
population of solar metallicity in the $(V - K, B - V)$ plane. After
2 Gyr the population has reached the lowest left square, and with
each additional Gyr it moves up the curve by a further square; the
last point corresponds to 17 Gyr. The triangles show central (filled)
and peripheral (open) colors of 12 elliptical galaxies; the two points
of each galaxy are joined by a line. The cross at bottom right
indicates the typical error in the positions of the open triangles.
[The curve is from data published in Charlot, Worthey & Bressan
(1996) and the galaxy data are from Peletier (1989)]

Notice that the full curve in Figure 5.20 implies that a spread in age of
$\pm 3$ Gyr around an age of 13 Gyr would be only barely detectable with data
of the quality plotted in the diagram. Thus, if the spread of the triangles in
Figure 5.20 does arise from age differences, the latter have to be substantial
and, by the lower panel of Figure 5.19, be associated with variations in $\Upsilon_V$
of at least a factor 2.

Figure 5.21 suggests that age differences do contribute significantly to
the spread in galaxy colors by showing a two-color distribution for 34
elliptical galaxies. The color indices plotted, $(V - B)^{-21}$ and $(U - B)^{-21}$,
are not measured colors, but colors corrected for the color-magnitude effect
(§4.3.4) to absolute magnitude $M_B = -21$. This correction has the effect
of producing a highly linear distribution of points within the two-color plot.
The key fact about Figure 5.21 is that the two bluest galaxies are systems
that show clear morphological evidence of having recently emerged from a
violent merger event; the points belonging to these systems, NGC 3921 and
NGC 7252, are marked with squares in Figure 5.21. As we saw in §4.6.1,
there can be no doubt that within these systems there are many stars that
are $\lesssim 1$ Gyr old, so the blue colors of these galaxies surely reflect a relatively
young mean age of the systems. In view of this finding, it is natural to
suppose that galaxies that fall just above these merger remnants are also blue
because they are comparatively young. Schweizer & Seitzer (1992) use this
approach to estimate the epochs in the past when a sample of elliptical and
S0 galaxies emerged from a major merger.

Unfortunately, the conclusion that we reached above, namely that
anomalously blue colors imply youth, is insecure because the colors of a young
population can be accurately mimicked by a somewhat older but less metal-rich
population. Quantitatively, Worthey (1994) finds that

$$\left(\frac{\partial \ln t}{\partial \ln Z}\right)_{\rm colors} \simeq -\frac{3}{2}, \tag{5.71}$$

with the result that a 20% increase in $Z$ can be compensated for by a $\sim 30\%$
decrease in $t$. Therefore from colors alone one cannot hope to determine the
age or metallicity of a stellar population.
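
The size of the compensating age change follows from treating the degeneracy slope of -3/2 as a power law, $t \propto Z^{-3/2}$ at fixed colors. The Python sketch below compares this with the linearized estimate; the function names are ours, and extending the local slope to a finite change in Z is an assumption made for illustration.

```python
def age_factor(z_factor, slope=-1.5):
    """Factor by which t must change to keep colors fixed when Z is scaled by z_factor."""
    return z_factor ** slope

def linearized_age_change(z_fractional_change, slope=-1.5):
    """First-order estimate: fractional change in t for a small fractional change in Z."""
    return slope * z_fractional_change

print(age_factor(1.2) - 1.0)        # power-law estimate of the fractional age change
print(linearized_age_change(0.2))   # linearized estimate
```

The linearized estimate gives the 30% decrease quoted above; the power-law form gives a somewhat smaller decrease for the same 20% increase in Z.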

Can we do better if we have a good spectrum of the population rather
than just its broad-band colors? Consider first galaxies with measurable
emission lines. In §8.1.3 we shall see that the rate at which the stars of
a disk galaxy emit ionizing photons can be determined from the strength
$I({\rm H}\alpha)$ of the H$\alpha$ emission line at 656 nm in the galaxy’s spectrum. Ionizing
photons are produced only by hot stars, and most hot, luminous stars are
short-lived. Hence, a measurement of $I({\rm H}\alpha)$ allows one to estimate the
rate at which massive stars are forming. The continuum radiation against
which the H$\alpha$ line is observed receives important contributions from
longer-lived stars, so its intensity is a measure of how many stars there are in
the galaxy and is insensitive to the galaxy’s star-formation history. From
these considerations it follows that the equivalent width (§3.3.1) EW(H$\alpha$) of
the H$\alpha$ line is sensitive to both the IMF, which determines what fraction of
each generation of stars will produce ionizing radiation, and the present
star-formation rate, SFR. The difference between galaxies in which star formation
is still an important process and ones in which it has effectively ceased is
usually quantified by the birthrate parameter $b$,

$$b \equiv \frac{\rm SFR}{\langle{\rm SFR}\rangle}, \tag{5.72}$$

where the angle brackets imply an average over the galaxy’s lifetime.
Figure 5.22 shows EW(H$\alpha$) as a function of $b$ for various models. The squares
are for models that have a Salpeter IMF [equation (5.14)], while triangles
are for models with the Scalo IMF [equation (5.16)]. The impact of the
larger numbers of massive stars predicted by the Salpeter IMF is apparent.
All models with $b < 1$ have star-formation rates that decline exponentially
with time. Two types of models with $b > 1$ are plotted: the open points
are for models in which a current burst of star formation is superimposed
on a constant background star-formation rate, while the filled points are
for models in which the star-formation rate rises exponentially towards the
present epoch. While the relation between $b$ and the equivalent width of H$\alpha$
is not independent of star-formation history, it depends most sensitively on
the adopted IMF.
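
For an exponential star-formation history the birthrate parameter has a closed form. If SFR(t) is proportional to exp(-t/τ) and the galaxy is observed at age T, then averaging over the lifetime gives b = x/(e^x - 1) with x = T/τ. The Python sketch below evaluates this; the function name and the sample ages and timescales are illustrative assumptions, and a negative τ is used to represent a rate that rises towards the present.

```python
import math

def birthrate_parameter(age, tau):
    """b = SFR(now)/<SFR> for SFR(t) ~ exp(-t/tau), observed at time `age`."""
    x = age / tau
    if x == 0.0:
        return 1.0   # constant star-formation rate (tau -> infinity)
    return x / math.expm1(x)

for tau in (2.0, 5.0, 10.0, math.inf, -10.0):
    print(tau, birthrate_parameter(10.0, tau))
```

Declining histories give b < 1, a constant rate gives b = 1, and rising histories give b > 1, matching the three families of models described above.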

Consider now what can be learned from the absorption-line spectrum
that a typical stellar population produces. In practice integrated galaxy
spectra never have high spectral resolution because the random velocities of
stars within the population give rise to significant spectral broadening that
washes out all narrow spectral features (see Chapter 11). In §3.4 we defined
several spectral indices that are designed to measure prominent spectral
features in medium- and low-resolution spectra (Table 3.8). The first step in an
analysis of an integrated spectrum is to determine from it the values of up to
a dozen such spectral indices. The measured values of these indices depend
on the stellar velocity dispersion within the galaxy and must be corrected to
zero velocity dispersion before being compared with the measured indices of
nearby stars of known chemical composition.

Figure 5.23 The metallicity $Z$ implied by a given value of Mg$_2$
depends upon the age of the stellar population. Here Mg$_2$ is plotted
against age for several values of $Z$. The error bar at upper left
indicates the uncertainty in this calibration that is introduced by
allowing $Y$ to vary from 0.26 to 0.32. [From data published in
Casuso et al. (1996)]

We constrain the nature of a galaxy’s stellar population by comparing 
the measured index values with values determined for various model popula- 
tions. The procedure for predicting the values of spectral indices for a stellar 
population is very similar to the procedure described above for predicting 
the population’s colors: at each point in the theoretical CM diagram we 
identify a star in our library and add its measured spectral indices into the 
model’s indices with an appropriate weighting. Clearly the final predicted 
spectral indices will be subject to uncertainties similar to those which plague 
predicted colors. 
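
The weighting step can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the procedure of any particular synthesis code: we assume that each point of the CM diagram contributes a number of stars (`weights`) and a continuum flux per star near the feature (`continuum_fluxes`), that equivalent-width indices combine as continuum-flux-weighted means, and that magnitude-type indices such as Mg2 combine through their line-to-continuum flux ratios. All function and argument names are hypothetical.

```python
import math

def combine_equivalent_widths(weights, continuum_fluxes, equivalent_widths):
    """Continuum-flux-weighted mean of an equivalent-width index."""
    pairs = list(zip(weights, continuum_fluxes, equivalent_widths))
    total = sum(w * f for w, f, _ in pairs)
    return sum(w * f * ew for w, f, ew in pairs) / total

def combine_magnitude_indices(weights, continuum_fluxes, indices_mag):
    """Combine a magnitude-type index by averaging line/continuum flux ratios."""
    pairs = list(zip(weights, continuum_fluxes, indices_mag))
    total = sum(w * f for w, f, _ in pairs)
    # Convert each index to a flux ratio, average, then convert back.
    ratio = sum(w * f * 10.0 ** (-0.4 * idx) for w, f, idx in pairs) / total
    return -2.5 * math.log10(ratio)

# Two illustrative CM-diagram points: one star of the first kind, three of the second.
print(combine_equivalent_widths([1, 3], [1.0, 1.0], [2.0, 4.0]))
print(combine_magnitude_indices([1, 3], [1.0, 1.0], [0.10, 0.30]))
```

Because the weights involve the continuum flux, a few luminous giants can dominate an index even when dwarfs dominate by number.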

Figure 5.23 shows the dependence of one of the most widely studied
spectral indices, Mg2, upon Z and the age t of a coeval stellar population.
Evidently, a measured value of Mg2 does not uniquely determine either Z or
t. How much better can we do if we measure several different indices? The
Z-sensitivity of an index or color I is defined to be the value of the derivative
$(\partial\log t/\partial\log Z)_I$ at constant I. For example, equation (5.72) states that
the Z-sensitivity of colors is of order −1.5. Table 5.5 indicates the principal
sensitivities of the indices that are defined in Table 3.8. Each index is
sensitive both to certain element abundances and to the population’s age
t. All indices have significant dependence on more than one element, and
this dependence is frequently in the sense that increasing the abundance of
the element decreases the value of the index; in Table 5.5 this condition is
indicated by brackets around the element’s symbol. In every case, increasing
the overall abundance level affects the index in the same sense as increasing
the population’s age, so that an increase in Z must be offset by a decrease
in t if the index is to remain unchanged. This fact accounts for the negative
signs in the last column of the table, which gives values for the derivative
(5.71) at constant index. Several of these Z-sensitivities are similar to the
Z-sensitivity of broad-band colors [equation (5.71)]. These indices, which unfortunately
include the widely measured Mg indices, cannot help one separate the effects
of varying age and metallicity. Some other indices do have Z-sensitivities
significantly different from −1.5 and in principle we may determine Z and t
independently by combining these indices with either broad-band colors or
indices with Z-sensitivities ~ −1.5.

Table 5.5 Sensitivities of spectral indices defined by
Faber et al. (1985)

Index   Dominant   Others            (∂ log t/∂ log Z)_index
G       CH         (O)               −1.0
Mg b    Mg         (C)               −1.7
Fe1     Fe         C, (Mg), Ca       −2.3
Fe2     Fe         (C), (Mg), Cr     −2.8
Na      Na         C                 −2.1
Hβ      H          (Mg), (Cr), C     −0.6
CN      CN         C, N, (O)         −2.0
Mg1     C          Mg                −1.8
Mg2     Mg         C                 −1.8
TiO1    TiO        Ti, (Fe)          −1.5
TiO2    TiO        V, Sc, Ti         −2.5

NOTES: Elements with brackets around them in the
“others” column contribute negatively to the index.

SOURCE: Worthey (1996)

The simultaneous determination of Z and t from quantities that all
depend on both t and Z is, in practice, fraught with difficulty. There are
two basic problems. First, the smaller the range that is covered by the Z-
sensitivities of the quantities one employs, the more accurately each quantity
must be determined for given final uncertainties in t and Z. A second and
more fundamental difficulty is that, as we have seen, it is unlikely that a 
stellar population can be characterized by a single metallicity Z — see §§6.1.5 
and 10.4.1. Rather, we expect the abundances of different elements relative to 
H to vary. Therefore the number of quantities to be determined is in reality 
greater than two: at the very least we should aim to determine separate
abundances for α and iron-peak elements in addition to the age t, and ideally
we would test the predictions of the theory of stellar evolution by determining 
independent abundances for every significant element. Clearly, if it is difficult 
to determine unique values for two quantities from the available data, we 
have no hope of determining from those data unique values of three or more 
quantities. 
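
The first of these problems can be illustrated with a small linearized model. The Python sketch below assumes, purely for illustration, that each measured quantity responds to small changes as ΔI = Δlog t − s Δlog Z, where s is its Z-sensitivity and the quantity has been scaled so that its response to log t is unity; the function names and this normalization are our assumptions rather than part of any published calibration.

```python
import math

def solve_age_metallicity(d_index_1, d_index_2, s1, s2):
    """Invert two linearized measurements for (delta log t, delta log Z)."""
    det = s2 - s1
    if det == 0.0:
        raise ValueError("identical Z-sensitivities: age and Z cannot be separated")
    d_log_z = (d_index_1 - d_index_2) / det
    d_log_t = (s2 * d_index_1 - s1 * d_index_2) / det
    return d_log_t, d_log_z

def error_amplification(s1, s2, sigma=1.0):
    """Uncertainties in (log t, log Z) for independent errors sigma on each quantity."""
    det = abs(s2 - s1)
    sigma_log_t = sigma * math.hypot(s1, s2) / det
    sigma_log_z = sigma * math.sqrt(2.0) / det
    return sigma_log_t, sigma_log_z

# Sensitivities taken from the last column of Table 5.5.
print(error_amplification(-1.7, -1.8))  # Mg b with Mg2: nearly degenerate
print(error_amplification(-0.6, -1.8))  # H-beta with Mg2: better separated
```

In this toy model the uncertainty in log Z scales inversely with the difference between the two Z-sensitivities, which is why pairing two Mg indices is far less informative than pairing Hβ with a metal-line index.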

The ideal resolution of this difficulty would be to find an index that 
depended only on t — the Z-sensitivity of such an index would be zero because, 
formally, an infinite change in Z would be needed to compensate for an 
infinitesimal change in t. If an index of zero Z-sensitivity were available,
we could at the outset determine the age of the population and then hold t
fixed while determining the relative abundances of the elements from other
indices. The only spectral lines that have any prospect of being insensitive
to metal abundances are lines of hydrogen. Table 5.5 shows that while Hβ
is less sensitive to metallicity than any other spectral index, it does have
a significantly non-zero Z-sensitivity. Jones & Worthey (1995) show that
indices that measure higher-frequency transitions in the Balmer series, such
as Hγ and Hδ, can have negligible Z-sensitivity. Unfortunately, to measure
these indices higher-resolution spectra are required than those on which the 
system of Faber et al. (1985) is based, with the result that few observational 
values of these indices are currently available. 


Problems 


5.1 Use Figure 5.7 to calculate the fraction of the mass of a stellar population 
that is returned to the ISM within 10 Gyr. Assume that the population forms 
with a Salpeter IMF, and adopt a lower mass cutoff of 0.1 M⊙.


5.2 Show that the mean molecular weight of a fully ionized H/He plasma 
lies near μ = 0.58 for a primordial abundance of He, and near 0.61 for the
solar abundance of He. 


5.3 Use the virial theorem to estimate the thermal energy in the Sun (assum- 
ing that the Sun has uniform density). Hence estimate the Sun’s Helmholtz 
time — the time it would take to cool by a factor two in temperature if the 
nuclear energy release in it were to cease. 


5.4 Show that when a self-gravitating body of polytropic gas shrinks homologously,
its thermal energy scales with the stellar radius R as $E_{\rm th} \propto R^{3(1-\gamma)}$,
where γ is the ratio of principal specific heats. Hence show that a polytropic
star is unstable to gravitational collapse if $\gamma < 4/3$.


5.5 Check the crude estimate of main-sequence lifetimes given by equa- 
tion (5.6) against the data plotted in Figure 5.4. 


5.6 Use the virial theorem to estimate a typical value for the speed of sound
deep inside a star of mass M and radius R. Using this estimate, show that
the time required for sound to cross the star is $P \sim (G\rho)^{-1/2}$. Hence show
that, for the fundamental p-mode of a star, the pulsation constant Q that is
defined by equation (5.18) takes a value of order 1 hr.


5.7 Given that the luminosity L of a massive H-burning star varies with
mass roughly as $L \propto M^3$, show that, at fixed effective temperature, the
fundamental pulsation period of such stars scales as $P \propto L^{7/12}$.


5.8 Suppose the star-formation rate in a population varies with time t as 
e*, and let the oldest star in the population have formed at t = 0. Show 
that the analog of equation (5.12) for this case is 


tae’ 


$o(M) = 7 8(M). (5.73) 


5.9 For a population forming at time t = 0 with a Salpeter IMF with upper 
mass cutoff at M = 2 M⊙, estimate the time after which most light comes
from post-main-sequence stars. 


5.10 Suppose that the yield p varies with metallicity Z as p(Z) = 0.002 +
0.6Z. Show that the ‘effective yield,’ defined by equation (8.62), is then given
by

    $$p_{\rm eff} = \frac{0.6Z}{\ln(1 + 300Z)}. \tag{5.74}$$


5.11 Consider a protogalactic cloud of gas that contains as much H and He
as the present Milky Way and that occupies a similar volume. Assume that 
the cloud is homogeneous. Show that the cloud is optically thin to Thomson 
scattering. Estimate the supernova rate that would be required to maintain 
the cloud at its virial temperature. 


5.12 (a) By assuming that the square of the gas density within a forming
elliptical galaxy is everywhere proportional to the star density ($\rho_{\rm g}^2 = K\rho_*$;
see §8.3), express the rate of radiative cooling $Q_c$ of this phase in terms
of an integral over $\rho_*$. (b) By assuming that the presently observed stars
formed at a uniform rate over a period of 7 Gyr, and that over this period
one Type II supernova exploded per 200 M⊙ of newly-formed stars, obtain
an expression for the rate of heating $Q_h$ of the ISM by supernovae. (c) Show
that if the ISM is isothermal, then $Q_c/Q_h \propto K$. (d) Consider the instant
at which the gas mass is a fraction α of the stellar mass. Use the results of
§4.3.4 to show that at this instant $Q_c/Q_h \propto R_e^{??}$. (e) Explain within the
context of the leaky-box model why this result, taken with the metal-poverty
of low-luminosity ellipticals, implies that radiative cooling cannot have been
important for the early evolution of massive ellipticals.


5.13 In this problem we investigate a more elaborate version of the accretion
model presented in §5.3.3. We assume that initially all of the disk mass is
in metal-free gas, and that a constant fraction (1 − q) of each infalling gas
parcel $\delta M_t$ is locked up by star formation. Thus the corresponding change
in gas mass is $\delta M_g = q\,\delta M_t$.

(a) Using equation (5.57), show that a parametric solution for $Z(M_g)$ is

    $$Z = p(1-q)\left(1 - e^{-u}\right); \qquad M_g = M_{g0}\,e^{qu}, \tag{5.75}$$

where $M_{g0}$ is the initial gas mass.

(b) Show that the ratio of the stellar mass at $t_1$ to the mass in gas at the
present time $t_0$ is

    $$\frac{M_s(u_1)}{M_g(u_0)} = \frac{1-q}{q}\left(e^{qu_1} - 1\right)e^{-qu_0}, \tag{5.76}$$

where $u_1$ is the value of the parameter u at time $t_1$. Explain why this relation
together with observations of our galactic disk imply q < 1.

(c) Now consider the case $u_0 \gg 1$, and let $u_1$ be an epoch at which the
metallicity $Z_1$ was substantially lower than $Z_0$. Show (i) that the present
metallicity $Z_0 \simeq p(1-q)$; (ii) that $u_1 \simeq -\ln(1 - Z_1/Z_0) \ll 1$; (iii) that

    $$\frac{M_s(u_1)}{M_g(u_0)} \simeq -(1-q)\ln\left(1 - \frac{Z_1}{Z_0}\right)e^{-qu_0}. \tag{5.77}$$

This formula differs from that obtained for the simple accretion model of
§5.3.3 [equation (5.59)] only by the presence of the factor $e^{-qu_0}$. Since this
factor can take any value from 0 to 1 depending on q and $u_0$, it follows that
in this modified accretion model the fraction of low-metallicity stars can be
made arbitrarily small.


5.14 Figure 5.19 shows the evolution of Υ_V for a population with a Salpeter
IMF with lower mass cutoff M = 0.15 M⊙. How should the Υ_V scale be
changed to make the figure appropriate for mass cutoff M = 0.015 M⊙?


5.15 Every entry in the last column of Table 5.5 is negative. Hence, show 
that the spectral indices of a coeval population all increase as the population 
ages. [Hint: you may wish to use the formula $(\partial x/\partial y)_z = -(\partial z/\partial y)_x/(\partial z/\partial x)_y$, where


Star clusters


In addition to a smooth distribution of stars, the Galaxy contains numerous 
agglomerations of stars. The study of these star clusters has played a 
key role in the development of our understanding of the Universe. Clusters 
vary enormously in the compactness of their structure, their luminosities 
and their metallicities. The more compact, luminous and metal-poor objects 
are classified as globular clusters, while the less compact, less luminous 
and more metal-rich objects are called open clusters. In this chapter, we 
present the wealth of information that has been gleaned from these systems. 


6.1 Globular clusters 


Globular clusters are roughly spherical agglomerations of stars found through- 
out the Milky Way (and, indeed, in other galaxies; §4.5) at positions ranging 
from near the Galactic center out to remote regions in the halo (§10.5.1). 
The study of these systems has played a key role in the development of our 
understanding of the Universe. William Herschel’s resolution of these diffuse 
objects into individual stars led him to speculate that more distant nebulae 
might also be made up of many unresolved stars, and, as we saw in Chapter 1, 
Shapley’s study of the distribution of globular clusters produced a revolution 
in our understanding of Galactic structure. Even today, as we will see below, 
studies of globular clusters have a bearing on such fundamental questions as
the age of the Universe and the manner in which the Milky Way formed. 
Since they are some of the simplest-looking structures in the Universe, glob- 
ular clusters also provide a useful test-bed for theories of stellar dynamics: 
if we cannot explain the properties of these objects, what hope do we have 
of understanding any of the more complex structures in the Cosmos? 

Quite a number of the Milky Way’s globular clusters are naked-eye 
objects which have been known since ancient times. Twenty-nine found 
their way into Messier’s catalog of objects that should not be confused with 
comets, and still more are listed in the NGC and IC catalogs.¹ These catalogs
were compiled by eye, and so do not include the faintest and lowest
surface-brightness clusters. During the twentieth century, some of these
fainter systems were discovered serendipitously on photographic plates, while 
others were discovered in the course of systematic, large-area photographic 
sky surveys. The most famous such survey was conducted by Abell (1955), 
who discovered 13 low surface-brightness globular clusters from the Palomar 
sky survey (hence their designations Pal 1...Pal 13). Combining all these 
catalogs, we find the current census of globular clusters in the Milky Way 
numbers ~ 150 objects (Monella 1985, Harris 1996). The only clusters which 
will have slipped through this net are either very low surface-brightness objects,
or ones that lie very close to the plane of the Milky Way so that obscuration
and confusion with field stars prevent their identification. Apart
from these objects, the catalog of Galactic globular clusters is believed to 
be complete. Table 6.1 lists the physical properties of a number of the more 
widely-studied systems. 

Figure 6.1 gives an indication of the range of properties that we see in the 
Milky Way’s globular clusters. These clusters all lie at distances of ~ 10 kpc, 
so differences between the images mainly reflect intrinsic variations from 
cluster to cluster. Despite their clear differences, these systems all satisfy the 
defining requirements of a globular cluster: (i) they appear approximately 
circular in projection (and are hence presumably roughly spherical); and (ii) 
they seem to be purely stellar systems since no diffuse gas or dust is apparent 
in them. 

Although close to circular in appearance, their images are slightly ellip- 
tical. The cluster M19, which is shown in the bottom left panel of Figure 6.1, 
is amongst the most elongated of the Milky Way’s globular clusters with a 
minor-to-major axis ratio of b/a ~ 0.73. A study of 99 globular clusters by 
White & Shawl (1987) found that they have a mean axis ratio of b/a = 0.93,
with only 5% more elongated than b/a = 0.8. As we shall see in §6.1.11, 
this modest degree of elongation seems to be caused by mild rotation of the 
systems. 


1 It is slightly unfortunate that the two brightest globular clusters in the sky, 47 Tucanae
(47 Tuc) and ω Centauri (ω Cen), are both located in the southern hemisphere; their
strange designations reflect the fact that northern hemisphere astronomers, who could only 
observe these objects close to the horizon, mistook them for stars. 


Table 6.1 Characteristics of selected globular clusters

NGC   Other       R_gal   M_V      r_h       log         Spec.  [Fe/H]   HB ratio  Comments
                  (kpc)            (arcmin)  (r_t/r_c)   type
104   47 Tuc        7.3   −9.26    2.79      2.04        G4     −0.76    −0.99     Typical metal-rich cluster
5272  M3           11.6   −8.77    1.12      1.85        F6     −1.57    +0.08     Typical intermediate-metallicity cluster
7078  M15          10.2   −9.07    1.06      2.50        F3     −2.22    +0.67     Typical metal-poor cluster. Collapsed core
288                11.4   −6.54    2.22      0.96        F8     −1.24    +0.98     Same [Fe/H] as NGC 362, but has a blue HB
362                 9.0   −8.26    0.81      1.94        F9     −1.16    −0.87     Same [Fe/H] as NGC 288, but has a red HB
1851               16.3   −8.26    0.52      2.24        F7     −1.26    −0.36     Same [Fe/H] as NGC 288, 362, but bimodal HB
5139  ω Cen         6.3   −10.16   4.18      1.24        F5     −2…−1    +0.90     Most luminous. Chemically inhomogeneous
      AM 4         24.2   −1.50    0.42      0.50               −2.00    +0.98     Least luminous. Least concentrated core
6121  M4            6.1   −7.06    3.65      1.59        F8     −1.18    −0.06     Closest cluster (d ≈ 2 kpc)
      AM 1, E 1   117.2   −4.60    0.50      1.23               −1.80    −0.93     Most distant cluster
      Liller 1      2.3   −7.42    0.45      2.30               +0.22    −1.00     Most metal-rich. Collapsed core
5053               16.5   −6.64    3.50      0.82               −2.41    +0.52     Most metal-poor

NOTES: R_gal is the radius in the Galaxy at which the cluster lies; r_h is the half-light radius; other quantities are as defined in the text.
SOURCE: From data collected in Harris (1996).

Figure 6.1 Mosaic of four negative images of globular clusters. The name and concentration
class of each cluster are marked. Each image is 15 arcmin on a side, which corresponds
to ~ 40pc at the distance of these clusters (~ 10kpc). [DSS images from the Palo- 
mar/National Geographic Society Sky Survey and the SERC/AAO Southern Sky Survey, 
reproduced by permission] 


It is clear from Figure 6.1 that globular clusters vary significantly in 
their luminosities. The brightest cluster in our galaxy, ω Cen, has an absolute
magnitude of M_V = −10.4, and star counts reveal that this luminosity
arises from some ~ 10^6 stars brighter than V ~ 20. At the other end of
the spectrum, the faintest clusters detected contain only ~ 10^3 stars with
corresponding absolute magnitudes of ~ −3.0. As is described in detail in
§4.5 and §10.5.1, the number of globular clusters at absolute magnitudes between
these extremes follows an approximately Gaussian distribution with
the most common luminosity corresponding to M_V ~ −7.
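
To put these magnitudes on a linear scale, the short Python sketch below converts an absolute V magnitude into a V-band luminosity in solar units. It assumes a solar absolute magnitude of 4.83 in V; the function name is ours, and the result is a luminosity rather than a count of stars.

```python
SOLAR_ABS_MAG_V = 4.83  # assumed absolute V magnitude of the Sun

def luminosity_solar(abs_mag_v):
    """V-band luminosity, in solar units, of an object of absolute magnitude M_V."""
    return 10.0 ** (0.4 * (SOLAR_ABS_MAG_V - abs_mag_v))

brightest = luminosity_solar(-10.4)   # brightest Galactic cluster quoted in the text
faintest = luminosity_solar(-3.0)     # faintest clusters quoted in the text
typical = luminosity_solar(-7.0)      # peak of the luminosity distribution
print(f"{brightest:.3g} {typical:.3g} {faintest:.3g} ratio {brightest / faintest:.0f}")
```

The 7.4 magnitude span between the extremes corresponds to a factor of roughly a thousand in luminosity, with the most common clusters lying near the logarithmic middle of the range.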

Figure 6.1 also illustrates the variations in the radial distribution of 
stars from cluster to cluster. Some clusters (like NGC 2808) are strongly
centrally-condensed distributions of stars, while others (like Pal 7) are much 
more diffuse. These variations were first quantified by Shapley & Sawyer 
(1927), who defined twelve concentration classes designated by Roman 
numerals I...XII. In this classification, class I clusters show the highest degree
of central concentration and class XII clusters the lowest; the categories 
were defined such that similar numbers of clusters are found in each class. 
This crude and somewhat subjective measure of the structure of globular 
clusters has largely been superseded by more detailed analysis of the radial 
distribution of stars in these systems (see §6.1.10). 

More can be learned about the properties of globular clusters from their 
spectra. Early spectral studies of the integrated light from globular clus- 
ters revealed that they have spectra similar to those of individual stars, 
except that the spectral lines are slightly broadened by differences between 
the Doppler shifts of the cluster members (see §6.1.11). The integrated spec- 
tra differ in line strength from cluster to cluster, covering a range of spectral 
types from about F3 to G5 (Monella 1985). Since the integrated light from a 
globular cluster is made up from the contributions of many individual stars 
of different types, the interpretation of the resulting composite spectrum is 
not straightforward (Morgan 1959). From studies of the spectra and colors of 
individual giant stars in clusters, it is now recognized that the primary cause 
of the variations in spectral type is the difference between the metal abundances
in clusters. Measurements show that globular clusters have metallicities
which run all the way from very metal poor ([Fe/H] ~ −2.5; Geisler,
Minniti & Claria 1992) up to solar or even greater (Frogel, Kuchinski & 
Tiede 1995). 

Although [Fe/H] provides a useful measure of metallicity, the use of a 
single parameter to quantify a globular cluster’s heavy element abundances 
is an oversimplification. Comparisons between high resolution stellar spectra 
and the predictions of model atmosphere calculations have allowed estimates 
to be made for many elements in globular cluster stars. It is found that 
individual abundances, such as [O/H], [Mg/H], etc., vary independently of 
[Fe/H] — for a review, see Wheeler, Sneden & Truran (1989). The most 
dramatic example of this phenomenon is provided by oxygen: in a typical 
globular cluster, the abundance of iron may be ~ 4% of the solar value, 
while the abundance of oxygen is more like ~ 10%. In the notation of equa- 
tion (3.19), these stars have an oxygen enhancement [O/Fe] ~ 0.4. Other 
α nuclides (§5.2.1) are also found to be enhanced. These enhancements are
believed to arise because the α nuclides are produced in core-collapse supernovae
from short-lived high-mass progenitors (see §5.2.3). As we shall see
below, globular clusters formed early in the evolution of the Milky Way, and 
so the material from which they formed could only have been enriched with 
metals produced by such short-lived stars. 
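
The oxygen enhancement quoted above follows directly from the two abundance fractions. The Python sketch below reproduces the arithmetic, assuming that the bracket notation denotes the base-10 logarithm of an abundance ratio relative to its solar value; the helper name is ours.

```python
import math

def bracket_abundance(fraction_of_solar):
    """[X/H] for an element whose abundance is the given fraction of solar."""
    return math.log10(fraction_of_solar)

fe_h = bracket_abundance(0.04)   # iron at about 4% of the solar value
o_h = bracket_abundance(0.10)    # oxygen at about 10% of the solar value
o_fe = o_h - fe_h                # [O/Fe] = [O/H] - [Fe/H]
print(f"[Fe/H] = {fe_h:.2f}, [O/H] = {o_h:.2f}, [O/Fe] = {o_fe:.2f}")
```

A factor of 2.5 between the two fractions therefore corresponds to [O/Fe] of about 0.4.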


6.1.1 Globular cluster stellar photometry


One of the simplest properties of a globular cluster that we can measure is 
the distribution of the brightnesses of its component stars. As we have seen 
in §3.5, the absolute brightnesses and colors of stars are directly related to 
their masses and ages. Thus, photometry of the stars which make up globular 
clusters contains important clues to the processes by which the stars in these 
systems formed as well as their subsequent evolution. 

In principle, it is straightforward to obtain photometry of stars in globular
clusters. Since globular clusters are compact systems, many stars fit
within the field of view provided by a single CCD frame, which makes it 
much simpler to obtain the requisite data than is the case when measuring 
the magnitudes of more diffusely distributed field stars. Further, since all 
the stars in a cluster are at essentially the same distance from us (a globular 
cluster is typically a few parsecs across at a distance of a few kiloparsecs), we 
need no distance information to measure the relative luminosities of different 
cluster stars, and we only need to determine a single distance to calculate 
the absolute magnitudes of all the cluster members. 
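
A quick estimate shows how safe it is to treat all members as equidistant. The Python sketch below computes the spread in apparent magnitude between the near and far sides of a cluster; the depth of 5 pc and distance of 5 kpc are illustrative values chosen to match "a few parsecs" and "a few kiloparsecs", and the function name is ours.

```python
import math

def magnitude_spread_from_depth(distance_pc, depth_pc):
    """Magnitude difference between stars on the far and near sides of a cluster."""
    near = distance_pc - 0.5 * depth_pc
    far = distance_pc + 0.5 * depth_pc
    # Flux falls as distance squared, so delta m = 5 log10(far / near).
    return 5.0 * math.log10(far / near)

spread = magnitude_spread_from_depth(distance_pc=5000.0, depth_pc=5.0)
print(f"front-to-back spread = {spread:.4f} mag")
```

For these values the spread is about two thousandths of a magnitude, far below typical photometric errors.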

There are, however, some practical difficulties. In particular, although 
most well-studied globular clusters are located away from the Galactic plane, 
they are still subject to some interstellar extinction. Since globular clusters 
subtend only small angles on the sky, and are themselves almost entirely 
devoid of dust and gas, all the stars will be viewed through comparable 
amounts of absorbing material, and so the amount of absorption can be 
characterized by a single visual extinction. Fortunately, as we will see below, 
the color-magnitude diagrams of the stars in globular clusters contain a great 
deal of structure. By relating this structure to the predicted intrinsic colors 
and absolute magnitudes of features in the CM diagram, we can solve for 
both the distance to the cluster and the amount of reddening. 

A rather less tractable problem in the study of clusters’ stellar pho- 
tometry arises from their compact nature. As we have seen in §3.2.3, the 
phenomenon of seeing degrades ground-based images of stars so that their 
light is spread over a finite disk on the sky. The projected number densities of 
stars in globular clusters can be so high that light from adjacent stars blends 
together. In fact, in the central regions of clusters, the stars are so tightly 
crowded together that individual stars cannot be detected at all. Even out- 
side the region within which individual stars are undetectable, crowding can 
produce two serious problems for studies of the photometric properties of 
cluster stars. First, we may still miss some stars entirely because they hap- 
pen to lie very close together on the sky. Moreover, the difficulty of deciding 
whether a given patch of light is due to one star or two increases not only with 
the noise level in the data but also with the magnitude difference between 
the two stars. Hence crowding causes more faint stars to go undetected than 
bright ones, especially if the data are noisy. Second, if we do misidentify two 
or more adjacent stars as a single source, the measured luminosity of such a 
“star” will be larger than that of any of the real stars of which it is composed.
These effects of stellar crowding, known generically as source confusion, 
distort the observed distribution of stellar luminosities in globular clusters. 
As we shall see below, there are many more faint stars than bright ones in 
globular clusters, so the impact of source confusion becomes more acute as 
we look to the properties of cluster stars at fainter magnitudes. 

One obvious solution is to restrict our attention to the outer regions of 
clusters, where the density of stars is sufficiently low that confusion ceases 
to be an issue. However, at large radii contamination of one’s sample by 
field stars can become a serious problem, and, as we will see in §6.1.10, 
the photometric properties of stars in the outer parts of a cluster may not 
be representative of the cluster as a whole. Fortunately, there have been a 
number of developments which have alleviated the problem of confusion in 
studies of the more densely-populated inner parts of globular clusters. With 
the introduction of high-quality digital data from CCDs and the massive 
expansion of available computing power, it has become possible to imple- 
ment sophisticated algorithms for disentangling observations of crowded star 
fields. Software packages such as DAOPHOT (Stetson 1987) have been devel- 
oped which measure the point-spread function (§4.2.2) from the stars in an 
image, and then measure the locations and brightnesses of all the objects 
in the field that have light distributions that match the point-spread func- 
tion. The contributions that the detected objects make to the total light 
from the cluster can then be subtracted from the original image to search 
for residual contributions from fainter stars which were imperceptible in the 
raw data. Although this procedure recovers the photometry of a crowded 
field much more efficiently than could be achieved by traditional techniques, 
some sources will still be lost. Software has therefore also been developed for 
adding artificial stars to the observed images. By seeing how efficiently the 
photometry package recovers the existence and brightnesses of such artificial 
stars as a function of their magnitude, it is possible to estimate the distorting 
effects of crowding on the observed distribution of stellar luminosities (e.g. 
McClure et al. 1985). More recently, computational power has increased to 
a point where it is practical to simulate complete images of globular clusters 
with any intrinsic distribution of stellar properties that we choose. These 
images can be analyzed in exactly the same way as the real data to see how 
well the derived photometric properties match up with those derived from 
real data. 
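
The artificial-star test lends itself to a simple numerical sketch. The Python code below estimates the recovered fraction of injected stars in each magnitude bin and uses it to correct observed counts. It is a minimal illustration, not the procedure of DAOPHOT or any other package: the function names, the bin edges and the rule of refusing to correct bins with less than 50% completeness are all our assumptions, and a star is treated as simply recovered or not, without checking its measured brightness.

```python
from bisect import bisect_right

def completeness_by_bin(injected_mags, recovered, bin_edges):
    """Fraction of artificial stars recovered in each magnitude bin."""
    n_bins = len(bin_edges) - 1
    injected = [0] * n_bins
    found = [0] * n_bins
    for mag, was_found in zip(injected_mags, recovered):
        k = bisect_right(bin_edges, mag) - 1
        if 0 <= k < n_bins:
            injected[k] += 1
            found[k] += int(was_found)
    # None marks bins into which no artificial stars were injected.
    return [f / n if n else None for f, n in zip(found, injected)]

def corrected_counts(observed_counts, completeness, min_completeness=0.5):
    """Divide observed counts by completeness; None where the correction is unsafe."""
    corrected = []
    for n_obs, frac in zip(observed_counts, completeness):
        if frac is None or frac < min_completeness:
            corrected.append(None)
        else:
            corrected.append(n_obs / frac)
    return corrected

edges = [18.0, 20.0, 22.0]
mags = [18.5, 19.0, 19.5, 19.9, 20.5, 21.0, 21.5, 21.9]
flags = [True, True, True, True, True, True, False, False]
fractions = completeness_by_bin(mags, flags, edges)
print(fractions, corrected_counts([100, 60], fractions))
```

In practice many more artificial stars would be injected, in small batches so that they do not themselves add appreciably to the crowding.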

The second major weapon in astronomers’ battle against confusion has 
come from improvements in image quality. Refinements in telescope tech- 
nology have dramatically reduced the typical seeing (§3.2.3) at optical tele- 
scopes, so reliable photometry can be obtained closer to the centers of glob- 
ular clusters. Even more dramatically, the repaired HST produces images 
of stars that are a mere 0.1 arcsec across, so much more densely populated 
regions can be studied before confusion becomes an issue. Combining the 
imaging capabilities of HST with the crowded-field analysis techniques described
above now allows us to probe stellar photometry in even the densest
regions of globular clusters (e.g. Guhathakurta et al. 1996).

Figure 6.2 The color-magnitude diagram for the globular cluster
M3. Known variable stars are shown as open circles, and the principal
sequences are annotated. [From data published in Buonanno
et al. (1994)]


6.1.2 Color-magnitude diagrams 


Once we have calculated the apparent magnitudes for the stars in a cluster 
in more than one color, it is straightforward to plot the CM diagram for the 
cluster stars. Figure 6.2 shows the CM diagram of the well-studied globular 
cluster M3. Comparison of Figure 6.2 with the CM diagram of the solar 
neighborhood (Figure 3.5) shows that while most stars are confined to well 
defined sequences in both figures, the two diagrams look very different. By 
contrast, the CM diagrams of several globular clusters shown schematically 
in Figure 6.3 all contain the sequences that are indicated in Figure 6.2: a 
main sequence (MS); a subgiant branch (SGB); a red giant branch (RGB) 
and a horizontal branch (HB). In addition many CM diagrams show an 
asymptotic giant branch (AGB) that is clearly distinct from the RGB. Below, 
we discuss the importance of the similarities and differences between these
various sequences for our understanding of the properties of globular clusters.

Figure 6.3 Schematic illustration of the principal sequences in the
CM diagrams for three globular clusters. The systems shown are
a metal-rich cluster (47 Tuc), an intermediate-metallicity cluster
(M3), and a metal-poor cluster (M15). [Sequences from Hesser et
al. (1987) and Buonanno et al. (1994)]


The main sequence and subgiant branch One striking difference be- 
tween a globular-cluster CM diagram and that for the solar neighborhood 
is that in the former the MS stops well short of the luminosity of the HB,²
while in the latter it continues up to include some of the brightest stars in 
the Galaxy. In §5.1 we saw that these bright, upper-MS stars have short life- 
times. Hence one possible explanation for this difference is that, while the 
solar neighborhood contains young stars, all the stars in a globular cluster 
are old and therefore none of them lie on the upper MS. 

As Figure 6.2 illustrates, the SGB of a globular cluster is extremely 
narrow, and abuts the MS at a well defined turnoff point.³ This structure
indicates that all the stars that originally lay on the MS just above the
turnoff point are currently evolving off the MS en masse. This implies that 


2 A few stars, termed blue stragglers, are sometimes found above the turnoff point 
in the CM diagram — see §6.1.8. 

3 The precise definition of the turnoff point is the point at which the ridge-line of the 
MS/SGB runs vertically up the CM diagram. 
all these stars have essentially identical ages, namely their MS lifetime. In
view of this fact, it is natural to assume that all stars in the globular cluster 
have this same age. Certainly, the absence of significant numbers of stars 
evolving away from the MS below the turnoff point indicates that the cluster 
does not contain stars much older than the MS lifetime of the turnoff stars. 
The sharpness of the observed sequences places strong limits on the length 
of time over which clusters formed. If the stars formed over an extended 
period, then the observed SGB and RGB would be made up from superpo- 
sition of isochrones of different ages. Since the locations of these calculated 
sequences, particularly the SGB, shift significantly with age, the narrowness 
of these features in the CM diagram of clusters constrains the period of stel- 
lar formation in these systems to no more than ~ 2% of the clusters’ age 
(Stetson 1993). 

As discussed in §5.1.5, the locations of both the MS and SGB depend 
on the metallicity of the cluster: Figure 6.3 illustrates how these sequences 
in the CM diagram are observed to shift to the right, and how the SGB 
becomes flatter as we go from metal-poor to metal-rich globular clusters. 

Deep observations make it possible to trace the MS clearly over a number 
of magnitudes of brightness (see Figure 6.2). If we can accurately determine 
the location of the MS in the CM diagram, we can use this knowledge to 
measure the distance to a cluster by a technique termed main sequence 
fitting. The essence of the method is very simple: from the MS of nearby 
stars with known distances, we can establish a unique relationship between 
the color of a MS star and its absolute magnitude; using this relationship to 
predict the absolute magnitudes of MS stars in a cluster on the basis of their 
colors, and comparing this intrinsic brightness to the apparent magnitude of 
the stars then yields the distance to the cluster via equation (2.36). If we 
carry out this procedure for a whole sample of cluster stars, we are effectively 
shifting the MS in an observed CM diagram like Figure 6.2 up and down the 
vertical axis until it matches up with the MS in the CM diagram of nearby 
stars (Figure 3.5). The amount by which the scale has to be shifted is the 
distance modulus of the cluster. 
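
The shifting procedure just described can be sketched numerically. What follows is a minimal illustration only, not a calibrated method: the linear fiducial relation `fiducial_abs_mag` and its coefficients are hypothetical stand-ins for the observed MS of nearby stars, the median offset is one simple choice of estimator, and the conversion to distance assumes the standard relation m − M = 5 log10(d / 10 pc) with extinction neglected.

```python
from statistics import median

def fiducial_abs_mag(color):
    """Hypothetical linear colour-magnitude relation for nearby MS stars.
    The coefficients are illustrative only, not a real calibration."""
    return 1.0 + 6.0 * color

def fit_distance_modulus(cluster_stars, fiducial=fiducial_abs_mag):
    """Vertical shift (m - M) that maps the fiducial MS onto the cluster MS.
    cluster_stars: iterable of (colour, apparent magnitude) pairs."""
    offsets = [m_app - fiducial(color) for color, m_app in cluster_stars]
    return median(offsets)

def distance_pc(distance_modulus):
    """Solve m - M = 5 log10(d / 10 pc) for d in parsecs (no extinction)."""
    return 10.0 ** (distance_modulus / 5.0 + 1.0)

# Synthetic cluster MS placed at an assumed true distance modulus of 15.0.
stars = [(c, fiducial_abs_mag(c) + 15.0) for c in (0.45, 0.55, 0.65, 0.75)]
dm = fit_distance_modulus(stars)
print(dm, distance_pc(dm))
```

In this toy case every star gives the same offset, so the recovered modulus equals the input; with real photometry the scatter of the individual offsets gives a first indication of the uncertainty in the fit.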

In practice, a number of factors complicate the MS fitting procedure.
The most important of these factors is the low metallicity of many globular
clusters. Since the location of the MS in the CM diagram depends quite
strongly on the metallicity of the stellar population, the calibration of globular
cluster star magnitudes is usually tied to the absolute magnitudes of the
metal-poor subdwarfs for which we have trigonometric parallaxes, and hence
good distance determinations. Even with the vastly improved data available
from Hipparcos, subdwarfs are sufficiently rare that reliable parallaxes are
only available for ~ 20 suitable objects (Reid 1997), so the calibration is
not all that firmly established. Also, as we shall see in §6.1.3, stars start to
evolve to the right in the CM diagram before the turnoff point comes down
to them. We must therefore be rather careful when determining distances
by MS fitting to allow for possible age differences between the populations
being compared.


4 The apparent broadening of the sequence at the faint end is a purely instrumental
effect: the uncertainties in magnitudes of the faintest stars detected become quite large,
and so the error in the color B − V, the small difference between two large ill-determined
numbers, becomes very large.


The horizontal branch The horizontal branches in the CM diagrams 
of globular clusters also differ dramatically from the equivalent feature for 
nearby stars. With no additional information, we would be hard pressed 
to conclude that the extensive HB apparent in Figure 6.2 is in any way 
related to the red clump in Figure 3.5. In fact, the differences between these 
features arise purely from the very different metallicities of M3 and the solar 
neighborhood: as Figure 6.3 illustrates, the HB becomes shorter and moves 
to the right in the CM diagram as we look to higher metallicity systems, so 
the metal-rich solar neighborhood contains only the vestigial red clump as 
its HB. 

The HB is also where the RR Lyrae variables lie in the CM diagram 
(§5.1.10). As was first noted by Oosterhoff (1939), there appear to be two 
distinct types of globular cluster RR Lyrae population: clusters with red HBs 
contain mostly RR Lyrae stars of Bailey type ab (§5.1.10) with periods of P ~ 
0.55 days; while those with blue HBs contain more c-type RR Lyrae stars, 
and the ab-type stars that they do contain have longer periods, P ~ 0.65 
days. These two types of cluster are designated Oosterhoff class I and II, 
respectively. This correlation between HB color and RR Lyrae period arises 
from the underlying dependence of both phenomena on cluster metallicity. 

Figure 6.5 Section of the CM diagram for the globular cluster NGC 1851.
Note the concentrations of stars at both the blue and red ends of the
horizontal branch. [From data published in Walker (1992b)]


Since the instability strip populated by the RR Lyrae stars lies roughly
in the middle of the region spanned by the HB, its position provides a useful
fiducial mark for defining the color of the HB: if a globular cluster’s CM
diagram has a HB which contains $N_{\rm var}$ RR Lyrae variables, $N_r$ stars on the
red side of the variables, and $N_b$ stars on the blue side, then a measure of the
overall color of the HB is provided by ${\rm HB}_{\rm col} = (N_b - N_r)/(N_b + N_{\rm var} + N_r)$
(Lee 1990). Figure 6.4 shows the expected general trend in this quantity with
metallicity: the average HB color varies from very blue (${\rm HB}_{\rm col} \approx 1$) for the
most metal-poor clusters to very red (${\rm HB}_{\rm col} \approx -1$) for metal-rich clusters.
In fact, the highest metallicity clusters have such red HBs that they contain
essentially no stars on the blue side of the RR Lyrae stars, so ${\rm HB}_{\rm col}$ saturates
as a color indicator at its minimum value of $-1$.
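
As a concrete illustration of this index, the short sketch below evaluates it from star counts. The function name and the example counts are invented for illustration; only the formula itself is taken from the text.

```python
def hb_color_index(n_blue, n_var, n_red):
    """HB colour index (N_b - N_r) / (N_b + N_var + N_r) from star counts."""
    total = n_blue + n_var + n_red
    if total == 0:
        raise ValueError("no HB stars counted")
    return (n_blue - n_red) / total

# Made-up counts: a predominantly blue HB and a predominantly red HB.
print(hb_color_index(90, 8, 2))
print(hb_color_index(0, 5, 95))
```

The index reaches +1 only when every HB star lies blueward of the instability strip and −1 only when every star lies redward of it, which is why it saturates for the most extreme clusters.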

There are indications that the relation between ${\rm HB}_{\rm col}$ and metallicity
is not monotonic, in that the most metal-poor clusters (${\rm [Fe/H]} < -2$) are
somewhat redder than slightly metal-richer clusters, an effect first noted by
Renzini (1983). As Sandage (1990) has pointed out, this non-monotonic behavior
explains the dichotomy between the two Oosterhoff classes: a cluster
with ${\rm HB}_{\rm col} \approx 1$ has such a blue HB that it contains almost no RR Lyrae
stars. This absence of RR Lyrae stars means that clusters with metallicities
in the range $-2 < {\rm [Fe/H]} \lesssim -1.75$ cannot be classified using Oosterhoff’s
scheme, so there is a gap in the range of values of [Fe/H] of clusters with
Oosterhoff classifications. Since RR Lyrae period varies monotonically
with the star’s metallicity [equation (5.26)], this gap also shows up as a minimum
in the period distribution; clusters on the metal-rich side of the gap
form a group whose RR Lyrae stars have short periods, while those on the
metal-poor side of the gap contain only long-period RR Lyrae stars.

The trend in HB color with metallicity in Figure 6.4 is clear, but there 
is also a spread in colors at any given metallicity. The width of the relation 
is greater than the uncertainties in metallicity and color determinations, and 
thus there are intrinsic variations between the colors of globular cluster HBs 
at any given metallicity. For example, the clusters NGC 288 and NGC 362
have essentially identical metallicities (${\rm [Fe/H]} = -1.2 \pm 0.1$ in each case), yet
NGC 288 has a very blue HB (${\rm HB}_{\rm col} = 0.98$), while NGC 362 has a very red
HB (${\rm HB}_{\rm col} = -0.87$). Also, it has long been known (e.g. Racine 1971) that
there are cases in which the HB cannot be described by a single characteristic
color because the population along the HB displays a bimodal distribution,
with concentrations of stars at both the red and blue ends of the sequence
(see Figure 6.5).


Figure 6.6 Theoretically calculated isochrones showing how a stellar
population with Z = 0.004, Y = 0.24 evolves away from the
ZAMS (dotted line) in the CM diagram. Each isochrone is labeled
by its age. [From the calculations of Bertelli et al. (1994)]


6.1.3 Comparison with theoretical CM diagrams


In order to see what can be learned from the observed CM diagrams of 
globular clusters, we must compare them with the predictions of the stellar 
population calculations that were introduced in §5.4. As we have seen above, 
the morphology of globular clusters’ CM diagrams leads us to believe that all 
the stars in these systems formed in a single epoch, so the procedure adopted 
for calculating an appropriate model CM diagram is as follows: 

(1) Choose initial abundances for the chemical elements in the stellar pop- 
ulation. For heavy elements, the abundances can be matched to the ob- 
served line strengths in the spectra of stars in the cluster to be modeled. 
The relative overabundance of oxygen mentioned above is doubly important
in these stellar-structure calculations, since it affects the hydrogen
burning efficiency of the star via the CNO cycle as well as altering the 
opacity of the stellar material. For the initial helium abundance, the 
primordial value of Y = 0.23 (§5.1.6) is usually adopted. Once these 
abundances have been fixed, we can calculate the initial locations of the 
stellar population on the ZAMS. 

(2) Evolve the population forward in time, by repeatedly solving the stellar 
structure equations for each star, and keeping track of the chemical 
evolution which occurs as elements are transmuted by fusion reactions.

(3) For each time step, calculate the luminosities and colors for all the 
stars in the population. The curve connecting all the stars in the color- 
magnitude diagram is called an isochrone from the Greek for “same 
time”, since it represents a snapshot of the properties of the stellar pop- 
ulation when they are all at an identical age. 


If our assumed initial chemical composition is correct, and we have 
performed the stellar structure calculations accurately, then the calculated 
isochrone of the right age should trace the populated sequences in the ob- 
served CM diagram. Comparing isochrones to observed sequences thus pro- 
vides a powerful tool not only for measuring the ages of globular clusters, but 
also for testing our understanding of the basic physics of stellar structure. 
Families of isochrones have therefore been calculated for a wide range of 
different metallicities, ages, and physical assumptions by researchers around 
the world, including VandenBerg & Bell (1985), Bergbusch & VandenBerg
(1992), Chieffi & Straniero (1989), Bertelli et al. (1994) and the Yale group 
(Green, Demarque & King 1987; Chaboyer et al. 1995). Figure 6.6 shows the 
isochrones produced by one such calculation. Stars peel away from the MS 
one after the other, starting with the most luminous. Comparing Figures 6.3 
and 6.6, it is clear that isochrones for ages somewhere between ~ 10 Gyr and 
~ 20 Gyr can approximately reproduce a globular cluster’s MS and SGB. 

We can also determine the effects of varying the element abundances on 
the isochrones. Figure 6.7 shows what happens to the calculated isochrone 
for a system of age 14 Gyr as Y and Z are varied. Changing Y from Y = 0.2
(comparable to the primordial value — see §5.1.6) to Y = 0.3 (approximately 
the value appropriate for stars with solar abundances) shifts the MS down- 
wards by ~ 0.2 magnitudes and makes the SGB somewhat steeper. Increas- 
ing Z from Z = 0.0001 (appropriate for a metal-poor globular cluster such 
as M15) to Z = 0.006 (appropriate for a metal-rich cluster such as 47 Tuc) 
shifts the isochrones to the right by ~ 0.2 in B − V, and significantly changes
the shape of the SGB. Comparing Figures 6.3 and 6.7, we can see that the 
changes in the isochrones with Z match the observed differences between the 
MSs and SGBs of globular clusters with a range of metallicities. The shift of 
the sequences to the right in the CM diagram with increasing Z arises prin- 
cipally from the line blanketing effects of heavy elements, which suppress the 
emission of blue light by metal-rich stars (§2.3.2).


Figure 6.7 Illustration of the effects of varying Y and Z on the shape
and position predicted for the 14 Gyr isochrone. The line width indicates
the metallicity of the isochrone: the heaviest lines are for Z = 0.006;
the intermediate lines are for Z = 0.001; and the lightest lines are for
Z = 0.0001. The solid lines are for a helium abundance of Y = 0.2,
and the dotted lines are for Y = 0.3. [From the calculations of VandenBerg

Figure 6.8 Absolute bolometric magnitude of the brightest giant star
as a function of the host globular cluster’s metallicity (Sarajedini &
Milone 1995). The line indicates the relation predicted by stellar evolution
calculations (Da Costa & Armandroff 1990).


Ideally, we would carry out similar comparisons between the results 
from calculations of the later stages of stellar evolution and other features in 
the CM diagram such as the RGB and the HB. Unfortunately, as we have 
seen in §5.1, we are unable to model accurately the deep convective layers 
and mass loss in giant stars. Thus, the uncertainties in the calculations 
become large when trying to follow these later stages of stellar evolution, 
and the match between calculation and observation becomes rather poor. 
Calculations do predict the observed tendency for the color of the almost 
vertical RGB to depend on metallicity in the sense that the most metal-poor 
clusters have the bluest RGBs (see Figure 6.3). This dependence is once 
again attributable to the effects of line blanketing. However, the fit between 
theory and observation is poorer for the RGB than it is for the MS and SGB 
(Da Costa & Armandroff 1990).

As we look to still-later stages of stellar evolution further up the RGB,
the discrepancies grow. Figure 6.8 compares the observed absolute magni- 
tude of the brightest star in a number of globular clusters of different metal- 
licities with the predictions of stellar-evolution calculations. It is clear that 
the match between theory and observation for these evolved stars is far from 
perfect. In particular, it is noteworthy that in most cases the brightest star 
is brighter than the theoretically-predicted maximum luminosity, whereas a 
simple argument leads us to expect the reverse: given the small number of 
stars near the top of any RGB and the speed of their evolution through this 
stage, we cannot expect to catch even one star right at the tip of the RGB. 
We should not be surprised at the inability of calculations to reproduce the 
luminosity of the tip of the RGB: the rapidity with which the luminosity L of 
a star evolves at this point implies that L is a very sensitive function of the
star’s internal structure. Consequently, small errors in the assumed internal 
constitution of a star must lead to large errors in the predicted luminosity. 
A good understanding of the luminosities of these very bright stars would
enable us to use them as standard candles for estimating the distances to
other galaxies, even quite remote ones (VandenBerg & Durrell 1990).

When a star in a globular cluster evolves to the tip of the RGB, it 
undergoes the helium flash (§5.1.3). This explosive phenomenon causes an 
almost instantaneous mass loss and a rearrangement of the structure of the 
star, which we have no hope of modeling in detail. It is thus not possible to 
follow the evolution of a star from the RGB on to the HB where it settles 
down to core helium burning. This transition is therefore treated in a semi- 
empirical manner. For a given cluster, stellar structure calculations allow us 
to calculate the approximate mass of a star at the very tip of the RGB, $M_{\rm RG}$,
and the mass of the hydrogen-depleted helium core that it has acquired by
this point, $M_c$. A star marginally more massive than this will have already
undergone the helium flash, after which it is assumed to have lost a mass
$\Delta M$ from its atmosphere, to end up with a total mass $M_{\rm HB} = M_{\rm RG} - \Delta M$.
Its helium core is unaffected by the helium flash and still contains a mass
$M_c$. After the helium flash, a star is burning helium in its core, and hence
soon arrives on the ZAHB (§5.1.2). At this point, the rate at which its 
structure changes slows to a point where we can again use stellar structure 
calculations to estimate the observable properties of the star and follow its 
subsequent evolution. 

Figure 6.9 The HB morphologies for three simulated clusters with
the metallicities indicated. All three clusters have the same age
(13 Gyr) and helium abundance (Y = 0.22). Individual points
show the predicted locations of stars which have undergone different
amounts of mass loss as described in the text, and are currently
evolving through their core helium burning phase. RR Lyrae stars
are shown as open points. The line shows the location of the RGB
from the isochrones in Green, Demarque & King (1987). [After
Lee, Demarque & Zinn (1990) from data kindly supplied by Y.-W.
Lee]


As we have seen in §5.1.7, stars do not spend a very large fraction of their
lives on the HB, so all the stars currently in this region of the CM diagram
must have undergone the helium flash recently and hence have evolved from
stars with very similar masses, $M_{\rm RG}$. Hence if all stars currently on the HB
had lost the same amount of mass $\Delta M$, they would all arrive at the same
point on the ZAHB. Moreover, stars do not move far along the HB during
core helium burning, so the HB would be very short. The finite extent of
cluster HBs (Figure 6.2) therefore implies that at fixed $M_{\rm RG}$ there must be
a spread in values of $\Delta M$ and therefore of $M_{\rm HB}$. Rood (1973) modeled the
probability distribution of $M_{\rm HB}$ by

$$
P(M_{\rm HB}) \propto
\begin{cases}
(M_{\rm HB} - M_c)(M_{\rm RG} - M_{\rm HB})
\exp\left[-\dfrac{(\langle M_{\rm HB}\rangle - M_{\rm HB})^2}{2\sigma_M^2}\right]
& M_c < M_{\rm HB} < M_{\rm RG}, \\
0 & \text{otherwise.}
\end{cases}
\tag{6.1}
$$


This complex functional form is designed to ensure that stars neither gain 
mass nor lose part of their helium cores as they pass from the RGB to the HB. 
The quantity $\langle M_{\rm HB}\rangle$ is a measure of the average mass of the stars arriving
on the HB, while $\sigma_M$ describes the dispersion in $M_{\rm HB}$ (see Problem 6.1). By
comparing Figures 6.3 and 6.9, we see that such a spread in $M_{\rm HB}$ enables
theory to reproduce the observed structure of the HB and its variation with
metallicity. Typically, one requires $M_{\rm RG} - \langle M_{\rm HB}\rangle \approx 0.2\,M_\odot$ and $\sigma_M \approx
0.02\,M_\odot$ (e.g., Lee, Demarque & Zinn 1994).
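
To see what a distribution of the form (6.1) implies, one can draw HB masses from it by rejection sampling. The sketch below rests on several assumptions that should be read as such: the core mass and red-giant mass (0.48 and 0.82 solar masses) are illustrative values not quoted in the text, the exponential factor is taken to be a Gaussian with variance $\sigma_M^2$, and the offset of 0.2 and dispersion of 0.02 solar masses follow the typical values mentioned above.

```python
import math
import random
import statistics

def hb_mass_weight(m_hb, m_core, m_rg, mean_mass, sigma):
    """Unnormalized P(M_HB) of the assumed form; zero outside (m_core, m_rg)."""
    if not (m_core < m_hb < m_rg):
        return 0.0
    gauss = math.exp(-((mean_mass - m_hb) ** 2) / (2.0 * sigma ** 2))
    return (m_hb - m_core) * (m_rg - m_hb) * gauss

def sample_hb_masses(n, m_core, m_rg, mean_mass, sigma, seed=0):
    """Draw n HB masses by rejection sampling against a uniform proposal."""
    rng = random.Random(seed)
    # Upper bound on the weight: the polynomial factor peaks at the midpoint
    # of the allowed interval and the Gaussian factor never exceeds 1.
    bound = ((m_rg - m_core) / 2.0) ** 2
    masses = []
    while len(masses) < n:
        trial = rng.uniform(m_core, m_rg)
        weight = hb_mass_weight(trial, m_core, m_rg, mean_mass, sigma)
        if rng.random() * bound < weight:
            masses.append(trial)
    return masses

# Illustrative (assumed) parameters, all in solar masses.
masses = sample_hb_masses(2000, m_core=0.48, m_rg=0.82,
                          mean_mass=0.62, sigma=0.02)
print(statistics.mean(masses), statistics.stdev(masses))
```

Because the two polynomial factors force the probability to zero at both ends of the allowed range, no sampled star gains mass or loses part of its core, which is the property the functional form is designed to guarantee.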

These HB calculations explain the variation in HB color with cluster 
metallicity illustrated in Figure 6.4: as for the MS and SGB, most of the trend 
is caused by line blanketing in metal-rich stars (§2.3.2). The simulations also 
follow the evolution of HB stars as they move away from the ZAHB, and show 
how the stars begin to populate the AGB (see Figure 6.9).

Although this semi-empirical approach allows us to understand many of 
the features of the HB, its rather ad hoc formulation is unsatisfying. Why do 
some stars lose more mass than others in moving to the HB? The simple mass- 
loss model also fails to explain the differences in HB color seen in clusters 
with identical metallicities (see Figure 6.4), and does not reproduce bimodal 
HBs like the one shown in Figure 6.5. Clearly, we are still missing some of 
the physics that dictates the late stages of evolution in globular cluster stars. 


6.1.4 Globular cluster ages 


The discussion above shows that we understand the physics of stellar struc- 
ture to the extent that we can use it to explain many of the features observed 
in the CM diagrams of globular clusters. That being the case, we can use the 
comparison between models and observation to tie down some of the basic 
physical parameters of the cluster. Of these parameters, perhaps the most 
interesting is age. As some of the oldest objects in the Milky Way, globular 
clusters place one of the strongest lower limits that we have on the age of the 
Universe, while possible differences between the ages of clusters at different 
locations in the Galaxy may provide important clues as to how the Milky 
Way formed (§10.7). Intense interest has therefore been focused on using 
CM diagrams as a diagnostic of cluster ages. 


Turnoff point ages As we have discussed, the presence of a sharp turnoff 
point from the MS provides evidence that the stars in a globular cluster 
formed in a single epoch. Since the location of the turnoff point works its way 
down the MS as a cluster ages (see Figure 6.6), the absolute magnitude of the 
turnoff point, $M_V({\rm TO})$, provides a direct measure of a cluster’s age, which
can be calibrated using isochrone calculations. Quantitatively, Bergbusch &
VandenBerg (1992) have found that $M_V({\rm TO})$ is related to cluster age, $t$, and
metallicity, [Fe/H], by


$$ M_V({\rm TO}) = 2.70 \log(t/{\rm Gyr}) + 0.30\,{\rm [Fe/H]} + 1.41. \tag{6.2} $$


Figure 6.10 A family of isochrones with ages as marked (14, 16 and 18 Gyr)
fitted to the MS and SGB in the CM diagram of M15 (points). The adopted
distance modulus in this fit is $(m - M) = 15.4$, and the color excess
is $E(B - V) = 0.10$. A residual color correction of $\delta(B - V) = 0.015$ has
also been applied. [After Durrell & Harris (1993)]


To extract a cluster’s age from this equation, we first determine the apparent
magnitude of the turnoff point. Then we find the cluster’s distance, either
by MS-fitting (§6.1.2) or by using the cluster RR Lyrae stars as standard
candles (§5.1.10), and use it to calculate the absolute magnitude of the turnoff
point. Finally, we measure the cluster’s metallicity from spectra. Crudely,
we can see from Figure 6.3 that clusters typically have a turnoff point at
$M_V({\rm TO}) \approx 4$; for a typical globular cluster of metallicity ${\rm [Fe/H]} = -1.5$ (see
Figure 10.32), this implies that $t \approx 13$ Gyr.
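
The crude estimate above amounts to inverting equation (6.2) for the age. A minimal sketch follows; the function name is invented for illustration, and the coefficients are simply those of equation (6.2), so the result inherits whatever uncertainty that calibration carries.

```python
def turnoff_age_gyr(mv_turnoff, fe_h):
    """Invert M_V(TO) = 2.70 log10(t/Gyr) + 0.30 [Fe/H] + 1.41 for t in Gyr."""
    log_t = (mv_turnoff - 0.30 * fe_h - 1.41) / 2.70
    return 10.0 ** log_t

# Typical values quoted in the text: M_V(TO) ~ 4 and [Fe/H] = -1.5.
print(round(turnoff_age_gyr(4.0, -1.5), 1))
```

For these inputs the function returns an age slightly above 13 Gyr, in line with the rough figure quoted in the text.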


Isochrone fitting The vertical nature of the isochrones at the turnoff 
point means that the exact value of $M_V({\rm TO})$ is not easy to measure observationally,
and we should be able to obtain more accurate results by matching
cluster sequences to theoretical isochrones throughout the transition from 
the ZAMS to the RGB - see Figure 6.6. Given the uncertainties in models 
of stars on the RGB, we focus on the upper end of the MS and the SGB. 

The most straightforward way to exploit the shape of the combined MS 
and SGB as an age indicator is to fit the locations of both sequences directly 
to the predictions of isochrone calculations. The isochrone which minimizes 
the discrepancies between the observed and calculated sequences gives the 
estimated cluster age. By leaving the distance modulus as a free parameter 
in this fitting process, we can also solve simultaneously for the distance to 
the cluster. Figure 6.10 shows a typical example of this procedure, which 
compares the mean locus of stars on the MS, SGB and RGB of the globular 
cluster M15 to a family of isochrones. It is clear from this figure that a good
match is achieved for an age of close to 15 Gyr. 
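
The fitting procedure can be sketched as a grid search over age and distance modulus. The example below is schematic and makes strong simplifying assumptions: `toy_isochrone` is an invented piecewise-linear curve whose turnoff magnitude follows equation (6.2) but whose slopes are arbitrary, standing in for a genuine stellar-evolution isochrone; the "observed" ridge line is synthetic; and the figure of merit is an unweighted mean squared color residual.

```python
import math

def toy_isochrone(age_gyr, fe_h=-1.5):
    """Toy (M_V, colour) table; NOT a stellar-evolution calculation.
    The turnoff magnitude follows equation (6.2); the slopes are invented."""
    mv_to = 2.70 * math.log10(age_gyr) + 0.30 * fe_h + 1.41
    color_to = 0.40 + 0.10 * (mv_to - 4.0)
    table = []
    for i in range(71):                      # M_V = 1.0, 1.1, ..., 8.0
        mv = 1.0 + i / 10.0
        if mv >= mv_to:                      # main sequence: fainter is redder
            color = 0.40 + 0.10 * (mv - 4.0)
        else:                                # subgiants: brighter is redder
            color = color_to + 0.30 * (mv_to - mv)
        table.append((mv, color))
    return table

def color_at(mv, isochrone):
    """Linearly interpolate the isochrone colour at absolute magnitude mv."""
    for (m0, c0), (m1, c1) in zip(isochrone, isochrone[1:]):
        if m0 <= mv <= m1:
            return c0 + (c1 - c0) * (mv - m0) / (m1 - m0)
    return None                              # outside the tabulated range

def fit_age_and_modulus(ridge, ages, moduli):
    """Grid search for the (age, distance modulus) pair that minimizes the
    mean squared colour residual between the ridge line and each isochrone."""
    best = None
    for age in ages:
        iso = toy_isochrone(age)
        for dm in moduli:
            models = [color_at(v - dm, iso) for v, _ in ridge]
            if any(c is None for c in models):
                continue
            score = sum((obs - mod) ** 2
                        for (_, obs), mod in zip(ridge, models)) / len(ridge)
            if best is None or score < best[0]:
                best = (score, age, dm)
    return best

# Synthetic ridge line: the 14 Gyr toy isochrone moved to (m - M) = 15.0.
truth = toy_isochrone(14.0)
ridge = [(mv + 15.0, color) for mv, color in truth if 2.5 <= mv <= 6.5][::5]
moduli = [14.5 + k / 10.0 for k in range(11)]
score, age, dm = fit_age_and_modulus(ridge, (12.0, 14.0, 16.0, 18.0), moduli)
print(age, dm)
```

In this toy setting the sloping main sequence pins down the distance modulus while the position of the turnoff selects the age; with real data the two parameters are far more strongly coupled.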

Figure 6.11 Illustration of the distance-independent age diagnostics,
$\Delta V$ and $\Delta(B - V)$, as calculated from the principal sequences in the
CM diagram for a typical globular cluster.


We must take some care when interpreting the apparent success of this
fitting procedure. As we have seen in §5.1, there are still considerable shortcomings
in our understanding of stellar structure, particularly with regard to
the treatment of convection. There are also still non-negligible uncertainties
that arise from the conversion of model isochrones, for which we calculate
luminosities and temperatures, into the observed CM diagram (§5.1.1). It is
therefore not surprising that the calculated isochrones usually fail to match
the observed CM diagrams of globular clusters exactly. For this reason it is
customary to apply a correction to the isochrones in the form of a small color
shift, $\delta(B - V)$, in order to obtain a good fit between theory and observation.
With good isochrone models such as those illustrated in Figure 6.10, these
shifts are now typically only $\delta(B - V) \approx 0.02$. Nevertheless, since they are
non-zero we must still be missing at least some of the detailed physics of 
these systems. Moreover, there is no reason why this residual error should 
take the form of a simple color shift for all stars, so we must be careful 
not to over-interpret the success or failure of different models to match the 
observations at this level. It is also worth noting that the color at which 
the turnoff occurs provides one of the major constraints in the isochrone fit 
procedure for tying down a cluster’s age, and so the systematic uncertainty 
in color can represent a significant systematic error in the estimated age — 
see Problem 6.2. 


The $\Delta V$ method One of the biggest problems in the procedures described
above is that the derived age is strongly coupled to the calculated
distance modulus: if an isochrone has a turnoff point that lies above the 
observed value for a cluster, we could improve the fit either by using an 
older isochrone or by increasing our estimate for the distance to the cluster. 
A 10% uncertainty in the distance to a cluster corresponds to an error of 
~ 0.2 in its distance modulus and hence the absolute magnitude scale for its 
CM diagram. Such an uncertainty in $M_V({\rm TO})$ yields a 20% error in the age
inferred from equation (6.2). Similarly, a small uncertainty in the correction 
for extinction can lead to a sizeable error in the derived age. 
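
The arithmetic behind these figures can be checked directly. The sketch below assumes only the standard definition of the distance modulus and the coefficient 2.70 of the log t term in equation (6.2); the function names are invented for illustration.

```python
import math

def modulus_error(fractional_distance_error):
    """Shift in distance modulus, 5 log10(1 + delta_d / d)."""
    return 5.0 * math.log10(1.0 + fractional_distance_error)

def fractional_age_error(delta_mv, slope=2.70):
    """Fractional change in t implied by a shift delta_mv in M_V(TO),
    using the slope of the log10(t) term in equation (6.2)."""
    return 10.0 ** (delta_mv / slope) - 1.0

dm_err = modulus_error(0.10)
print(dm_err, fractional_age_error(dm_err))
```

A 10% distance error gives a modulus shift of about 0.21 magnitudes and a fractional age error just under 0.2, consistent with the rounded values quoted above.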

We can circumvent these problems by instead basing our age estimate on 
the quantity $\Delta V = M_V({\rm TO}) - M_V({\rm HB})$, the difference in magnitude between
the turnoff point and the horizontal branch in a cluster’s CM diagram (see
Figure 6.11). The brightness of the horizontal branch depends little on the
mass of stars which populate it (and hence the age of the cluster) while
the brightness of the turnoff point decreases quite rapidly with time, and
so the value of $\Delta V$ should provide a useful measure of cluster age. Since
this parameter depends only on the relative brightnesses of these features,
it is independent of the adopted distance modulus. The clear definition of
the turnoff point and horizontal branch in the CM diagram means that they
can be identified without calibrating the absolute color of the cluster, so the
measurement of $\Delta V$ is not compromised by uncertainties in absolute colors
due to reddening. Further, since these features have comparable intrinsic
colors, any reddening will affect the absolute magnitude of both by the same
amount, and so we do not have to correct $\Delta V$ for any differential reddening
effects. Clearly, $\Delta V$ provides a very robust diagnostic for investigating the
ages of globular clusters; its only negative characteristic is the difficulty in
specifying $M_V({\rm TO})$ mentioned above. Combining equations (5.4) and (6.2),
we find that


$$ \Delta V = 2.70 \log(t/{\rm Gyr}) + 0.13\,{\rm [Fe/H]} + 0.59. \tag{6.3} $$


Hence $\Delta V$ depends strongly on the age of the cluster, and only weakly on
metallicity⁵ (see Problem 6.3).
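
The weak metallicity dependence can be made concrete by inverting equation (6.3) at fixed $\Delta V$. In the sketch below the function name is invented, and the input $\Delta V = 3.55$ together with the metallicities −2.5, −1.5 and −0.5 are used as representative values for globular clusters.

```python
def delta_v_age_gyr(delta_v, fe_h):
    """Invert Delta V = 2.70 log10(t/Gyr) + 0.13 [Fe/H] + 0.59 for t in Gyr."""
    return 10.0 ** ((delta_v - 0.13 * fe_h - 0.59) / 2.70)

# Same Delta V, three metallicities spanning the globular cluster range.
for fe_h in (-2.5, -1.5, -0.5):
    print(fe_h, round(delta_v_age_gyr(3.55, fe_h), 1))
```

Across two full dex in metallicity the inferred age changes by only a few Gyr about a central value near 15 Gyr, whereas a shift of 0.2 magnitudes in $\Delta V$ alone would change the age by nearly 20%.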

The values of AV derived for globular clusters are all remarkably similar: 
a compilation of data for 43 clusters by Chaboyer, Demarque & Sarajedini 
(1996) found that $\Delta V$ has a mean value of 3.55, and a standard deviation
from cluster to cluster of only ~ 0.15. The small standard deviation is close
to what we would expect from the measurement errors alone, so there is
very little intrinsic variation in this quantity between clusters (but see also
§6.1.6). Since the globular clusters in this sample have metallicities in the
range $-2.5 < {\rm [Fe/H]} < -0.5$, equation (6.3) implies that their ages are
~ 15 ± 1 Gyr.


The $\Delta(B - V)$ method Other distance-independent properties of the
CM diagram have also been employed as cluster age estimators. Sarajedini &
Demarque (1990) advocate the use of the quantity $\Delta(B - V) = (B - V)_{\rm SGB} -
(B - V)_{\rm TO}$, the difference between the color of the point at which the SGB
turns up toward the RGB and the color of the turnoff point (see Figure 6.11).
Like the $\Delta V$ method, this approach does not depend on the adopted distance
modulus or reddening correction. It has the further advantage that $\Delta(B - V)$
turns out to be almost independent of metallicity, since both $(B - V)_{\rm SGB}$ and
$(B - V)_{\rm TO}$ vary with heavy-element abundance in almost the same way. A
similar technique has been developed by VandenBerg, Bolte & Stetson (1990),
who measure the color and magnitude of the base of the giant branch relative
to both the turnoff point and the MS of a cluster’s CM diagram.


5 The amount of variation in $\Delta V$ with metallicity is dictated by the still-disputed
metallicity dependence of $M_V({\rm HB})$ (Buonanno, Corsi & Fusi Pecci 1989, Chaboyer,
Demarque & Sarajedini 1996).

The principal disadvantage of methods that compare the colors of features
in the CM diagram is that uncertainties in the physics of stellar structure
calculations affect colors far more than absolute magnitudes. The luminosity
of a star is fixed by the rate at which energy is generated by nuclear
fusion in its core, which is a relatively well-understood process. The stellar
color, on the other hand, depends on the effective temperature of the star,
which in turn is set by its radius [see equation (3.15)]. The radius of the star
is dictated by the outer structure of the star, which depends on the uncertain
physics of convection and atomic opacities. The $\Delta(B - V)$ method is
therefore more appropriate for comparing the relative properties of clusters
to see if they differ (see §6.1.6) rather than for estimating absolute ages. It is
nonetheless heartening that the observed values of $\Delta(B - V)$ are quite consistent
with those calculated from isochrone models with ages of ~ 15 Gyr
(Sarajedini & Demarque 1990).


Comparison with the age of the Universe The techniques described 
above, together with various hybrid combinations of these methods, all pro- 
duce a fairly consistent picture showing that globular clusters formed some 
~ 15 Gyr ago. Quite how early globular clusters formed in the evolution of 
the Universe is an open question. However, the presence of quasars at red- 
shifts as high as z = 5 implies that massive, well-ordered structures formed 
early on, and the presence of carbon, nitrogen and oxygen emission lines 
in these quasars shows that the stars which synthesize such elements must
already have formed. If we allow ~ 1 Gyr for the formation of globular clus- 
ters (Sandage 1993a), we thus find that the Universe is ~ 16 Gyr old. This 
value conflicts with the age of the Universe as predicted by its cosmological 
expansion. As we shall see in Chapter 7, most estimates of cosmological 
parameters favor an age significantly less than 16 Gyr; it is only by adopting 
one of the most extreme acceptable cosmological models that we can obtain 
an age for the Universe as large as 16 Gyr. Thus, either we must be prepared 
to adopt these most extreme acceptable cosmological models, or we must 
reconsider the absolute globular cluster age determinations. 

The traditional view is that the weakness probably lies in the measure- 
ments of cosmological parameters, since there are still a number of internal 
inconsistencies in their derivation, while the age estimates of globular clusters
provide a more coherent picture. However, the degree of consistency in globular
cluster age estimates may be deceptive. The absolute calibration of all of 
these estimates is obtained from isochrone calculations. Thus, any error in 
these calculations will alter all the age estimates in a correlated manner. Un- 
certainties in basic physics such as the opacities of stellar atmospheres affect 
the properties of stars at all stages in their evolution, so age estimates that 
use different features in the CM diagram will all be affected by one incorrect 
physical assumption. Errors in different stages of the stellar structure cal- 
culations are also strongly correlated because of the evolutionary nature of 
the problem. If, for example, we had missed some important piece of physics 
which shortens the MS lifetime of all stars by $\Delta t$, then stars would reach
the MS turnoff and all subsequent points in the CM diagram $\Delta t$ sooner than
our calculations had predicted, but the locations of the sequences would be
unaffected. Thus, all the methods for comparing theory and observation would
consistently indicate that the two matched, but the derived ages would all
be out by $\Delta t$. We cannot anticipate what pieces of physics may be missing
from the existing stellar structure calculations, nor how large an effect they
may have on globular-cluster age estimates. However, the recent discussion
of the effects of helium diffusion (§5.1), which may lower MS lifetimes by
as much as $\Delta t \sim 2$ Gyr (Proffitt & Michaud 1991, Chaboyer, Sarajedini &
Demarque 1992), should give some impression of the possible impact of phe- 
nomena which have yet to be incorporated. It is still quite possible that 
future changes in the modeling of globular clusters will lower the estimates 
of their ages to a point which is compatible with less extreme values of the 
cosmological parameters. 


Variations in age We have seen that globular clusters are all old, but 
are they all equally old? Since the uncertainty in individual cluster ages is 
at least 2 Gyr, this is not an easy question to answer. Fortunately, however, 
much of the uncertainty in age determinations takes the form of an unknown 
systematic offset in absolute age, so it is possible to estimate age differences 
between clusters with very much greater accuracy [see Stetson, VandenBerg 
& Bolte (1996) for a review of relative age determinations for globular clus- 
ters]. Since the uncertainties in isochrone calculations depend on errors in 
our treatment of such features as convection theory and opacities, which, in 
turn, depend on the metallicity of the system, we can further minimize the 
errors in calculating relative ages by comparing systems with similar
metallicities. For example, in the case of the equal-metallicity pair NGC 288 and
NGC 362 (see page 338), the $\Delta(B-V)$-like methods imply that NGC 362 is
$3 \pm 1$ Gyr younger than NGC 288 (Sarajedini & Demarque 1990, vandenBerg,
Bolte & Stetson 1990). Analysis of the cluster Pal 12, which has comparable
metallicity ([Fe/H] = $-1.1 \pm 0.2$), implies that it is more youthful still,
some ~ 4 Gyr younger than NGC 362 (vandenBerg, Bolte & Stetson 1990).
Similarly, M3 ([Fe/H] = $-1.7 \pm 0.1$) appears to be ~ 1 Gyr older than M13
([Fe/H] = $-1.7 \pm 0.1$), although this measurement lies right at the lower
limit of what can be determined from the relative colors of the turnoff and 
giant branch (vandenBerg, Bolte & Stetson 1990). It is notable that age 
differences seem to become smaller as we look to lower metallicity clusters. 

The $\Delta V$ method has also been applied to the measurement of relative
cluster ages. The points in Figure 6.12 show the values of $\Delta V$ for a sample
of globular clusters as a function of metallicity. The lines in this figure show
the predictions of equation (6.3) for four cluster ages. Much of the scatter
in the observed values can be attributed to observational uncertainties, but
for given metallicity there does appear to be a real spread in $\Delta V$ consistent
with variations in cluster ages of ~ 6 Gyr (Chaboyer, Demarque & Sarajedini
1996). In Figure 6.12 the points do not scatter about any one of the sloping
lines of constant age; if anything, $\Delta V$ decreases rather than increases with
metallicity, so the most metal-rich globular clusters are, on average, several
billion years younger than the metal-poor clusters.

Figure 6.12 Difference $\Delta V$ in V magnitude between the horizontal
branch and the turnoff point as a function of metallicity for a
sample of globular clusters. The lines show the values of $\Delta V$ as
predicted for the ages shown. [From data published in Chaboyer,
Demarque & Sarajedini (1996)]

This correlation could reflect a general trend in the cluster population, 
but it could also be explained if the Milky Way has tidally captured a few 
young, metal-rich globular clusters from the Large Magellanic Cloud — such 
clusters would populate the lower right-hand corner of Figure 6.12 (Lin & 
Richer 1992). 


6.1.5 Metallicities of globular clusters 


As we have seen above, most of the variations between the shapes of se- 
quences in globular cluster CM diagrams can be attributed to differences 
between the metallicities of clusters. Once again, the narrowness of the sequences
in a particular cluster implies that the cluster must be chemically
homogeneous, as a range in metallicity at a particular evolutionary point 
would broaden the sequence dramatically. 

Direct observations of line strengths in stellar spectra generally confirm 
this conclusion, but there are a few elements that provide notable exceptions.
In particular, the fraction of O relative to Fe in red giant stars has been 
found to vary by more than a factor of ten within a single cluster (e.g. 
Kraft et al. 1992), and these abundance variations correlate with location 
on the RGB in the sense that stars higher up the sequence have lower O 
abundances (Kraft et al. 1993). Oxygen’s role in the CNO hydrogen burning 
cycle [equation (5.3)] can cause its abundance to be depleted in the cores 
of stars. Thus, one credible explanation for the low abundances of O in the 
atmospheres of some stars is that the CNO cycle has reduced the O content of 
their cores, and some mixing process has then dredged this depleted material 
up to the stars’ surfaces. In this picture the tendency for O depletion to 
increase as one ascends the RGB arises because stars further up the RGB are 
at more advanced evolutionary stages than those lower down the sequence, so 
their convective envelopes will have had longer to redistribute their oxygen- 
depleted cores into their envelopes. A test of this hypothesis is possible 
because the CNO cycle conserves the total amount of C, N and O: all O and 
C lost in the process is converted into N. Comparisons between data and 
model atmosphere calculations do, indeed, imply that although [O/Fe] varies 
significantly from star to star, [C+N+O/Fe] is approximately constant for all 
the stars in a single cluster (e.g. Dickens et al. 1991, Brown, Wallerstein & 
Oke 1991). However, the situation is not clear-cut, since there is considerable 
scatter in this relation, with stars from the same cluster at the same location 
in the CM diagram containing very different O abundances. This apparently 
random element in the depletion of O in red giant atmospheres is not yet 
understood. 

Other observed element anomalies are also puzzling. Cohen (1978) found 
that the abundance of Na varies by an order of magnitude in the giant stars 
of M3. On the basis of a very small sample of stars, Peterson (1980b) sug- 
gested that the Na abundance was anti-correlated with the O abundance, 
and this result was subsequently confirmed by Sneden et al. (1992). Sim- 
ilarly, Al abundance is also found to be anti-correlated with O abundance 
(e.g. Norris & Da Costa 1995). It is believed that proton capture reactions
can significantly enhance the abundances of Na and Al in stellar cores
(Langer, Hoffman & Sneden 1993), and so the same processes which dredge 
up oxygen-deficient material may also bring the sodium- and aluminum- 
enhanced material to the surface of the star, explaining the anti-correlation 
between these abundances. However, we should stress at this point that the 
process by which this mixing occurs is not understood: the convective zones 
predicted by stellar structure calculations do not reach deep enough into the 
stars’ interiors to drag material with modified O, Na and Al abundances to 
their surfaces. Some additional, as yet unknown, mixing process must be 
hypothesized if the observed chemical abundances are to be explained by 
nuclear processes within the present generation of stars. 


ω Cen Other element abundance variations have proved undetectably
small in all globular clusters bar one, namely ω Cen, the most luminous cluster
in the Milky Way. In this cluster, the RGB has a large intrinsic width rather
than forming the usual narrow sequence in the CM diagram, and it was early 
on suggested that this width might arise from a spread in chemical composi- 
tion amongst its red giant stars (Dickens & Woolley 1967). Direct confirma- 
tion of this hypothesis came from spectral observations of giant stars, which 
showed that those on the blue side of the giant branch have Fe abundances 
a factor of ten lower than those on the red side (Mallia & Pagel 1981). Sub- 
sequently, high-resolution spectra of stars in this system have shown that 
the abundances of a wide range of elements vary from star to star (Norris & 
Da Costa 1995). Although the abundances of heavy elements all track one 
another to some extent, the relationships are not always linear. For example, 
while the Fe abundance varies by a factor of ten within ω Cen, the abundance
of Ba varies by a factor of almost a hundred. These variations cannot be 
explained by the self-enrichment and mixing scenario described above, since 
many of the heavier elements which are observed to be enhanced in the red 
giants of ω Cen are not produced in such low-mass stars. However, we also
see the same correlations between C, N, O, Na and Al abundances that have 
been observed in other clusters. It therefore seems likely that the chemical 
mix in ω Cen results from a combination of a strange primordial distribution
of abundances and self-enrichment within the current generation of stars. 
Models can be cooked up to produce the required primordial abundances us- 
ing an earlier generation of massive stars (e.g. Norris & Da Costa 1995), but 
these ad hoc explanations beg the question of why ω Cen is so different from
the other globular clusters in the Galaxy. Perhaps the explanation lies in 
this cluster’s other unusual properties such as its large size and high rotation 
rate (§6.1.11). 


6.1.6 The third parameter problem 


We have seen that to a large extent we can explain the variations between 
cluster CM diagrams in terms of variations in metallicity [Fe/H] and age t, 
with metallicity the dominant factor. It has, however, proved impossible to 
account fully for variations in CM diagrams using only [Fe/H] and t. For
example, Catelan & de Freitas Pacheco (1993) have investigated the difference
in age $\Delta t$ between the equal-metallicity clusters NGC 288 and NGC 362 (see
page 338). From the sequences near the turnoff they deduce $\Delta t = 3$ Gyr,
while the HB morphologies are compatible with $\Delta t$ being as small as this
only if the clusters are both implausibly young ($t \lesssim 10$ Gyr). Similarly, no
combination of metallicity and age in standard stellar-structure calculations
can reproduce the bimodal HBs seen in some clusters (see Figure 6.5). Either
there is some fundamental failing in our understanding of stellar evolution,
or there must be some other factor beside metallicity and age which dictates 
the properties of globular clusters. In the literature this unknown factor 
is generally referred to as the second parameter because it was mooted 
before variations in cluster ages had been established. 
Various candidate third parameters have been proposed, which we now
discuss. 


Variations in helium abundance The earliest suggestion was that the 
initial helium abundance varies from cluster to cluster, as such variations 
can reproduce the anomalous horizontal branches seen in some metal-poor 
clusters (Sandage & Wildey 1967). Unfortunately, the helium abundances 
of globular clusters are not amenable to direct spectroscopic measurement 
since helium lines are only excited at high temperatures, whereas globular 
clusters are populated by old, cool stars. Therefore, less direct measurements 
of helium abundance must be employed. For example, stellar-evolution cal- 
culations have shown that the ratio between the time that a star spends on 
the horizontal branch of the CM diagram and the time it spends on the red 
giant branch, $R = \tau_{\rm HB}/\tau_{\rm RGB}$, depends sensitively on the assumed helium
abundance, but very little on the metallicity, age, or any other property of
the star (Iben & Rood 1969). The number of stars in a cluster that we see
at stage $i$ of their evolution is $N_i = n_i \tau_i$, where $n_i$ is the rate at which stars
enter and leave that stage. Since stars leaving the red giant branch evolve
directly on to the horizontal branch, $n_{\rm RGB} = n_{\rm HB}$, and so $R = N_{\rm HB}/N_{\rm RGB}$.
Thus, by measuring the ratio of the number of stars on the horizontal branch 
to the number on the red giant branch of a cluster’s CM diagram, we have 
a method for determining the helium abundance of the cluster. Applica- 
tion of this method to a number of clusters shows that they have a mean 
helium abundance of Y = 0.23, with a dispersion from cluster to cluster of 
$\sigma_Y = 0.02$ (Buzzoni et al. 1983). This dispersion is directly comparable to
the errors inherent in the method, so the observations are entirely consistent 
with there being no real variations at all in helium abundance. The failure 
of such techniques to detect variations in helium abundance has led people
to look elsewhere for the third parameter. However, it should be borne
in mind that these indirect methods do not provide very accurate measure- 
ments of Y, so it is still possible that as-yet unobservable variations in helium 
abundance may have significant impact on the properties of globular clusters 
(Shi 1995). 
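
The star-count step of this method can be sketched in a few lines. The example below is only an illustration under stated assumptions: the counts are hypothetical, the two counts are treated as independent Poisson variables, and the function name `hb_rgb_ratio` is ours. The conversion from R to Y requires a calibration from stellar-evolution calculations, which is not reproduced here.

```python
import math


def hb_rgb_ratio(n_hb, n_rgb):
    """Return R = N_HB / N_RGB and its approximate Poisson uncertainty.

    Assumes the two counts are independent Poisson variables, so that
    their fractional errors add in quadrature.
    """
    if n_hb <= 0 or n_rgb <= 0:
        raise ValueError("star counts must be positive")
    r = n_hb / n_rgb
    sigma_r = r * math.sqrt(1.0 / n_hb + 1.0 / n_rgb)
    return r, sigma_r


# Hypothetical counts, chosen for illustration only.
r, sigma_r = hb_rgb_ratio(n_hb=140, n_rgb=100)
print(f"R = {r:.2f} +/- {sigma_r:.2f}")
```

Even with a few hundred stars the fractional uncertainty in R exceeds ten per cent, which gives some feeling for why the errors inherent in the method are comparable to the measured cluster-to-cluster dispersion.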


Variations in other element abundances More recently, attention has 
shifted to the abundances of the heavier trace elements. Particular attention 
has focused on C, N and O. These elements form the catalyst in the CNO 
cycle, which provides one of the channels by which H is burned into He in 
MS stars [equation (5.3)]. Increasing the abundance of these elements thus
increases the efficiency of hydrogen burning, which decreases the time that a 
star spends on the MS. Further, the mixture of these trace heavy elements in 
a star’s atmosphere also affects its opacity, which in turn dictates the star’s 
structure. Thus, varying the abundances of C, N and O relative to Fe and 
other heavy elements can dramatically alter the appearance of a cluster’s 
CM diagram, and it has been suggested that the relative abundance of these 
elements to iron, [C+N+O/Fe], may vary from cluster to cluster. An ideal 
test of the hypothesis that this quantity is the third parameter is provided
by the equal-metallicity pair NGC 288 and NGC 362, since, as we have seen, 
their third parameters must differ. However, spectroscopic observations by 
Dickens et al. (1991) have shown that the two clusters have indistinguishable 
values of [C+N+O/Fe], and so the third parameter phenomenon cannot be
explained by these abundances. 


Other candidates The third parameter is something that affects HB morphology.
We know from §6.1.3 that the amount of mass, $\Delta\mathcal{M}$, lost between
the RGB and the HB is a primary factor in determining HB morphology.
Does the distribution of the stochastic variable $\Delta\mathcal{M}$ [see equation (6.1)]
vary systematically from cluster to cluster? Clearly the cause of any such 
variation would be a third parameter of globular cluster CM diagrams. 

It is possible that a dynamical factor like rotation could explain the 
differences: the envelopes of faster-rotating stars will be less tightly bound 
than their slower-rotating equivalents, so we might expect that they would be 
more prone to mass loss. If the initial conditions from which a cluster formed 
favored the production of fast-rotating stars, we might therefore expect such 
a cluster to have a blue horizontal branch (Suntzeff 1981). However, the ab- 
sence of any correlation between HB morphology and the measured rotation 
rates of stars which populate the HB casts serious doubt on this explanation 
(Peterson, Rood & Crocker 1995). An alternative dynamical explanation is 
suggested by the analysis of Fusi Pecci et al. (1993), which showed that the 
HB morphology is correlated with the spatial distribution of stars in the clus- 
ter (see §6.1.10) in the sense that more centrally-concentrated clusters have 
bluer HBs. Perhaps, therefore, the blue colors of some horizontal branches 
are related to stellar encounters in clusters of higher-than-average density. 

As more high-quality observations of globular clusters have been made, 
the constraints on evolutionary models of these systems have become tighter 
and tighter. It is therefore not entirely surprising that the number of param- 
eters required to describe their properties satisfactorily has also increased. 
Metallicity and (to a lesser extent) age are established as quantities that 
vary from cluster to cluster, but there must also be other internal factors 
that affect their properties. As we shall see in §10.5.1, cluster properties also 
depend on external factors such as their location within the Galaxy. The full 
story of globular-cluster CM diagrams has yet to be unraveled. 


6.1.7 Luminosity functions 


As well as examining the regions of the CM diagram that are populated by 
globular cluster stars, it is also instructive to see how many stars occupy 
each region: the number of stars of different types that we see today tells 
us about the efficiency with which the different stellar types were produced 
and the way in which these stars subsequently mutate as they evolve. The 
simplest observational measure of the frequencies of different stellar types 
is the luminosity function of the cluster stars (§3.6.3). Measuring the luminosity
function is a more difficult task than the construction of a useful
CM diagram since source confusion plays a more damaging role: if we were 
to fail to detect some fraction of the stars shown in Figure 6.2, we would 
still be able to pick out the principal sequences in the diagram; however, we 
would obtain a totally meaningless luminosity function from these data if 
their incompleteness varied with magnitude. It is only by carefully quantify- 
ing any incompleteness due to crowding that we can calculate the luminosity 
function. 

Early analyses were based on photographic plates without the benefit 
of sophisticated crowded-field photometric techniques. They therefore con- 
centrated on measuring the luminosity function of only the relatively-scarce 
bright stars for which source confusion is not a serious problem. Since the 
most luminous stars are also the ones that evolve most rapidly, we would 
expect this bright end of the luminosity function to be the part of the dis- 
tribution of luminosities most heavily affected by stellar evolution. This 
expectation is borne out by the observations: Figure 6.13 shows the signifi- 
cant difference between the luminosity functions for bright stars in a typical 
globular cluster and bright stars in the disk of the galaxy (§3.6.3). The 
globular-cluster luminosity function shows a deficit of stars in the range 
$1 \lesssim M_V \lesssim 4$. This deficit occurs because stars in this luminosity range
in globular clusters are subgiants and giants; these stars pass through this
evolutionary stage very rapidly compared to the MS stars of comparable luminosity,
so our “snapshot” view of a single point in the life of a globular
cluster catches relatively few stars with these luminosities. The excess in the
globular cluster luminosity function in the range $-3 \lesssim M_V \lesssim 0$ reflects the
well-developed RGBs of clusters and the small number of MS field stars at
such high luminosities. The bump at $M_V \sim 0$ in Figure 6.13 is produced
by the globular-cluster stars on the HB, which places a lot of evolved stars 
at very similar magnitudes (§6.1.2). Clearly, the shape of the bright end of 
the globular-cluster luminosity function is dictated by the processes of stellar 
evolution. 

As we saw in §6.1.2, at magnitudes fainter than $M_V \sim 4$, most of the observed
stars in globular clusters have yet to evolve away from the MS, so their
properties more closely reflect the initial stellar content of the cluster. These 
stars are fainter and more abundant than the evolved stars discussed above, 
so the analysis of their luminosity function has had to await the improved 
techniques for dealing with source confusion discussed in §6.1.1. During the 
1980s, deep CCD images were obtained for a number of nearby clusters, and 
these data were analyzed with crowded-field photometry packages in order 
to calculate the luminosity functions of these systems down to magnitudes 
of $M_V = 9$ and beyond, well into the part of the function made up from
MS stars. The luminosity functions for a sample of clusters are presented in
Figure 6.14. It is apparent from this figure that the MS portion of globular- 
cluster luminosity functions is not universal, but varies systematically with 
cluster metallicity.

The dependence of the luminosity function on metallicity is not sur- 
prising: as we saw in §6.1.2, the luminosity of a star depends on the star’s 
heavy element abundance as well as on its mass, so it would be an amazing 
coincidence if the distribution of luminosities were identical for clusters with 
very different metallicities. To examine this issue more quantitatively, recall 
that the IMF is globally quite well-described by the Salpeter form of equa- 
tion (5.14). Let us assume that the stars in a globular cluster formed with a 
similar power-law mass function such that the number of stars with masses 
between $\mathcal{M}$ and $\mathcal{M} + d\mathcal{M}$ was initially

$$\xi(\mathcal{M})\,d\mathcal{M} \propto \mathcal{M}^{-\alpha}\,d\mathcal{M}, \tag{6.4}$$

where the index $\alpha$ is as yet unknown. Since stars only lose significant amounts
of mass when they evolve on to the subgiant branch and beyond, this function
will also describe the mass distribution of MS stars. The stellar evolution
calculations, which have been so successful in reproducing the CM diagrams
of globular clusters (§6.1.2), also tell us how the absolute magnitude of a MS
star in the cluster is related to its mass, $M(\mathcal{M})$. Since this quantity varies
monotonically for MS stars, we can invert it to obtain $\mathcal{M}(M)$; thus, we can
use equation (5.13) to calculate the luminosity function $\Phi(M)$ of MS stars
which corresponds to the mass function of equation (6.4).
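
The change of variables described above can be sketched numerically. Equation (5.13) is not reproduced in this section, so the sketch assumes it has the usual form $\Phi(M) = \xi(\mathcal{M})\,|d\mathcal{M}/dM|$. The mass-magnitude relation used here is a hypothetical toy (luminosity proportional to a power of the mass), not the output of a stellar-evolution calculation, and the constants `ALPHA`, `BETA` and `M_REF` are illustrative assumptions.

```python
ALPHA = 2.35  # assumed mass-function index (illustrative value)
BETA = 3.5    # assumed exponent in the toy relation L ∝ mass**BETA
M_REF = 4.8   # assumed absolute magnitude of a star of unit (solar) mass


def mass_from_magnitude(abs_mag):
    """Invert the toy relation M = M_REF - 2.5 * BETA * log10(mass)."""
    return 10.0 ** (-(abs_mag - M_REF) / (2.5 * BETA))


def mass_function(mass, alpha=ALPHA):
    """Unnormalized power-law mass function of equation (6.4)."""
    return mass ** (-alpha)


def luminosity_function(abs_mag, alpha=ALPHA, step=1e-4):
    """Phi(M) = xi(mass) * |d mass / dM|, using a central difference."""
    dmass_dmag = (
        mass_from_magnitude(abs_mag + step)
        - mass_from_magnitude(abs_mag - step)
    ) / (2.0 * step)
    mass = mass_from_magnitude(abs_mag)
    return mass_function(mass, alpha) * abs(dmass_dmag)


# Tabulate the (unnormalized) luminosity function over MS magnitudes.
for abs_mag in (5.0, 6.0, 7.0, 8.0, 9.0):
    print(abs_mag, luminosity_function(abs_mag))
```

With this featureless toy relation the luminosity function is a pure exponential in magnitude. Any structure in a real luminosity function computed this way must therefore come from the detailed shape of $M(\mathcal{M})$, which in practice would be taken from metallicity-dependent stellar models in place of `mass_from_magnitude`.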

The results of such a calculation are presented in Figure 6.15, which 
shows the luminosity functions obtained for a range of indices $\alpha$ and two
different values of metallicity. Comparing Figures 6.14 and 6.15, it is clear 
that the differences between the luminosity functions for different clusters can 
largely be explained by their different metallicities. Moreover, the shapes of 
the observed luminosity functions are well-reproduced by the simple power-law
mass function. Hence, as in §5.1.9, we conclude that the structure in the
luminosity function arises from the details of the relation between luminosity
and mass rather than any complexity in the distribution of stellar masses.

Figure 6.14 The luminosity functions of globular clusters at magnitudes
dominated by MS stars, normalized to a common value at
$M_V = 5.25$. The names of the individual clusters are shown. The
thickness of the lines increases with the metallicity of the clusters.
[After McClure et al. (1986)]

Although the luminosity functions in Figure 6.14 can all be adequately
reproduced by a power-law mass function, the required power-law index $\alpha$
varies from cluster to cluster: there is no universal mass function for globular
cluster stars. McClure et al. (1986) find that $\alpha$ is smaller for metal-rich
clusters than for metal-poor ones. This result implies that metal-rich clusters
contain a larger fraction of high-mass stars. This correlation suggests that the
extra high-mass stars in some clusters might have chemically enriched their 
environment early in the lifetime of the cluster, raising the metallicity of other 
stars that form (see Problem 6.4). It is also possible that the observed mass 
function is the result of nurture rather than nature in the sense that clusters 
all formed with a common IMF, but subsequent evolutionary effects have 
then altered their mass functions in different ways. For example, Capaccioli, 
Piotto & Stiavelli (1993) have suggested that clusters which form in the inner, 
high-metallicity parts of the Galaxy will have had more of their low-mass 
stars stripped away through interactions with the Galactic disk, resulting in
flatter present-day mass functions in these systems. Strong evidence that
environment plays a key role in dictating the properties of globular clusters
has come from the discovery that the slope of the mass function is more
strongly correlated with a cluster’s location in the Milky Way than it is with
the system’s metallicity (Djorgovski, Piotto & Capaccioli 1993).

Figure 6.15 The predicted luminosity functions of globular clusters
for a power-law mass function, normalized to a common value
at $M_V = 5.25$. The heavy lines were calculated assuming a metal
abundance of Z = 0.006 (comparable to that of 47 Tuc), and the
light lines for Z = 0.0001 (comparable to that of M15). Each line
is annotated by the adopted mass function index, $\alpha$. These curves
were calculated using the stellar evolution model of vandenBerg
& Bell (1985), assuming an age of 16 Gyr. [After McClure et al.
(1986)]

If $\alpha > 1$, then integrating equation (6.4) over mass down to arbitrarily low
masses we find that there would have to be infinitely many low mass stars in the
cluster. This result
may seem implausible, but there is no observational evidence against it. With
its superior imaging capabilities, HST has traced globular-cluster luminosity
functions to very faint magnitudes. Figure 6.16 shows the results of several
such studies. In every case, the luminosity function falls at the faint end.
However, as we have discussed in §5.1.9, this drop is entirely consistent with
a power-law mass function with $\alpha > 1$, and merely reflects the steep nature of
the function $M(\mathcal{M})$ at low masses. The power-law form of $\xi(\mathcal{M})$ could break
down for $\mathcal{M} < 0.2\,\mathcal{M}_\odot$, but the large uncertainties in the form of $M(\mathcal{M})$ are
such that the data currently provide no evidence for such a feature (Elson et
al. 1995).

Figure 6.16 HST I-band measurements of the faint end of the
MS luminosity function for the globular clusters ω Cen, 47 Tuc,
M15 and NGC 6397; the open clusters NGC 2420 and NGC 2477;
and (for comparison) the solar neighborhood. The normalizations
of the different functions are arbitrary. [After von Hippel et al.
(1996)]
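
The divergence in the number of low-mass stars can be made explicit by integrating equation (6.4) between assumed cutoffs $\mathcal{M}_{\min}$ and $\mathcal{M}_{\max}$; these limits are introduced here purely for illustration. The companion result for the total mass is a standard integral added for context.

```latex
% Number of stars between the (assumed) cutoffs, for \alpha \neq 1:
N \propto \int_{\mathcal{M}_{\min}}^{\mathcal{M}_{\max}}
      \mathcal{M}^{-\alpha}\,d\mathcal{M}
  = \frac{\mathcal{M}_{\min}^{\,1-\alpha} - \mathcal{M}_{\max}^{\,1-\alpha}}
         {\alpha - 1} .
% For \alpha > 1 the first term grows without limit as
% \mathcal{M}_{\min} \to 0, so the number of stars diverges.

% Total mass in the same range, for \alpha \neq 2:
\mathcal{M}_{\rm tot} \propto \int_{\mathcal{M}_{\min}}^{\mathcal{M}_{\max}}
      \mathcal{M}^{\,1-\alpha}\,d\mathcal{M}
  = \frac{\mathcal{M}_{\max}^{\,2-\alpha} - \mathcal{M}_{\min}^{\,2-\alpha}}
         {2 - \alpha} ,
% which remains finite as \mathcal{M}_{\min} \to 0 provided \alpha < 2.
```

Thus for $1 < \alpha < 2$ an unbroken power law would imply an unbounded number of stars but only a finite total mass in them.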


6.1.8 Binary stars 


Binary stars play an important role in the evolution of globular clusters. The 
presence of even a small fraction of binary stars can affect the way in which 
the whole structure of the cluster evolves with time (see §6.1.10), and many 
of the oddities that we find in clusters such as millisecond pulsars (see §6.1.9) 
and blue stragglers may be connected with binary-star progenitors. Up to 
50% of field stars are found to be members of binaries, but there is no reason 
to suppose that the fraction in clusters is similar since the environment is so 
different. Binaries which formed with large separations will tend to be torn 
apart by tidal forces, either due to the cluster as a whole or due to encounters 
with single passing stars. Conversely, in the high density environment of the 
cluster, there will be occasional very close encounters between single stars 
which can then be tidally captured into binaries. It is not obvious where 
the equilibrium between these processes lies, but, on the average, we would 
expect binaries in globular clusters to occur at smaller separations than
those in the field. 

The observational techniques to search for binary stars in globular clusters
have been extensively reviewed by Hut et al. (1992). Perhaps the simplest
way to detect binaries is from the periodic Doppler shifts in their spectral
lines (see §3.1.2). Since the requisite high quality spectra can only be
obtained for the brightest globular cluster stars, this technique is only use- 
ful for searching for giant-star binaries. Spectroscopic studies uncovered 
very few binaries in globular clusters: for example, Pryor, Latham & Hazen 
(1988) found only one spectroscopic binary in 110 giant stars in M3. How- 
ever, these studies will not detect any of the following: binaries with periods 
shorter than ~ 2 months or longer than ~ 10 years (since data have not been 
obtained which sample spectral variations on the appropriate timescale); bi- 
naries where the components have very unequal masses (since the Doppler 
motions of the brighter, more massive star decrease with increasing mass 
difference); and binaries with very eccentric orbits (since these stars spend 
most of their time at large separations moving at undetectably-low veloc- 
ities). Once corrections have been made for these large numbers of unde- 
tectable binaries, the inferred fraction of binary stars in globular clusters is 
almost as large as that seen in the field. 

We can extend the search for binaries to MS stars by careful studies of 
stellar photometry. Photometric surveys of stellar variability have uncovered 
a number of eclipsing binaries (see §3.2.5) in globular clusters. These studies 
preferentially detect very short-period binaries (P < 1d), since only stars 
very close to each other have a reasonable probability of being eclipsed. 
Although the statistics are not yet very good, the fraction of eclipsing binaries 
detected (something like 1 in 1000 stars) is again comparable to what we see 
in the field. 

Binary stars should also show up photometrically in the CM diagram. 
Imagine, for example, that a cluster contains a number of close binary sys- 
tems that are made up from pairs of identical MS stars. Such pairs would 
not be resolved in photometric observations, so we would misidentify the 
binary as a single object with the same color as the MS stars, but brighter 
by a factor of two. A population of such binaries would thus show up on 
the CM diagram as a second sequence brighter than the MS by $2.5 \log 2$
magnitudes. If the stars are not identical pairs, they will not all lie along 
such a single line, but Romani & Weinberg (1991) have demonstrated that 
even unequal-brightness binaries will always produce a maximum along this 
second sequence in the CM diagram. In general, no second sequences are 
discernible in globular clusters, so this method only puts an upper limit on 
the fraction of MS stars in binaries, typically implying that < 3% are in such 
pairs. The clearest detection of a second sequence is in NGC 288, where 4% 
of the stars observed in the vicinity of the MS are offset by ~ 0.75 magnitudes 
on to a second sequence (Bolte 1992). However, the interpretation of such 
measurements is complicated because unrelated optical doubles (which are 
quite common in crowded globular cluster fields) will also lie on the second 
sequence. Further, very unequal-brightness pairings will lie very close to the 
normal MS in the CM diagram, so such binaries will not be detectable by 
this method. 
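
The size of the offset produced by an unresolved pair follows directly from the definition of the magnitude scale. The sketch below is a simplified illustration: it considers only the brightening relative to the primary in a single band and ignores the accompanying shift in color, and the function name and the `flux_ratio` parameter are ours.

```python
import math


def binary_offset_mag(flux_ratio):
    """Magnitudes by which an unresolved pair outshines its primary alone.

    flux_ratio is (secondary flux) / (primary flux), between 0 and 1.
    The combined flux is (1 + flux_ratio) times that of the primary.
    """
    if not 0.0 <= flux_ratio <= 1.0:
        raise ValueError("flux_ratio must lie between 0 and 1")
    return 2.5 * math.log10(1.0 + flux_ratio)


# Identical pairs give the full 2.5 log 2 offset; faint companions
# barely move the system off the single-star MS.
for q in (1.0, 0.5, 0.1, 0.01):
    print(f"flux ratio {q:4.2f}: offset = {binary_offset_mag(q):.3f} mag")
```

The offset falls from about 0.75 magnitudes for identical pairs to about 0.1 magnitudes when the companion contributes a tenth of the primary's flux, which is why very unequal pairings cannot be separated from the normal MS.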


One last feature in the CM diagram that may be related to binary stars 
is the collection of blue stragglers, which are sometimes found on the MS 
beyond the turnoff point (see Figure 6.2). Their location implies that they 
are more massive than the bulk of MS stars, and such massive stars should 
have evolved away from the MS unless they are younger than the rest of the 
cluster. Since we believe that all the stars in the cluster formed in a single 
epoch, such objects are something of a mystery. However, the properties of 
a blue straggler can be understood if it formed relatively recently from the 
merger of two less massive stars. Such mergers will occur most readily when 
the two stars are already bound in a binary; the mass may be transferred 
slowly from one star to the other (McCrea 1964), or the two may collide 
when the binary loses energy through an encounter with a third star (Hills 
& Day 1975). Thus, the presence of blue stragglers provides further indirect 
evidence that globular clusters contain a population of close binary stars. 


6.1.9 Stellar remnants 


White dwarfs As we have seen in §5.1.3, stars end their days as compact 
stellar remnants. Single stars with initial masses M_i ≲ 8 M⊙ evolve into
white dwarfs, so all the single stars in globular clusters for which 8 M⊙ ≳
M_i ≳ 1 M⊙ (the lower limit being set by the requirement that they have had
time to complete their nuclear burning processes) will have become white 
dwarfs. For a typical globular cluster, white dwarfs make up a far-from- 
negligible ~ 10% of the total number of stars (see Problem 6.5). 

Since white dwarfs lack an internal energy source, they cool and dim over 
time, so most of the cluster white dwarfs will be unobservably faint. Even 
the most luminous white dwarfs are very faint when viewed from the distance 
to a typical globular cluster, and direct observations of the white dwarfs in 
crowded cluster fields have proved unattainable from the ground. However, 
the imaging capabilities of HST allow us to observe down to the requisite 
faint limit: Figure 6.17 shows the distinct sequence of white dwarfs in the CM 
diagram of the globular cluster M4. Since white dwarfs fade in luminosity 
quite rapidly, those that we see must all have reached this evolutionary stage 
recently and have formed from stars that lie in a narrow mass range, a little 
above the current turnoff mass. Hence their own masses must lie in a narrow 
range, and, unlike the MS stars also apparent in Figure 6.17, the white dwarfs 
are not stretched out into a sequence by differences in mass; instead they are 
stretched out by differences in ages, with those that have had longest to cool 
lying at the bottom right-hand end of the sequence. 

Figure 6.17 CM diagram of the globular cluster M4 to very faint
limits derived from the HST observations of Richer et al. (1995).
The sequence of cooling white dwarfs is seen as the bluest stars
in the cluster, stretching from U ~ 22 down to U ~ 26. The
scattering of stars between the MS and white dwarf sequence is
due to field-star contamination. [From data kindly provided by
H. Richer]

One possibility raised by observations of the white-dwarf cooling
sequence in a cluster’s CM diagram is that it might be used to measure the
cluster’s distance more accurately. By comparing the location of the
sequence to the absolute magnitudes and colors of nearby white dwarfs with
parallax distances, we can obtain the distance modulus to the cluster just as
in the classical MS fitting technique (§6.1.2). Moreover, the atmospheres of
white dwarfs are virtually metal free, so the metallicity corrections that must
be made for MS fitting can safely be neglected, and the absence of convection
in these objects means that the physics which dictates their structure
is better understood. Finally, there are more local white dwarfs than
subdwarfs, so the calibration of white dwarf absolute magnitudes is much better
constrained than that for the subdwarfs used in MS fitting. Renzini et al.
(1996) have shown that this approach can be used to reduce the uncertainty
in the distance modulus of the cluster NGC 6752 from δ(m − M) ~ 0.25 to
δ(m − M) ~ 0.1.
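
To see what such an improvement means for the distance itself, the following sketch converts a distance modulus into a distance and propagates its uncertainty to first order. The modulus of 13.0 used in the example is an arbitrary placeholder, not the measured value for NGC 6752.

```python
import math


def distance_pc(distance_modulus):
    """Distance in parsecs from m - M = 5 log10(d / 10 pc)."""
    return 10.0 ** (distance_modulus / 5.0 + 1.0)


def fractional_distance_error(delta_modulus):
    """First-order fractional distance error: (ln 10 / 5) * delta(m - M)."""
    return math.log(10.0) / 5.0 * delta_modulus


example_modulus = 13.0  # placeholder value for illustration only
print(f"d = {distance_pc(example_modulus):.0f} pc")
for delta in (0.25, 0.1):
    print(f"delta(m - M) = {delta:4.2f}  ->  "
          f"fractional distance error ~ {100 * fractional_distance_error(delta):.1f}%")
```

An uncertainty of 0.25 magnitudes corresponds to roughly 12% in distance, while 0.1 magnitudes corresponds to less than 5%.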

Neutron stars Stars with initial masses M_i ≳ 8 M⊙ will have long-since
passed through their nuclear burning lifetimes and become compact rem- 
nants, the vast majority of which will be neutron stars. Such old remnants 
will have cooled and dimmed to the point of unobservability. However, as 
we have seen in §6.1.8, some fraction of globular cluster stars live in bina-
ries, and interactions between neutron stars in binaries and their companions
can render the neutron star luminous: if the components of the binary are
close enough, material is transferred from the companion star to the neutron 
star, generating X-rays in the process. It is therefore noteworthy that glob- 
ular clusters contain 12 of the 100 brightest X-ray sources in the Milky Way 
(while they only contain ~ 10⁻⁴ of the optical luminosity of the Galaxy).
Those X-ray sources which have been optically identified have been shown 
to be spectroscopic binary systems, while many of the others show periodic 
variations which strongly suggest that they lie in binary systems [see Hut et
al. (1992) for a review of the evidence]. The observations thus imply that 
these X-ray sources are binary stars in which one of the components is an 
accreting neutron star. 

In such a binary system, the companion transfers angular momentum 
as well as mass to the neutron star. This transfer increases the rate at which 
the neutron star spins. Rapidly-spinning neutron stars have been detected 
from their radio emission, which is modulated by the rotation of the neutron 
star with periods as short as ~ 2ms. More than 30 of these millisecond 
pulsars have now been found in globular clusters, with 11 in 47 Tuc alone 
(Robinson et al. 1995). Doppler shifts in the arrival time of their pulses have 
shown that more than a third of these pulsars are in binary systems, as we 
would expect if accretion is responsible for their rapid spins. 

One serious problem with this scenario of X-ray binaries evolving into
millisecond pulsars is that there are estimated to be ~ 100 times as many
millisecond pulsars as X-ray binaries in globular clusters.⁶ A possible
explanation for this imbalance is that each system spends 100 times longer
as a pulsar than as an X-ray binary, but theory indicates that the lifetimes
of both stages should be ~ 1 Gyr (Bhattacharya & van den Heuvel 1991).
Alternatively, some entirely different mechanism may be generating most of
the millisecond pulsars: for example, Michel (1987) has suggested that white
dwarfs in binary systems could accrete sufficient mass and angular momentum
to turn into millisecond pulsars without passing through a lengthy X-ray
binary stage.

⁶ We have detected essentially all the ~ 10 X-ray binaries in Milky Way globular
clusters, but only a few percent of the ~ 10³ millisecond pulsars are detectable because
most do not beam their emission towards us.


6.1.10 Radial profiles


Having discussed the properties of the stars that make up a globular cluster,
we now turn to their arrangement within the cluster. Since these systems
appear remarkably round, the distribution of light in a cluster is almost
completely specified by the radial brightness profile. Since we can resolve
the stars in nearby globular clusters, there are two distinct approaches that
can be used to measure their brightness profiles: either we can measure the
brightnesses and locations of individual stars and use these data to calculate
the number density of stars as a function of radius, or we can measure the
integrated starlight as a function of radius using the techniques of integrated
surface photometry that we discussed in the context of measuring galaxy
brightness profiles (§4.2).

The Milky Way’s globular clusters are sufficiently close that in their 
outer parts we can easily resolve their individual stars with large amounts 
of empty space in between cluster members. It is thus very inefficient to 
measure their outer brightness profiles using integrated starlight since most 
of the area over which one integrates contains no contribution from the clus- 
ter stars, but the whole area contributes towards the noise in the data via 
the intrinsic brightness of the night sky, unrelated background galaxies, etc. 
Thus counting individual stars is the method of choice for calculating clus- 
ters’ outer brightness profiles. The only sources of noise in these data then 
arise from the Poisson statistics from the finite number of stars observed and 
confusion from Galactic stars along the line of sight to the cluster. The con- 
tribution from unrelated Galactic stars can be estimated by observing other 
nearby fields, but it is also possible to use the colors (§6.1.2) and velocities 
(§6.1.11) of individual stars to assess their membership of a cluster and thus 
reduce contamination from non-members to negligible proportions. 
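
The star-count procedure described above can be sketched as follows. This is a minimal illustration rather than the method of any particular study: the function name, the annulus edges and the background density are all hypothetical, and the background is assumed to have been measured in a nearby comparison field.

```python
import math


def surface_density_profile(radii, edges, background_density):
    """Background-subtracted surface density of stars in circular annuli.

    radii: projected radii of the counted stars.
    edges: increasing annulus boundaries in the same units.
    background_density: field stars per unit area (assumed known).
    Returns a list of (mid_radius, density, poisson_error).
    """
    profile = []
    for r_in, r_out in zip(edges[:-1], edges[1:]):
        n = sum(1 for r in radii if r_in <= r < r_out)
        area = math.pi * (r_out ** 2 - r_in ** 2)
        density = n / area - background_density
        error = math.sqrt(n) / area  # Poisson error on the raw count only
        profile.append((0.5 * (r_in + r_out), density, error))
    return profile


# Hypothetical projected radii (arcmin) of counted stars.
stars = [0.3, 0.5, 0.6, 0.9, 1.2, 1.4, 1.9, 2.6, 3.5, 4.8]
for r_mid, dens, err in surface_density_profile(stars, [0, 1, 2, 3, 5], 0.02):
    print(f"R = {r_mid:4.2f}  density = {dens:6.3f} +/- {err:5.3f} per arcmin^2")
```

The error term here includes only the counting statistics of the cluster field; a fuller treatment would also propagate the uncertainty in the background estimate.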

At small radii in a cluster, the density of stars can become very large. 
It then becomes difficult to resolve individual stars, so the surface-brightness 
profile is most straightforwardly calculated using the integrated light of the 
cluster. Fortunately, the relative contribution from other sources along the 
line of sight is generally negligible in the bright central region of the cluster. 
The surface brightness profile derived by this method can then be normalized 
to the star counts in an intermediate region of overlap, giving the brightness 
profile at all radii. This approach has been somewhat superseded with the 
advent of the Hubble Space Telescope, which has made it possible to re- 
solve individual stars all the way into the very cores of clusters. These new 
data also reduce the degradation of the very centrally-concentrated cluster 
brightness profiles which occurs due to ground-based seeing (§4.2.2). Even 
with the resolving power of HST, however, there is still significant confusion 
in the cluster core, as the high surface density of stars means that adjacent 
stars will sometimes fail to be resolved, and hence will be misidentified as 
single sources. The number counts must then be corrected for the fraction 
of sources which have been missed, as discussed in §6.1.1. 

It is also worth noting that the finite number of stars in a globular 
cluster imposes a fundamental limitation in the calculation of its brightness 
profile. At large radii, the low density of stars implies that the surface 
brightness can vary significantly just from the Poisson statistics unless we 
average over a significant range in radii. Even at small radii, the light is 
dominated by relatively few bright giant stars, so the concept of a smooth 
symmetric distribution of light is an oversimplification: there is, for example, 
no reason why the brightest point in the cluster should coincide exactly with 
the centroid of the positions of all the stars, and so the “exact” center of
the cluster is not a well-defined quantity. It makes no sense to analyze the
radial brightness profile on scales smaller than that set by this fundamental
uncertainty, which, even in the most carefully-studied systems, may still be
as large as an arcsecond (Picard & Johnston 1994).

Figure 6.18 Plot of surface brightness, μ, versus projected radius
for the globular cluster M3. The profile at small radii (solid
symbols) is derived from integrated photometry, while that at large
radii (crosses) is derived from star counts. The dashed line shows
the King model which best fits these data, and the solid line shows
a more sophisticated multi-mass King model fit. [After Da Costa
& Freeman (1976)]


Large-scale properties Figure 6.18 shows the complete brightness profile 
of the nearby cluster M3 derived by combining star counts at large radii with 
integrated photometry at small radii. The profile extends over 5 orders of 
magnitude in surface brightness. This figure also illustrates the use of such 
data in trying to understand the dynamical properties of globular clusters. 
The dashed line shows the best fit that can be obtained by fitting a King 
model to the observed brightness profile. As discussed in BT §4.4, the King 
model makes the physically-reasonable assumption that the most energetic 
stars in the cluster, which venture to large distances from its center, will have 
been removed entirely by the tidal influence of the Milky Way. These losses 
result in a cluster which is truncated at a finite tidal radius, r_t. The only
other variable parameters in this model are the central surface brightness
of the cluster and the core radius, r_c, which is defined to be the radius
at which the surface brightness has fallen to half of its central value.⁷ The
concentration c of the cluster, defined by c = log(r_t/r_c), then provides a
useful distance-independent measure of the cluster’s morphology with a more 
physical interpretation than the concentration class. As can be seen from 
Figure 6.18, although the simple King model predicts some of the properties 
that we see in the surface-brightness profile of M3, the best fit is far from 
perfect; it is only because we are able to probe the surface density of the 
cluster over such a large dynamic range that the shortcomings of this simple 
model are apparent. 
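
The role of the three parameters can be illustrated with a simple stand-in. The sketch below uses the empirical King (1962) fitting formula, which is not the dynamical model of BT §4.4 but has the same ingredients: a central normalization, a core radius and a tidal radius. In this formula the parameter r_c is only approximately the half-brightness radius, and the numerical values chosen are hypothetical rather than fitted to M3.

```python
import math


def king_concentration(r_c, r_t):
    """Concentration c = log10(r_t / r_c)."""
    return math.log10(r_t / r_c)


def king_1962_profile(R, r_c, r_t, k=1.0):
    """Empirical King (1962) surface-brightness law; zero at and beyond r_t."""
    if R >= r_t:
        return 0.0
    inner = 1.0 / math.sqrt(1.0 + (R / r_c) ** 2)
    edge = 1.0 / math.sqrt(1.0 + (r_t / r_c) ** 2)
    return k * (inner - edge) ** 2


r_c, r_t = 0.5, 30.0  # hypothetical core and tidal radii in arcmin
print(f"c = {king_concentration(r_c, r_t):.2f}")
center = king_1962_profile(0.0, r_c, r_t)
for R in (0.1, 0.5, 2.0, 10.0, 29.0):
    ratio = king_1962_profile(R, r_c, r_t) / center
    print(f"R = {R:5.1f}  I/I(0) = {ratio:.3e}")
```

The profile is flat inside r_c, falls roughly as a power law at intermediate radii, and cuts off sharply as R approaches r_t.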

One shortcoming of the simple King model is that it assumes that all the 
stars in the cluster have identical masses. As we have seen in §6.1.7, stars in 
globular clusters have a range of masses. It is most unlikely that stars of
different masses will have identical kinematics: globular clusters are sufficiently
old that encounters between stars of different masses will have significantly 
re-distributed their initial energies, driving them towards equipartition. On 
average, we therefore expect high-mass stars to be traveling more slowly 
than low-mass stars, so that their kinetic energies are approximately equal 
(BT §8.4.3). The solid line in Figure 6.18 shows the results of fitting a model 
which allows for the different kinematics of stars of different masses. This 
multi-mass King model divides the stellar population of the cluster into 
a number of different mass classes. Each class has a central velocity disper- 
sion chosen such that at the center all stars are in energy equipartition. The 
fraction of stars in each class is determined by the observed distribution of 
stellar luminosities, with unobserved faint white dwarfs and neutron stars 
(§6.1.9) providing extra contributors to the mass of the system (but not its 
light). A dynamical model can then be generated which defines the kine- 
matics of the individual classes in the gravitational potential arising from 
the mass distribution of all the stars. Adding the luminosity contributions 
from the different classes of star then gives the total surface-brightness profile 
indicated in Figure 6.18, which provides an excellent fit to the data. 
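
The equipartition condition that sets the central dispersions of the mass classes is easy to make concrete. The sketch below assumes only that m σ² is the same for every class; the mass classes and the reference dispersion are illustrative choices, not values from a published fit.

```python
import math


def equipartition_dispersions(masses, reference_mass, reference_sigma):
    """Dispersions sigma_i such that m_i * sigma_i**2 is equal for all classes."""
    return [reference_sigma * math.sqrt(reference_mass / m) for m in masses]


# Illustrative mass classes in Msun: faint MS stars up to neutron stars.
masses = [0.2, 0.5, 0.8, 1.4]
sigmas = equipartition_dispersions(masses, reference_mass=0.8, reference_sigma=5.0)
for m, s in zip(masses, sigmas):
    print(f"m = {m:3.1f} Msun  ->  central sigma = {s:5.2f} km/s")
```

The heaviest class moves most slowly and so sinks toward the center, while the lightest class is the most extended.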

Given the large number of parameters that define a multi-mass King 
model, it is not surprising that a model of this class provides a good fit to 
the photometry of M3. Other models are also possible: the assumption of 
full equipartition of energy between all stellar components can be relaxed, or 
we could consider more complex arrangements of stellar orbits. In fact, many 
different models can reproduce the observed brightness profile of a cluster, 
and it is only by examining other properties of the cluster such as its stellar 
kinematics (§6.1.11) that we can begin to distinguish between these various
globular cluster models.

⁷ A natural parameter of a King model is the King radius, r_0, that is defined by BT
equation (4-124b). In the limit of large r_t/r_c, the surface brightness at r_0 has fallen to
0.5013 of its central value, so r_0 ≈ r_c. In view of this result, r_0 is frequently called the
core radius. This practice is misleading, however, because r_c is significantly smaller than
r_0 when r_t/r_c is not very large (see Problem 6.6).


Luminosity segregation One conceptually-simple test of models of clus- 
ter evolution is to search for luminosity segregation. If equipartition of energy 
is important, as multi-mass King models assume, slower-moving more mas- 
sive stars should be more concentrated toward the center of the cluster than 
the less massive stars. Observational support for this idea comes from the 
distributions of X-ray sources and blue stragglers in clusters. These objects 
are associated with binary systems (§6.1.8), which are more massive than 
single stars. Hence the finding that they are more centrally concentrated 
than normal stars (Grindlay et al. 1984, Nemec & Harris 1987) constitutes 
clear evidence of mass segregation. 

We would also expect mass segregation to show up as a variation in the 
normal stellar luminosity function with radius. Bright, massive stars will 
make up a smaller fraction of the total stellar population at large radii than 
they do close to the center of the cluster, so the observed luminosity func- 
tion should become increasingly steep with radius. In practice, this effect 
has proved difficult to detect. Unless observations are obtained over a wide 
range of radii, the variations in the luminosity function are rather small. Un- 
fortunately, data obtained over a wide range in radii are subject to a variety 
of systematic errors: the contribution from possible background stellar con- 
tamination becomes large at large radii where the number density of cluster 
members is low; and the effects of stellar crowding become acute at small 
radii where the density is high. We therefore have to be certain that these 
systematic effects are well-understood before an unequivocal measurement 
of luminosity segregation can be made. 

Figure 6.19 shows the luminosity function of the cluster M30 derived 
at a number of radii. These data imply that the fraction of faint stars in 
this cluster increases significantly at large radii. Unfortunately, these ground-
based data may have been compromised by the crowding near the center. For
the innermost field presented in Figure 6.19, the star counts at the faintest
magnitudes have been corrected by more than a factor of three; this inner
field is so crowded that more than two-thirds of the abundant faint stars will
have been missed due to their proximity to other stars. The credibility of the
apparent difference between the luminosity functions shown in Figure 6.19
is therefore dependent on the reliability with which these large correction
factors can be determined.

Figure 6.20 The luminosity function of the globular cluster 47 Tuc
derived from HST data within the central 4 arcsec of the cluster.
The line indicates the luminosity function for faint stars at large
radii derived from ground-based data by Hesser et al. (1987). [After
Paresce, de Marchi & Jedrzejewski (1995)]
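
The crowding correction applied to ground-based counts such as those for M30 amounts to dividing the raw counts in each magnitude bin by the fraction of stars that is recovered. The sketch below shows the arithmetic; the counts and completeness fractions are hypothetical, and the completeness is assumed to be known (for example from artificial-star experiments).

```python
def corrected_counts(observed, completeness):
    """Correct raw star counts per magnitude bin for crowding losses.

    completeness[i] is the fraction of real stars in bin i that are recovered.
    """
    corrected = []
    for n_obs, frac in zip(observed, completeness):
        if not 0.0 < frac <= 1.0:
            raise ValueError("completeness must lie in (0, 1]")
        corrected.append(n_obs / frac)
    return corrected


# Hypothetical counts in three magnitude bins, brightest first.
observed = [120, 90, 40]
completeness = [0.95, 0.70, 0.30]
for n_obs, n_true in zip(observed, corrected_counts(observed, completeness)):
    print(f"observed {n_obs:4d}  ->  corrected {n_true:6.1f}  "
          f"(factor {n_true / n_obs:.2f})")
```

A recovered fraction below one-third corresponds to a correction factor above three, so any error in the estimated completeness is amplified in the final luminosity function.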

Figure 6.20 shows the luminosity function derived from HST observa- 
tions of the innermost four arcseconds of 47 Tuc. Even at these small radii 
and at the faintest magnitudes detected, the post-repair HST images are 
so sharp that fewer than 10% of the stars are lost due to crowding, so the 
corrections to the luminosity function are small. This figure also shows the 
luminosity function derived from ground-based data at large distances from 
the center of the cluster, where the effects of crowding are similarly small. 
Clearly, with the small potential for systematic uncertainties due to crowd- 
ing in these data, the difference between the luminosity functions at small 
and large radii in this system must be real, and luminosity segregation is 
occurring. 

One consequence of the variation with radius in globular-cluster
luminosity functions is that we must be rather careful when comparing luminosity
functions which have been derived from different data sets. If these analyses
are based on observations made at different radii in the clusters, then even
intrinsically identical globular clusters will appear to have different
luminosity functions. Since crowding becomes a more serious problem in systems
at greater distances, luminosity functions of more distant clusters are likely
to be derived from stars at larger radii than those used in the analysis of
nearer clusters. Hence, if we are not careful we will end up with a
distance-dependent bias in the luminosity function. If we adopt a particular dynamical
model for each cluster, then the effects of mass segregation can be estimated,
and the observed luminosity function at a particular radius can be corrected
to give the global luminosity function for the cluster. However, the accuracy
of luminosity functions derived in this way depends on the validity of the
underlying dynamical models.

Figure 6.21 Surface-brightness profiles of globular clusters
contrasting those with flat cores (left panel) to those with central cusps
(right panel). The identifications of the clusters and photometric
bands of the data are as indicated. [After Djorgovski (1988)
courtesy of S. Djorgovski]


Central cusps A generic property of the King-type models discussed 
above is that inside the core radius, r_c, the surface density of stars tends
towards a constant value. Many clusters do display such flat cores in their 
surface-brightness profiles, but there are also exceptions. In the case of M15, 
for example, the surface brightness continues to rise all the way to its center. 
When we factor in the effects of seeing, which will tend to smear out any 
bright central cusp in the brightness distribution (§4.2.2), it is apparent that 
M15 must have a remarkably sharp central peak in its brightness profile.

Figure 6.22 The surface density of stars brighter than m_V = 19 as a
function of projected radius in the globular cluster M15, as derived from
an HST image (Guhathakurta et al. 1996). The dotted line shows a
surface-brightness profile with a power law of index −0.75, as would be
expected if the cluster ...

The central cusp in M15 has been recognized for quite some time (e.g.
King 1975), but it was treated as something of an oddity amongst globular 
clusters. However, the rarity of such central cusps simply reflected the fact 
that no-one had made any systematic search for the phenomenon. Interest 
in these systems began to grow in the early 1980s when a number of nu- 
merical studies of spherical self-gravitating bodies showed that, even if such 
systems initially contain a flat core, the central density increases dramati- 
cally over time, ultimately producing a central power-law cusp in the radial 
profile. This phenomenon of core collapse, discussed in BT §8.4.2, occurs 
generically in any stellar system that is old enough for two-body encounters 
to have redistributed the energy of the system between stars. This runaway 
process is only halted when binary stars at the center of the cluster provide 
a source of energy to halt the collapse (BT §8.4.6). Since globular clusters
are all believed to be old systems, it is hard to see how they could have 
avoided collapsing to form central cusps similar to that in M15. Djorgovski 
& King (1986) therefore undertook a systematic search for collapsed cores by 
obtaining CCD photometry for a sample of 113 globular clusters, and found 
that clusters with central cusps are not all that unusual (see Figure 6.21). 
In fact, they discovered that around 20% of the clusters in their sample have 
surface-brightness profiles which cannot be described adequately by a model 
containing a flat central region, suggesting that these systems are examples 
of core-collapsed clusters. However, all the clusters they observed were old 
enough to have undergone core collapse, so the absence of cusps at the cen- 
ters of 80% of the systems implies that there must be an efficient dynamical 
mechanism for re-inflating collapsed cores. 

The repaired HST has enabled us to investigate the central regions of 
clusters in unprecedented detail. Figure 6.22 shows the surface density of 
stars brighter than m_V = 19 as derived from an HST image of M15. Although
the steepness of the logarithmic gradient clearly drops as we look to smaller
and smaller radii in this system, there is no evidence that it flattens out
entirely even on the small scales probed by these data. The largest core
radius consistent with this observation is only r_c ~ 1 arcsec. Moreover, the
location of the center has an inherent uncertainty of ~ 0.3 arcsec on account 
of the finite number of stars in the system. Hence there is little chance of 
there being even a very small homogeneous core in M15. 

It is worth noting that core collapse is not the only possible mechanism 
for generating central cusps in the surface brightness profile. One may show 
that if a globular cluster harbors a massive black hole in its core, the black 
hole will gather a cusp of stars around it (Bahcall & Wolf 1976). In fact, 
stars in such a system will produce a surface-brightness profile at small radii 
which follows the power law, I(R) ∝ R^(−0.75); as Figure 6.22 shows, such a
power law matches the observed surface-brightness profile of M15 out to 
several arcseconds remarkably well. However, such a power-law slope is also 
quite consistent with the predictions of core collapse models. As with the 
larger-scale properties of globular clusters, we need more information than 
just the observed surface brightness in order to discriminate between these 
different explanations for central cusps. For example, we could measure the 
stellar kinematics of this central region. A central black hole would reveal 
its presence by an increase in the random velocities of stars in its immediate 
vicinity, whereas no such increase would be apparent if the central cusp were 
the result of core collapse. 
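
Comparing an observed cusp with a predicted power law comes down to measuring the logarithmic slope of the profile. A minimal sketch of such a measurement is given below, using an unweighted least-squares fit in log-log space. The data are synthetic and generated from an exact power law purely to show the procedure; real star counts would need Poisson weights and a treatment of crowding.

```python
import math


def fit_power_law(radii, densities):
    """Unweighted least-squares fit of log10(density) against log10(radius).

    Returns (slope, log10_normalization).
    """
    x = [math.log10(r) for r in radii]
    y = [math.log10(d) for d in densities]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Synthetic profile following R^(-0.75) exactly (illustration only).
radii = [0.3, 0.5, 1.0, 2.0, 4.0, 8.0]
densities = [50.0 * r ** -0.75 for r in radii]
slope, log_norm = fit_power_law(radii, densities)
print(f"fitted slope = {slope:.3f}, normalization = {10 ** log_norm:.1f}")
```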


6.1.11 Kinematics 


Velocities of individual stars Since we can resolve individual stars 
in globular clusters, their individual line-of-sight velocities can be measured 
from the Doppler shifts in their spectral lines. As we shall see, the velocities of 
stars in a globular cluster typically differ by only ~ 5 km s⁻¹, so one requires
optical spectra of sufficiently high quality to permit the measurement of
shifts in spectral lines of ≲ 0.01 nm. Only giant stars are bright enough for
spectra of the required quality to be obtained, so only these objects can be 
used as kinematic tracers of the dynamics of globular clusters. Furthermore, 
large numbers of velocities are necessary: in a pioneering study, Gunn & 
Griffin (1979) obtained spectra of 111 giant stars in M3, and one of their 
principal conclusions was that this sample was too small to discriminate 
between many different dynamical models. Subsequently, observers obtained 
spectra for bigger samples of stars in a number of globular clusters. One of 
the largest such surveys consisted of line-of-sight velocities for 318 stars in 
ω Cen (Meylan & Mayor 1986). These data were derived by laboriously
obtaining spectra for each star in the sample, with repeated observations 
made to identify the stars that are in binary systems — the velocities of 
such stars are confused by the orbital motion of the binary. These projects 
therefore required large investments of telescope time. The development of
multi-object spectrographs (in which the light from many stars is channeled
by fiber optic cables to the spectrograph) made it possible to determine
globular cluster kinematics much more efficiently: Côté et al. (1995) have
made use of such a spectrograph to measure the velocities of 399 stars in 
NGC 3201, and even larger samples lie within reach. 
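
The spectral precision quoted above follows directly from the non-relativistic Doppler formula, Δλ = λ v/c. The sketch below evaluates it for an assumed rest wavelength of 500 nm, chosen simply as a representative optical wavelength.

```python
C_KM_S = 299792.458  # speed of light in km/s


def doppler_shift_nm(rest_wavelength_nm, velocity_km_s):
    """Non-relativistic Doppler shift: delta_lambda = lambda * v / c."""
    return rest_wavelength_nm * velocity_km_s / C_KM_S


rest_wavelength = 500.0  # nm, assumed representative optical line
for v in (1.0, 5.0, 15.0):
    shift = doppler_shift_nm(rest_wavelength, v)
    print(f"v = {v:4.1f} km/s  ->  shift = {shift:.4f} nm")
```

A velocity difference of 5 km s⁻¹ shifts a line at 500 nm by a little under 0.01 nm, in line with the requirement stated in the text.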

The simplest issue we can address with such data is whether globular 
clusters are rotationally flattened. As we shall see in §11.2.1, rotational 
motion in a stellar system tends to flatten it. In the case of globular clusters, 
the degree of flattening is usually rather small, so the amount of rotation 
required is also minimal. For typical cluster ellipticities of ε < 0.1, the
expected ratio of rotational to random motion is v/σ < 0.3. In general, the
observed kinematics and ellipticities of globular clusters are quite consistent 
with their being rotationally-flattened systems, although the rotation rates 
and ellipticities of these systems are sufficiently small that they are subject 
to significant observational errors. 
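
The connection between ellipticity and rotation can be illustrated with a commonly used approximation for an oblate, isotropic rotator seen edge-on, v/σ ≈ [ε/(1 − ε)]^(1/2). This formula is an assumption brought in for illustration; it is not derived in this section, and the detailed treatment is deferred to §11.2.1.

```python
import math


def v_over_sigma_isotropic(ellipticity):
    """Approximate v/sigma for an edge-on oblate isotropic rotator."""
    if not 0.0 <= ellipticity < 1.0:
        raise ValueError("ellipticity must lie in [0, 1)")
    return math.sqrt(ellipticity / (1.0 - ellipticity))


for eps in (0.02, 0.05, 0.1):
    print(f"ellipticity = {eps:4.2f}  ->  v/sigma ~ {v_over_sigma_isotropic(eps):.2f}")
```

For an ellipticity of 0.1 this approximation gives a ratio of about 0.33, so the nearly round shapes of clusters require only modest rotation.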

The small rotation rates and circular shapes of globular clusters mean 
that we can treat them as spherical systems to a good approximation. Kinematic
observations therefore provide a data set that consists of the line-of-sight
velocity relative to the cluster mean, v_los, and projected separation
from the cluster center, R, for each of the giant stars observed. As an illus- 
tration of such a data set, Figure 6.23 shows the line-of-sight velocities that 
Peterson, Seitzer & Cudworth (1989) derived from spectra of 120 stars in 
M15 as a function of their projected distance from the cluster center. It is 
clear from this figure that the amplitude of random stellar velocities drops 
with radius: calculations of the velocity dispersion in these data show that it 
decreases from ~ 15 km s⁻¹ at small radii to ~ 5 km s⁻¹ in the outer parts of
the cluster. Such a decrease in dispersion with radius is a generic property 
of any stellar system of finite extent which is populated by stars on isotropic 
orbits (like the King model discussed above). To see that this is the case, 
imagine a cluster wholly contained within a radius r_t. Any star currently
lying at r_t must have a radial velocity of zero, since any non-zero value would
imply that the star follows an orbit which would take it beyond r_t. By
definition, the velocity distribution of stars at any point in an isotropic system
is equal in the tangential and radial directions, so the tangential velocities of 
stars in the outermost part of the cluster must also tend to zero. Thus, the 
observed line-of-sight velocities must drop toward zero as we near the edge 
of the cluster. 
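
The dispersion profile referred to above can be estimated from a list of projected radii and line-of-sight velocities by binning in radius. The sketch below is a minimal version of that calculation: it ignores measurement errors and binary contamination, and the data are invented for illustration rather than taken from Peterson, Seitzer & Cudworth (1989).

```python
import math


def dispersion_profile(radii, velocities, edges):
    """Line-of-sight velocity dispersion in bins of projected radius.

    Velocities are measured relative to the sample mean. Returns a list of
    (mid_radius, sigma, n_stars) for bins containing at least two stars.
    """
    mean_v = sum(velocities) / len(velocities)  # estimate of the cluster mean
    profile = []
    for r_in, r_out in zip(edges[:-1], edges[1:]):
        dv = [v - mean_v for r, v in zip(radii, velocities) if r_in <= r < r_out]
        if len(dv) < 2:
            continue
        sigma = math.sqrt(sum(d * d for d in dv) / len(dv))
        profile.append((0.5 * (r_in + r_out), sigma, len(dv)))
    return profile


# Invented sample: projected radii (arcmin) and velocities (km/s).
radii = [0.2, 0.4, 0.5, 0.8, 1.5, 2.2, 2.8, 4.0, 5.5, 6.5]
velocities = [-14.0, 12.0, 9.0, -11.0, 7.0, -6.0, 5.0, -4.0, 3.0, -1.0]
for r_mid, sigma, n in dispersion_profile(radii, velocities, [0, 1, 3, 7]):
    print(f"R = {r_mid:3.1f}  sigma = {sigma:5.2f} km/s  (n = {n})")
```

With only a few stars per bin the statistical uncertainty on each dispersion is large, which is the practical reason that samples of hundreds of velocities are needed.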

Figure 6.23 Line-of-sight velocities of giant stars in M15 as a
function of their projected radii. [From data published in Peterson,
Seitzer & Cudworth (1989)]

This description provides a qualitative explanation for the drop in
dispersion with radius, but it often fails to match the observed kinematics in
detail. In particular, the multi-mass King models that fit the photometric
profiles of clusters generally predict a slower drop in the line-of-sight velocity
dispersion with radius than is measured. This difference can be explained if
the orbits in globular clusters are not isotropic. At large radii in a spherical
system, the radial component of a star’s velocity is mostly transverse to the
line of sight, so its line-of-sight velocity is primarily made up from the star’s
tangential motion (see BT Figure 4-4). Thus, the observed rapid drop in the
line-of-sight velocity dispersion can be explained if the stellar orbits at large
radii are primarily radial with little tangential motion. Such an arrangement
of orbits has some physical motivation: interactions between stars in the core
place numbers of stars on highly eccentric orbits that reach from the core
to large radii in the cluster. Such an eccentric orbit can be circularized only
by an encounter with a star at a large radius; since the stellar density at
large radii is small, such a circularizing encounter is improbable. Hence at
large radii we now see an accumulated population of stars on deeply plunging
orbits whose radial velocities generally exceed their tangential ones.

A generalization of the King model which allows for radial anisotropy in
the orbits at large radii is provided by the Michie model (BT §4.4.4). This
model adds a new parameter, the anisotropy radius $r_a$, a kinematic
lengthscale which supplements the photometric lengthscales, $r_c$ and $r_t$. Within $r_a$,
the distribution of orbits is close to isotropic, but beyond $r_a$ the orbits rapidly
become radial. As for the isotropic King model, the Michie model can be
generalized for systems with more than one stellar species as a multi-mass
Michie model, which has been successfully applied to the modeling of both
the photometry and kinematics of globular clusters. The radius at which
anisotropy becomes significant varies from cluster to cluster: in the case of
ω Cen, for example, Meylan (1987) has shown that only models with $r_a \lesssim 3r_c$
can reproduce the observed photometry and kinematics satisfactorily, while
Côté et al. (1995) have found that NGC 3201 can be adequately described
by a model in which the stellar orbits remain isotropic all the way to the
edge of the cluster ($r_a \gtrsim r_t$).

Although the generalized Michie models provide a good fit to the ob- 
served kinematics of globular clusters, these models are by no means unique. 
Even with the extra constraints provided by the line-of-sight velocities of 
stars, there is still a wide range of models which fit the observations. As 
with the kinematics of galaxies (§11.2.1), there is a trade-off between the 
distribution of stellar orbits and the form of the gravitational potential which 
enables us to reproduce the observed properties of a globular cluster with 
many different dynamical models. 

Even though the gravitational potential derived from such dynamical
studies is model-dependent, it is still often used to determine an estimate
for the mass of the cluster, and hence the global mass-to-light ratio for the
system, $\Upsilon = (M/M_\odot)/(L/L_\odot)$. In a study of 56 globular clusters, Pryor
& Meylan (1993) found that globular clusters have a mean
mass-to-V-band-light ratio of $\Upsilon_V = 2.3$. The calculated mass-to-light ratio varies significantly
from cluster to cluster, and the dispersion in the values of $\Upsilon_V$ in this sample
is 1.1. There are few clues as to why the mass-to-light ratio varies between
clusters: its value does not correlate with the metallicity of the cluster, its
location in the Galaxy, or its degree of central concentration. Pryor & Meylan
(1993) did find a significant positive correlation between $\Upsilon_V$ and the total
mass of the cluster, but since the mass is used to calculate $\Upsilon_V$, random errors
in the calculated mass would naturally give rise to such a correlation, so the
reality of the effect remains in doubt.
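The last point, that random errors in the mass can by themselves produce a
correlation between the mass-to-light ratio and the mass, can be illustrated
with a small simulation. The sketch below is not the analysis of Pryor &
Meylan (1993): the luminosity range, the 0.2 dex error on the mass, and all
function names are assumptions chosen purely for illustration. Every simulated
cluster is given the same true mass-to-light ratio, so any correlation that
emerges is spurious.

```python
import math
import random


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def spurious_correlation(n_clusters, true_upsilon=2.3, mass_err_dex=0.2, seed=1):
    """Correlation between log(observed mass) and log(observed mass-to-light ratio).

    All clusters share the SAME true mass-to-light ratio; only the mass
    estimate carries a random error (assumed Gaussian in log mass).
    """
    rng = random.Random(seed)
    log_mass, log_upsilon = [], []
    for _ in range(n_clusters):
        log_lum = rng.uniform(4.0, 6.0)       # assumed range of log(L/Lsun)
        err = rng.gauss(0.0, mass_err_dex)    # random error on log(mass)
        log_m_obs = math.log10(true_upsilon) + log_lum + err
        log_mass.append(log_m_obs)
        log_upsilon.append(log_m_obs - log_lum)  # ratio computed from M_obs
    return pearson_r(log_mass, log_upsilon)


# A large sample is used here only to suppress sampling noise.
r = spurious_correlation(n_clusters=5000)
print(f"correlation produced by mass errors alone: r = {r:.2f}")
```

Under these assumptions the expected coefficient is the error dispersion
divided by the total dispersion in log mass, about 0.33, even though the true
mass-to-light ratio is identical for every cluster.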


Integrated-light kinematics At small radii in a cluster, crowding can 
make it impossible to obtain the spectra of individual stars. This problem is 
particularly acute in clusters with central density cusps, which is unfortunate, 
since it is for these systems that the central kinematics offers an important 
observational constraint on the nature of the central cusp (§6.1.10). For such 
clusters, we must resort to the integrated light spectral techniques that are 
used to study the stellar kinematics of galaxies (§11.1). Peterson, Seitzer 
& Cudworth (1989) obtained spectra of the central arcsecond of M15, the 
archetypal cusped cluster, and used cross-correlation analysis to obtain a 
velocity dispersion of $\sim 25\,{\rm km\,s^{-1}}$ for the stars in this region. This velocity
dispersion appears to be significantly larger than the $\sim 15\,{\rm km\,s^{-1}}$ measured
from individual stars slightly further out in the cluster (see Figure 6.23).
Peterson, Seitzer & Cudworth (1989) have suggested that both the luminosity 
cusp and the high central velocity dispersion could result from the presence 
of a central black hole of mass $\sim 10^3\,M_\odot$. However, as Dubath, Meylan &
Mayor (1994) have pointed out, the HST images of the core of M15 have 
shown that the central arcsecond is dominated by the light of just three
giant stars. The observed central velocity dispersion is thus at the mercy
of small number statistics. If the three stars were drawn from a population
with an intrinsic velocity dispersion of $15\,{\rm km\,s^{-1}}$, their measured velocity
dispersion could plausibly lie anywhere between $\sim 5\,{\rm km\,s^{-1}}$ and $\sim 25\,{\rm km\,s^{-1}}$
(see Problem 6.7). Thus, the large value for the central velocity dispersion
in M15 as derived from the integrated light is consistent with a velocity
dispersion which tends to a constant $15\,{\rm km\,s^{-1}}$ at small radii, and so there
is no need to invoke any strange behavior in the kinematics at small radii.
Small number statistics present a fundamental limitation to studies of stellar 
kinematics in the very central regions of clusters. 
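The size of this small-number effect can be estimated with a short
calculation. The sketch below treats the measured dispersion as the sample
standard deviation of three independent Gaussian velocities, which is a
simplifying assumption (an integrated-light measurement also weights each
star by its brightness). For three stars, $2s^2/\sigma^2$ follows a
chi-squared distribution with two degrees of freedom, so the probability has a
closed form; the Monte Carlo function is only a cross-check. The function
names are illustrative.

```python
import math
import random
import statistics


def prob_dispersion_between(lo, hi, sigma):
    """P(lo < s < hi) for the sample dispersion s of three Gaussian draws.

    With n = 3, 2 s^2 / sigma^2 is chi-squared with 2 degrees of freedom,
    whose cumulative distribution is 1 - exp(-x / 2).
    """
    def cdf(s):
        return 1.0 - math.exp(-(s / sigma) ** 2)

    return cdf(hi) - cdf(lo)


def monte_carlo_check(lo, hi, sigma, trials=20000, seed=0):
    """Estimate the same probability by drawing many sets of three stars."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        velocities = [rng.gauss(0.0, sigma) for _ in range(3)]
        if lo < statistics.stdev(velocities) < hi:
            hits += 1
    return hits / trials


# Intrinsic dispersion 15 km/s; measured value between 5 and 25 km/s.
p_exact = prob_dispersion_between(5.0, 25.0, 15.0)
p_mc = monte_carlo_check(5.0, 25.0, 15.0)
print(f"analytic: {p_exact:.3f}, Monte Carlo: {p_mc:.3f}")
```

Under these assumptions about 83% of three-star samples give a dispersion
between 5 and 25 km/s, and roughly 6% exceed 25 km/s, so a measured value near
25 km/s is unusual but not implausible.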


Proper motions We can also investigate the internal kinematics of glob- 
ular clusters through the proper motions of their stars (§2.1.4). For a nearby 
globular cluster at a distance of $\sim 5$ kpc, a typical transverse stellar
velocity within the cluster of $\sim 5\,{\rm km\,s^{-1}}$ will result in a proper motion of only
$\sim 0.2\,{\rm mas\,yr^{-1}}$. In order to detect such small shifts, studies must use archival
photographic plates often reaching back as far as the beginning of the cen- 
tury. Even with such a long temporal baseline, the shifts in stellar positions 
are still very small, so careful astrometric measurements must be made in 
order to determine proper motions. These measurements are by no means
simple, since telescopic images, particularly those recorded on the older
photographic plates, can be significantly distorted. The analysis procedure
must therefore solve simultaneously for these large-scale distortions, which
displace the apparent locations of stars, and for the intrinsic stellar proper
motions (e.g. Cudworth 1985).
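The proper motion quoted above follows directly from the transverse velocity
and the distance. A minimal sketch of the conversion is given below; the
function name is illustrative, and the 80-year baseline in the usage lines is
an assumed round number standing in for plates that reach back to the
beginning of the century.

```python
import math

KM_PER_KPC = 3.0856775814913673e16            # kilometres in one kiloparsec
SEC_PER_YR = 3.15576e7                        # seconds in one Julian year
MAS_PER_RAD = math.degrees(1.0) * 3600.0e3    # milliarcseconds in one radian


def proper_motion_mas_per_yr(v_transverse_kms, distance_kpc):
    """Proper motion of a star with the given transverse velocity and distance."""
    rad_per_sec = v_transverse_kms / (distance_kpc * KM_PER_KPC)
    return rad_per_sec * SEC_PER_YR * MAS_PER_RAD


mu = proper_motion_mas_per_yr(5.0, 5.0)       # 5 km/s at 5 kpc
baseline_yr = 80.0                            # assumed plate baseline
print(f"proper motion: {mu:.2f} mas/yr")
print(f"shift over {baseline_yr:.0f} yr: {mu * baseline_yr:.0f} mas")
```

The result, about 0.21 mas per year, shows why even a baseline of most of a
century produces a total displacement of only a few hundredths of an
arcsecond.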

As an example of the results of such an analysis, Figure 6.24 shows 
the proper motions of stars in M5 as measured by Rees (1993) from plates 
obtained between 1900 and 1982. The left panel shows the angular velocity 
of each star on the plane of the sky, $(\mu_x, \mu_y)$, relative to the cluster as a
whole. The cluster members are concentrated quite tightly about the origin 
in this plot, reflecting the system’s small internal velocity dispersion, while 
contaminating field stars have a much wider spread in proper motions, with a 
different mean velocity. Clearly, proper motions provide an efficient tool for 
identifying cluster members with very little ambiguity. Such measurements 
are thus very useful for cleaning up CM diagrams by removing interloping 
field stars. 

Figure 6.24 Proper motions of stars in the field containing the globular cluster M5. The
left panel shows the angular velocity for each star, $(\mu_x, \mu_y)$; the stars identified as cluster
members are shown as solid points, while field stars are marked by open circles. The right
panel shows the locations of these stars on the sky, with lines marking where the proper
motions predict that the cluster members will have moved to in $5 \times 10^4$ years. Proper
motions have not been determined at small radii due to crowding. [From data published
in Rees (1993)]

Although the cluster members are tightly concentrated about the origin
in this plot, they do show some scatter. This scatter can be quantified by
fitting a Gaussian distribution to the arrangement of cluster members in
the $(\mu_x, \mu_y)$ plane, and measuring its dispersion, $\sigma_{\rm tot}$, which for the data
in Figure 6.24 is $\sim 0.4\,{\rm mas\,yr^{-1}}$. Part of this dispersion can be attributed
to the finite uncertainties in the derived proper motions; even the careful
analysis applied to these data results in RMS proper motion uncertainties of
$\sigma_{\rm err} \sim 0.1\,{\rm mas\,yr^{-1}}$. However, $\sigma_{\rm tot}$ is significantly greater than $\sigma_{\rm err}$, so it is
clear that data of this quality are detecting the intrinsic motions of the stars,
which contribute a characteristic RMS proper motion $\sigma_{\rm int}$. If both the errors
and the intrinsic motions have Gaussian distributions in proper motion, then
we can sum their contributions in quadrature so that $\sigma_{\rm tot}^2 = \sigma_{\rm err}^2 + \sigma_{\rm int}^2$.
The simplest use for this kinematic information is in determining the 
distance to the cluster via statistical parallax (see §2.2.4): if we equate the 
motions on the plane of the sky to the line-of-sight velocity dispersion, $\sigma_{\rm los}$,
as measured from the Doppler shifts of cluster members, then we find the
distance to be $D = \sigma_{\rm los}/\sigma_{\rm int}$ (where the angular shifts in $\sigma_{\rm int}$ are measured
in radians). In practice, this formulation is too simplistic since, as we have
seen above, the distribution of stellar velocities is not isotropic, and so the 
transverse and line-of-sight velocities will not in general be equal. This com- 
plication can be overcome by using the radial velocity data alone to calculate 
a full dynamical model for the stellar system. This model then allows us to 
predict the distribution of transverse velocities that we expect to observe at 
the locations in the cluster where stellar proper motions have been measured. 
The distance to the cluster can then be determined by reconciling the ob- 
served angular velocities to the predicted distributions of spatial velocities. 
Ultimately, this technique offers an independent means of determining the
distances to globular clusters which can be used to calibrate other
important phenomena such as the absolute magnitudes of the MS and RR Lyrae
stars. The main limitation at present is the uncertainty in the size of $\sigma_{\rm err}$,
since a mistake in estimating this contribution to the observed proper
motion will cause a corresponding error in the estimate for the intrinsic proper
motion. If this problem can be overcome, proper motions can be used to
derive distances to globular clusters to better than 10% (Lupton, Gunn & 
Griffin 1987). 
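The simple isotropic form of the statistical-parallax argument, and its
sensitivity to the adopted measurement error, can be sketched in a few lines.
The code below uses only the dispersions quoted in the text for M5 (0.4 and
0.1 mas per year); the alternative error of 0.2 mas per year is a hypothetical
value introduced to show the size of the effect, and the function names are
illustrative. No line-of-sight dispersion for M5 is assumed, so the usage lines
report the ratio of the two distance estimates rather than a distance.

```python
import math

# 1 mas/yr at 1 kpc corresponds to 1 au/yr, which is 4.74047 km/s.
KMS_PER_MAS_YR_KPC = 4.74047


def intrinsic_dispersion(sigma_tot, sigma_err):
    """Subtract the measurement error in quadrature."""
    return math.sqrt(sigma_tot ** 2 - sigma_err ** 2)


def distance_kpc(sigma_los_kms, sigma_int_mas_yr):
    """Isotropic estimate D = sigma_los / sigma_int, in practical units."""
    return sigma_los_kms / (KMS_PER_MAS_YR_KPC * sigma_int_mas_yr)


sigma_int = intrinsic_dispersion(0.4, 0.1)        # values quoted for M5
sigma_int_alt = intrinsic_dispersion(0.4, 0.2)    # hypothetical larger error

# For any fixed line-of-sight dispersion, D scales as 1 / sigma_int.
ratio = sigma_int / sigma_int_alt
print(f"sigma_int = {sigma_int:.3f} mas/yr")
print(f"distance changes by a factor {ratio:.2f} if sigma_err is 0.2, not 0.1")
```

Doubling the assumed error from 0.1 to 0.2 mas per year changes the inferred
distance by about 12%, which is why the error budget dominates the accuracy
of the method.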

Ultimately, there is a wealth of information that might be derived from 
samples of proper motions like those shown in the right panel of Figure 6.24. 
Leonard & Merritt (1989) have demonstrated that from a large sample of 
accurately-measured proper motions at all radii in a cluster it is possible 
to determine not only the shape and amplitude of the velocity ellipsoid 
(Box 10.2) but also the underlying gravitational potential of the cluster. 
Although existing proper motion data sets have neither the quality nor the 
quantity of data for such a complete analysis, some conclusions can already 
be drawn. For example, Lupton, Gunn & Griffin (1987) have shown that 
proper motions of stars at large radii in M13 are preferentially oriented to- 
ward (or away from) the cluster center, which implies that the stellar orbits 
are radially-dominated in the outer parts of this cluster, as predicted by the 
analysis of line-of-sight velocities. With improving imaging technology mak- 
ing it possible to measure ever smaller shifts in the positions of ever fainter 
stars, the gathering of large samples of high-quality proper motion measure- 
ments in globular clusters will soon greatly expand our understanding of the 
kinematics of these systems. 


6.2 Open clusters 


Not all the agglomerations of stars in the Milky Way have the tidy struc- 
ture of globular clusters. As Figure 6.25 illustrates, many star clusters show 
little symmetry in their appearance, and lack the central concentration char- 
acteristic of globular clusters. Because of their more diffuse morphology, 
these systems are known as open clusters. Unlike the globular clusters, 
open clusters are found to be strongly concentrated toward the plane of the 
Milky Way (see Table 6.2), and hence they are also sometimes referred to as 
Galactic clusters. 

On account of their rag-tag appearance and low central concentration, 
open clusters are less readily picked out by the eye than the symmetric 
globular clusters. Moreover, they occur near the Galactic plane, where they 
tend to be heavily obscured by dust and can easily be lost amongst the 
high density of field stars. Consequently, the Messier catalog lists only 27 
open clusters. Nevertheless, they are intrinsically plentiful systems: the 
database compiled by Lynga (1987) lists some 1200 open clusters, and even 
this catalog covers only a small fraction of the Milky Way. Typical properties 
for a selection of open clusters are presented in Table 6.2. 

Figure 6.25 Negative image of the pair of open clusters h and χ
Persei. The area shown in this figure is approximately 1 deg$^2$. [DSS
image from the Palomar/National Geographic Society Sky Survey,
reproduced by permission]

Open clusters cover a broad spectrum in size, luminosity and
morphology, ranging from groupings with a dozen or so members with integrated
magnitudes of $M_V \sim -3$ up to systems of many thousands of stars with
integrated magnitudes as bright as $M_V \sim -9$. Their diameters range up
to a few parsecs. Since they are fairly large objects, and we can usually
only detect relatively nearby examples, open clusters frequently cover large
areas on the sky, subtending angles of several degrees. The densities of
these systems range from only marginally higher than the field-star density
($\sim 0.1$ stars ${\rm pc^{-3}}$) up to $\sim 10^3$ stars ${\rm pc^{-3}}$ at the centers of the richest clusters.
The lowest density systems, also termed associations, can frequently only 
be distinguished from the field because they contain a high fraction of stars 
of an unusual type, leading to a noticeable enhancement in the projected 
density of these stars in the direction of the cluster. OB associations, for 
example, are aggregations of stars containing a large fraction of O and B 
stars, while T associations are rich in T Tauri stars. 
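The link between the physical sizes and the angular sizes quoted above is the
small-angle relation. As a minimal sketch, the code below converts the angular
diameters and distances listed for the Hyades and the Pleiades in Table 6.2
into physical diameters; the function name is illustrative, and the
small-angle approximation is an assumption that is adequate at the few-degree
level.

```python
import math


def physical_diameter_pc(angular_diameter_arcmin, distance_kpc):
    """Physical diameter from angular diameter, using the small-angle relation."""
    theta_rad = math.radians(angular_diameter_arcmin / 60.0)
    return theta_rad * distance_kpc * 1000.0


# (angular diameter in arcmin, distance in kpc) as listed in Table 6.2
clusters = {"Hyades": (329.0, 0.05), "Pleiades": (109.0, 0.13)}
for name, (diameter_arcmin, d_kpc) in clusters.items():
    size = physical_diameter_pc(diameter_arcmin, d_kpc)
    print(f"{name}: {diameter_arcmin / 60.0:.1f} deg across, about {size:.1f} pc")
```

Both nearby clusters come out at a few parsecs across, while spanning between
roughly two and five degrees on the sky.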
Table 6.2 Characteristics of selected open clusters

NGC   Other          D      (R, z)_gal       M_V    Diameter  Trumpler   (B−V)_TO  log(t/yr)  [Fe/H]  Comments
                     (kpc)  (kpc)                   (arcmin)  class
2264  -              0.79   (9.23, 0.03)     −5.4   40        III,3,p,n  −0.25     6.5-7.0    −0.15   Typical young cluster
6705  M11            1.72   (6.96, −0.08)    −5.4   13        I,2,r,-    −0.05     8.4        +0.05   Typical intermediate-age cluster
188   -              1.55   (9.35, 0.58)     −2.9   15        II,2,r,-   +0.58     9.8        −0.16   Typical old cluster
6791  -              4.20   (8.12, 0.80)     −3.6   10        I,2,r,-    +0.60     10.0       +0.15   Very old metal-rich cluster
7261  -              2.12   (9.23, 0.03)     −3.2   6         III,1,p,-  −0.25     7.6        −0.46   Young metal-poor cluster
-     Berkeley 17    2.40   (10.89, −0.15)          8         III,1,r,-  +0.58     10.1       −0.29   Oldest known cluster
-     Berkeley 20    8.14   (16.12, −2.42)          3         I,3,p,-    -         9.7        −0.75   Metal-poorest known cluster
-     Hyades         0.05   (8.55, −0.02)           329       II,3,m,-   +0.12     8.8        +0.19   Nearby cluster, classically used to establish the distance scale
-     Coma, Mel 111  0.08   (8.56, 0.07)            275       III,3,p,-  +0.05     8.6        −0.03   Nearby poor cluster
-     Pleiades, M45  0.13   (8.63, −0.05)           109       I,3,r,n    −0.11     8.0        +0.11   Nearby well-studied cluster
2632  Praesepe, M44  0.16   (8.64, 0.08)            95        II,3,m,-   +0.15     8.8        +0.19   Nearby well-studied cluster
869   h Persei       2.23   (10.19, −0.14)          29        I,3,r,-    −0.25     6.7        −0.05   Rich young cluster, twin with χ Persei
884   χ Persei       2.22   (10.18, −0.14)          29        I,3,r,-    −0.25     6.7        −0.05   Rich young cluster, twin with h Persei

NOTES: D is the distance to the cluster; (R, z)_gal are its cylindrical coordinates relative to the Galactic center; (B−V)_TO is the color of
its main sequence turn-off point; t is its estimated age; other quantities are as defined in the chapter.

SOURCE: From the data collected in Lynga (1987), Meynet, Mermilliod & Maeder (1993) and Friel (1995).

Figure 6.26 Negative image of M45, the Pleiades open star cluster.
Note the nebulosity surrounding the stars. [Image kindly provided
by J. Ware (http://www.galaxyphoto.com/)]

The properties of an open cluster can be usefully summarized by its
Trumpler classification (Trumpler 1930), which specifies:
1) its degree of concentration as a roman numeral I...IV, where lower
numbers indicate higher degrees of central concentration;
2) the range in brightness of its stars measured on a scale 1...3, where
lower numbers indicate a smaller range of stellar brightnesses;
3) its richness, specified as p, m, or r depending on whether the system is
poor (containing fewer than 50 stars), moderate (50-100 stars), or rich
(more than 100 stars).

An ‘n’ is appended to the classification if there is nebulous, diffuse emission
around the cluster’s stars. For example, the Pleiades (see Figure 6.26) is
designated I,3,r,n in this scheme.
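The four-part Trumpler code described above maps naturally onto a small
record. The sketch below parses a code written in the comma-separated form
used in Table 6.2, where a trailing "-" marks the absence of nebulosity. The
class and function names are illustrative, and the comma-separated layout is
simply the convention of this table rather than a standard interchange format.

```python
from dataclasses import dataclass

ROMAN = {"I": 1, "II": 2, "III": 3, "IV": 4}
RICHNESS = {"p": "poor", "m": "moderate", "r": "rich"}


@dataclass(frozen=True)
class TrumplerClass:
    concentration: int      # 1 (most concentrated) to 4 (least)
    brightness_range: int   # 1 (small range) to 3 (large range)
    richness: str           # "poor", "moderate" or "rich"
    nebulosity: bool        # True if an 'n' is appended


def parse_trumpler(code):
    """Parse a code such as 'I,3,r,n' or 'II,2,r,-'."""
    parts = [part.strip() for part in code.split(",")]
    if len(parts) != 4:
        raise ValueError(f"expected four fields, got {code!r}")
    concentration, brightness, richness, nebulosity = parts
    if concentration not in ROMAN or richness not in RICHNESS:
        raise ValueError(f"unrecognized field in {code!r}")
    if brightness not in ("1", "2", "3") or nebulosity not in ("n", "-"):
        raise ValueError(f"unrecognized field in {code!r}")
    return TrumplerClass(
        ROMAN[concentration], int(brightness), RICHNESS[richness], nebulosity == "n"
    )


print(parse_trumpler("I,3,r,n"))   # the Pleiades
```

For the Pleiades this yields the most concentrated class, the widest
brightness range, a rich cluster, and nebulosity present.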

The presence of diffuse emission from around the stars in many open 
clusters provides a striking contrast with the purely stellar emission from 
globular clusters. This nebulosity arises from starlight which has been re- 
flected by dust grains in the cluster. As is usual in astrophysical systems, this 
dust is accompanied by gaseous material, and so, unlike globular clusters, 
open clusters contain a significant ISM. This observation ties in closely with 
a further difference between the two classes of cluster: unlike globular clus- 
ters, many open clusters contain bright, blue stars. As we discussed above, 
we believe globular clusters are old because they lack such stars; conversely, 
the presence of bright blue stars in many open clusters implies that these sys- 
tems formed much more recently. As we have seen in §5.3.2, star formation 
is an intrinsically inefficient process, and the gas in open clusters is material 
that has not been converted into stars. In the youngest star clusters, we can 
observe directly the dense gaseous cores from which stars form and T Tauri 
stars, which are believed to be stars in the final stages of formation (§5.1.8). 
Hence star formation is an on-going process in these systems. 

Further support for the recent origin of most open clusters comes from
observations of their metallicities. Spectral studies of open-cluster stars yield
heavy element abundances which scatter around the solar value, with
metallicities lying in the range $-0.75 < {\rm [Fe/H]} < 0.25$ (see Table 6.2). Such high
metallicities are to be expected in young objects which have formed from 
material in the Galactic disk that has already been chemically enriched by 
previous stellar generations. Since there are no very metal-poor open clus- 
ters, metallicity variations are a less important factor in interpreting the 
properties of these systems than is the case for studies of globular clusters. 

Like their globular kin, open clusters have played a key role in the de- 
velopment of our understanding of galactic astronomy. Since they contain 
bright, blue stars, open clusters are testing grounds for investigating the 
properties of high-mass stars. The spatial distribution of open clusters yields 
a useful probe of the structure of the Galactic disk: the youngest systems 
delineate the spiral structure of the Galaxy (§10.4.5) while the older systems 
trace the kinematics of the outer Galaxy (§9.2.3). Since it is believed that 
many of the stars in the disk of the Milky Way originated in open clusters, 
the properties of these systems must dictate many of the properties of the 
stellar disk as a whole. Finally, as we see in §2.2.2 and Chapter 7, the nearby 
Hyades cluster has played an anchoring role in establishing the size of the 
entire Universe. We therefore now apply some of the techniques developed 
in the previous section to the study of open clusters. 


6.2.1 Color-magnitude diagrams 


As for globular clusters, the CM diagram offers a powerful diagnostic of the 
evolutionary state of open clusters. Although open clusters are generally closer
to us than globular clusters, the depth of an open cluster along the line of sight is
usually still very much less than the cluster’s distance, so we expect the 
sequences in its CM diagram to be sharply defined. In one respect, the 
CM diagrams of open clusters are easier to construct than those of globular 
clusters: the more diffuse nature of these systems means that the confusion 
which hampers stellar photometry in globular clusters (§6.1.1) is seldom an 
issue in studies of open clusters. However, the locations of open clusters 
close to the plane of the Galaxy mean that their CM diagrams are liable 
to be heavily contaminated by unrelated field stars, and some care must be 
taken to minimize this contamination by selecting stars on the basis of their 
kinematics, or by selecting only those stars with colors that are consistent 
with objects that have been reddened by the column of dust between us and 
the cluster. Since open clusters lie near the Galactic plane, extinction can be 
a large effect, which requires careful correction. An additional complication 
is that there is dust within these systems, so that stars within a single cluster 
suffer different amounts of extinction. 

As Figure 6.27 illustrates, once these difficulties have been overcome,
the MS shows up clearly in the CM diagrams of open clusters. In fact,
this sequence is usually much more extensive than in the CM diagram of a
globular cluster, stretching all the way up to the bright, blue stars at the top
end of the sequence. The presence of such massive, short-lived stars implies
that at least some of the stars in these clusters are very young. Further, the
sharp delineation of the sequence implies that all the stars are of very similar
age, since none of the less massive stars with longer MS lifetimes have yet
started to turn off toward the giant branch.

Figure 6.27 CM diagram of the young open cluster NGC 4755.
Note the extensive MS, the scattering of stars to its right at faint
magnitudes, and the single evolved supergiant star at the top right
of the diagram. [From data published in Sagar & Cannon (1995)
and Dachs & Kaiser (1984)]

Towards the bottom of the MS in an open cluster, the sequence
frequently becomes broader, with a scattering of stars on the red side
(see Figure 6.27). The width of this sequence is too large to be attributed to
observational errors. It also cannot be the result of patchy extinction, since
we would expect such random reddening to affect the colors of some of the
brighter stars as well, yet their part of the MS is narrow. The broadening of
the MS at low luminosities is, in fact, a consequence of the system’s youth:
as we have seen in §5.1.8, before they reach the MS, stars evolve from the
Hayashi limit on the extreme right of the CM diagram over to the MS, with
lower mass stars taking longer to make this transition. For the low mass
stars, the timescale to reach the MS is longer than the age of the cluster,
and so we observe them following their Hayashi tracks down to the MS (see
Figure 5.10). Notice that the scattering of these stars in the CM diagram
implies that they cannot all have formed simultaneously, as different stars with
the same masses have evolved to different points in their pre-main-sequence
evolution.

Figure 6.28 Sequences found in the CM diagrams of a number
of open clusters, shifted to a common absolute-magnitude scale.
[From data kindly provided by J.-C. Mermilliod]

At the upper end of the MS, the stars have started to evolve away from 
the MS, giving the sequence more curvature than the ZAMS. Subsequently, 
massive stars evolve very rapidly across the CM diagram, have very brief 
lives as supergiant stars, and explode as supernovae. The short timescale on 
which these stars pass through the giant stages of their evolution means that 
the late-evolution sequences in the CM diagram are very sparsely populated 
with a pronounced Hertzsprung gap (§3.5). 

To compare the CM diagrams of different clusters, we must place them 
on a common absolute-magnitude scale by MS fitting. The calibration of 
this scale is better established than for the globular clusters (§6.1.2) since 
there are a large number of nearby stars with measured trigonometric par- 
allaxes that have similar metallicities to the open cluster members. The 
more extensive nature of the MS in open clusters also assists in the fitting
process, since it is possible to fix the calibration over a wide range of colors
and magnitudes. Further, the Hipparcos satellite has measured trigonomet- 
ric parallaxes out to distances that encompass a number of open clusters, so 
the absolute calibration of the CM diagrams for these nearby clusters can 
be obtained directly. Figure 6.28 shows schematically the sequences for a 
selection of open clusters which have been calibrated to an absolute mag- 
nitude scale. Although some of the clusters reveal very similar sequences, 
the most striking feature in Figure 6.28 is the wide range of MS turnoff 
points found in different clusters (see also Table 6.2). This variety of turnoff 
points, some of which take the form of an abrupt termination of the MS, 
contrasts markedly with the common location of a smooth turnoff point in 
the CM diagrams of globular clusters. As we shall discuss below, most of 
the differences between the sequences can be attributed to a wide range of 
ages amongst open clusters. There are, however, indications that some of
the differences between the locations of sequences cannot be attributed to
age or metallicity variations (Mermilliod et al. 1997): like those of globular
clusters (§6.1.6), the CM diagrams of open clusters seem to be under the
influence of an as-yet unidentified third parameter.
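Placing a cluster sequence on an absolute-magnitude scale amounts to
subtracting its distance modulus, whether that modulus comes from MS fitting
or from a trigonometric parallax. The sketch below shows the standard relation
between distance and distance modulus, using the distances listed for the
Hyades and NGC 188 in Table 6.2. The function names are illustrative, and the
extinction term is left at zero as a simplifying assumption; a real analysis
must correct for reddening as described above.

```python
import math


def distance_modulus(distance_pc, extinction_mag=0.0):
    """Apparent minus absolute magnitude: 5 log10(d / 10 pc) + A."""
    return 5.0 * math.log10(distance_pc / 10.0) + extinction_mag


def absolute_magnitude(apparent_mag, distance_pc, extinction_mag=0.0):
    """Shift an apparent magnitude onto the absolute scale."""
    return apparent_mag - distance_modulus(distance_pc, extinction_mag)


# Distances from Table 6.2, converted from kpc to pc.
for name, d_pc in {"Hyades": 50.0, "NGC 188": 1550.0}.items():
    print(f"{name}: distance modulus = {distance_modulus(d_pc):.2f} mag")
```

Every star in a cluster shares essentially the same modulus, so the whole
observed sequence moves vertically by this amount in the CM diagram.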


6.2.2 The ages and demise of open clusters 


Comparing Figures 6.28 and 6.6, it is apparent that the differences between 
the CM diagrams of different open clusters can be readily explained by dif- 
ferences in their ages. If we crudely apply equation (6.2) to the systems in 
Figure 6.28 (which, like most open clusters, have approximately solar metal- 
licities) we find that the observed spread in their turnoff points corresponds 
to ages that range from ~ 1 Myr all the way up to ~ 10 Gyr (see Table 6.2). 

More detailed calculations confirm this spread in cluster ages. The cur- 
rent record for the oldest reliable age determination for an open cluster is held 
by Berkeley 17, which has a turnoff point that places its age at $12 \pm 2$ Gyr.
[See Friel (1995) for a discussion of the properties of this and other old open
clusters.]

The slow approach to the MS of low-mass stars enables us to assign ages 
to very young clusters. For example, by comparing the locations of low-mass 
stars in the CM diagram to the predictions of pre-main-sequence contraction 
calculations (§5.1.8), Adams, Strom & Strom (1983) found that in the open 
cluster NGC 2264 there are low-mass stars with ages as high as $\tau \sim 10$ Myr.
This estimate conflicts with the cluster age one obtains by identifying the
turnoff from the MS (§6.1.4). Walker (1956) found that NGC 2264 has a
sharply defined turnoff at spectral type O7. From the MS nuclear burning
lifetime of an O7 star, we derive a cluster age $\tau \sim 3$ Myr. In fact, it is
generally found that cluster ages determined from low-mass stars are greater 
than those determined from high-mass stars. As Herbig (1962) first pointed 
out, this ordering implies that low-mass stars form earlier than high-mass
stars in these systems, although the reasons for this difference are lost in the
details of the ill-understood process of star formation. It is, however, just 
as well that the high-mass stars form last, since these stars pass through 
their lifetimes very rapidly, and when a number of such stars explode as 
supernovae they will disperse the remaining ISM in the cluster, bringing star 
formation to an abrupt end. If these stars had been created first, low-mass 
stars would never have had the chance to form. 

As for the globular clusters, the absolute ages of individual open clusters 
are subject to significant uncertainties: using a different isochrone calcula- 
tion, different conversion from theoretical stellar structure calculations to 
observed colors and magnitudes, or different reddening or distance modulus 
can all alter the derived age considerably. For example, estimates of the 
age of the old open cluster NGC 188 have ranged from 6.5 Gyr (Twarog & 
Anthony-Twarog 1989) up to 10 Gyr (VandenBerg 1985), depending on the
adopted reddening and distance modulus. Similarly, age estimates based 
on pre-main-sequence evolutionary tracks are also found to be significantly 
model-dependent (D’Antona & Mazzitelli 1994). However, the differences 
between the properties of different open clusters are so extreme that there 
can be no doubt that they span a very wide range of ages. There also seems 
to be remarkably little correlation between clusters’ ages and metallicities 
(see Table 6.2). 

The spread in the ages of open clusters implies that they are continuously 
forming in the disk of the Galaxy. Mermilliod (1980) compiled a reasonably 
complete database of open clusters within 750 pc of the Sun, which listed
14 open clusters younger than 0.1 Gyr. This density of young open clusters
implies that they are forming at a rate of $\sim 80\,{\rm kpc^{-2}\,Gyr^{-1}}$. However, as
Figure 6.29 shows, the density of older clusters is very much lower, dropping
to less than $1\,{\rm kpc^{-2}\,Gyr^{-1}}$ for the oldest systems.
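The quoted formation rate is simply the number of young clusters divided by
the area of the disk surveyed and by the age interval. The sketch below
reproduces the arithmetic; it assumes that the sample is complete within the
survey radius and that no clusters younger than 0.1 Gyr have yet been
destroyed, and the function name is illustrative.

```python
import math


def formation_rate(n_clusters, radius_kpc, max_age_gyr):
    """Clusters formed per kpc^2 of disk per Gyr."""
    area_kpc2 = math.pi * radius_kpc ** 2
    return n_clusters / (area_kpc2 * max_age_gyr)


# 14 clusters younger than 0.1 Gyr within 750 pc of the Sun
rate = formation_rate(14, 0.75, 0.1)
print(f"formation rate: {rate:.0f} per kpc^2 per Gyr")
```

The same function applied to the counts in older age bins gives the much
lower densities plotted in Figure 6.29.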

This underabundance of old open clusters has been known for a
considerable time (Oort 1958). It could be interpreted as meaning that the
efficiency with which open clusters form has increased dramatically over the 
last billion years, but such a model would be contrived. In fact, it has also 
long been realized that open clusters are not very robust structures: tidal 
encounters with massive molecular clouds in the Galactic disk can rip them 
apart (Spitzer 1958). The deficit of old clusters can thus be simply under- 
stood if most of them have been destroyed by such encounters. Evidence in 
favor of this idea has come from the observation that old clusters are found 
preferentially at large distances from the Galactic center and far from the 
Galactic plane, where these encounters will be least frequent [van den Bergh 
& McClure (1980); see also Table 6.2]. 

Additional support for this hypothesis has come from studies of the 
morphological properties of open clusters as a function of age. Janes & Phelps 
(1994) have shown that while 49% of the old open clusters ($t \gtrsim 0.8$ Gyr) in
the Lynga (1987) database are of Trumpler richness class ‘r,’ only 18% of
the younger clusters ($t \lesssim 0.8$ Gyr) fall in this richest class. Further, 91% of
the older clusters lie in Trumpler’s highest two concentration classes (I and 
II), while only 62% of the younger clusters have such high degrees of central 
concentration. Thus, it would appear that because the poorer, less centrally 
concentrated clusters are less strongly gravitationally bound, they are less 
able to withstand tidal encounters with molecular clouds, and so they do not 
survive to old age. 

We can even witness the final demise of some open clusters after they 
have been disrupted. When the cluster dissolves, its former members still 
“remember” the direction in which the cluster was moving at the time of its 
destruction, and continue to follow similar orbits in the Galaxy even though 
they are no longer gravitationally bound together. As we shall see in §10.3.2, 
it is possible to pick out such moving groups of stars which have similar 
velocities and similar metallicities even after they have spread out over large 
areas of the sky. 


6.2.3 Structure and kinematics 


We now turn to the internal structure of open clusters. The main difficulty 
in studying the dynamics of open clusters is that these objects are not such 
neat, regular systems as globular clusters. There is therefore less justification 
for modeling open clusters as spherically-symmetric systems in equilibrium. 
Nevertheless, simple King models (BT §4.4, and §6.1.10 above) have been 
shown to provide a reasonable description of the radial distribution of stars 
in a number of open clusters (e.g. King 1962, Leonard 1988). These model 
fits confirm the visual impression that open clusters are more diffuse systems 
than globular clusters: typically, they have core radii of $r_c \sim 1$–$2$ pc, and 
tidal radii of $r_t \sim 10$–$20$ pc, giving low concentration parameters of $c \sim 1$ 
(cf. Table 6.1). Note, however, that the untidy appearance of many open 
clusters, combined with the large amount of contamination from field stars 
due to their low Galactic latitudes, means that $r_t$ is usually poorly constrained 
by observations. The relatively low quality of the radial profile of a typical 
open cluster is such that the profile does not place very strong constraints on 
the cluster’s dynamical state. In particular, it is not possible to distinguish 
between the simple and multi-mass King models (see §6.1.10) from such data. 

Figure 6.30 The fraction, $F$, of MS and red-giant stars in the open 
cluster M11 that lie within a projected radius $R$ of the cluster’s 
center. [After Mathieu (1984)] 

Multi-mass models should be more appropriate, however, since the youth 
of open clusters implies that they contain stars with a wide range of masses. 
Further, an open cluster will evolve to a system in which different masses 
have different kinematics on a fairly short timescale. Equipartition of energy 
between stars of different masses will occur on a timescale characterized by 
the relaxation time of the cluster, $t_{\rm relax}$. For a gravitating system made 
up of $N$ bodies, $t_{\rm relax} \propto N/\ln N$ (BT §4.0.1), and so the relatively small 
numbers of stars in open clusters mean that these systems should relax quite 
rapidly toward equipartition, with the low-mass stars travelling faster than 
their high-mass associates. 
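
As a rough illustration of how quickly small-N systems relax, the sketch below evaluates the scaling quoted above. It assumes the normalization $t_{\rm relax} \approx [N/(8\ln N)]\,t_{\rm cross}$ given in BT; the crossing time of 1 Myr and the star counts are illustrative values chosen here, not measurements of any particular cluster.

```python
import math


def relaxation_time(n_stars, t_cross):
    """Two-body relaxation time, t_relax ~ N / (8 ln N) * t_cross.

    The factor of 8 is an assumed normalization (following BT); the text
    itself quotes only the N / ln N scaling.
    """
    if n_stars < 2:
        raise ValueError("need at least two stars")
    return n_stars / (8.0 * math.log(n_stars)) * t_cross


# Assumed, purely illustrative crossing time in Myr.
T_CROSS_MYR = 1.0

for n in (100, 1_000, 100_000):
    t_relax = relaxation_time(n, T_CROSS_MYR)
    print(f"N = {n:>7d}: t_relax ~ {t_relax:8.1f} Myr")
```

With these assumed numbers, a poor open cluster of 100 stars relaxes in a few crossing times, whereas a system of $10^5$ stars needs of order a thousand crossing times.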

As we have discussed in §6.1.10, one of the observable consequences of 
equipartition is that the more massive stars become more centrally concen- 
trated than the less massive stars. This phenomenon is apparent in a number 
of older open clusters, where, presumably, equipartition has had time to become 
established. For example, Figure 6.30 illustrates the effects of mass 
segregation in the intermediate-age cluster M11. It shows the fraction of the 
total population of different types of cluster star that lie within a projected 
radius $R$ as a function of $R$. It is clear that on average the fainter MS stars 
(with typical masses of $\sim 2.8\,M_\odot$) lie at larger radii than the brightest MS 
stars (with typical masses of $\sim 4\,M_\odot$). What is less well understood is why 
the red giants in the cluster seem also to be amongst the least centrally- 
concentrated cluster stars. Since these stars have recently evolved from the 
bright end of the MS, we would expect their masses to be slightly in excess of 
$4\,M_\odot$, and as such one would naively predict that they should be the most 
centrally-concentrated population. The number of stars involved is quite 
small, but the extended nature of the giant population does not appear to 
be a statistical fluke, since the same phenomenon has been witnessed in a 
number of clusters (Hawarden 1975). The effect may be related to the mass 
loss that stars undergo in their giant phase (see §6.1.3), since the post-mass- 
loss stars should relax into a more extended distribution appropriate to their 
reduced masses. If this explanation is valid, then the amount of mass loss 
must be large. In Figure 6.30, for example, the red-giant distribution mimics 
that of the $\sim 2.8\,M_\odot$ MS stars; if the two distributions are similar because 
the masses of the objects are similar, then the giant stars must have lost 
more than $1\,M_\odot$ from their original mass. Such losses significantly exceed 
the $\sim 0.2\,M_\odot$ believed to be shed by globular cluster giant stars (§6.1.3). 
Careful analysis of the CM diagrams of open clusters has uncovered evidence 
that the giant stars in these systems do, indeed, lose more mass than those in 
globular clusters (Tripicco, Dorman & Bell 1993). Since mass-loss is driven 
by luminosity, which increases strongly with mass [see equation (5.5)], it is 
entirely natural that the amount of mass a star loses in passing from the 
MS to the RGB should increase strongly with its initial mass. However, 
even if enough mass is lost by higher-mass stars, it is not clear that red- 
giant lifetimes are long enough for the spatial distribution of giants to have 
evolved significantly from that of their MS progenitors. Thus the extended 
distributions of red giants in open clusters remain problematic. 

We might hope that direct studies of the motions of stars in open clusters 
will clarify the significance of mass segregation. The velocity dispersions 
of these low-mass systems are typically only $\sim 3$ km s$^{-1}$, so it is quite difficult 
to measure their line-of-sight motions. Moreover, approximately a third 
of the stars in these systems turn out to be spectroscopic binaries (Liu, 
Janes & Bania 1989), and the velocity of a binary star’s motion (typically 
$\sim 20$ km s$^{-1}$) completely masks the random motion of the star within the 
cluster unless velocities are averaged over the full binary period. A more 
profitable method for studying the kinematics of open clusters is provided 
by proper motions: since these systems are typically less than a kiloparsec 
away, the proper motions that their internal velocities induce can reach the 
readily-measurable level of $\sim 1$ mas yr$^{-1}$. In a study of proper motions in 
eight open clusters, Sagar & Bhatt (1989) found only one case (IC 2391) in 
which there was convincing evidence that the brighter (more massive) MS 
stars have smaller characteristic velocities than the fainter (less massive) MS 
stars, as we would expect if energy equipartition is occurring. In the other 
clusters in this study, there is no discernible trend in the velocity dispersion 
with stellar luminosity. These observations suggest that the stellar kinemat- 
ics of most open clusters reflect the initial conditions under which the stars 
formed rather than the effects of relaxation. It should be noted, however, 
that this conclusion runs contrary to the observed luminosity segregation in 
the cluster radial profiles; it is not yet clear how these two observations can 
be reconciled. 
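
The link between internal velocities and measurable proper motions can be made concrete with the standard conversion $v_t = 4.74\,\mu\,d$ (with $v_t$ in km s$^{-1}$, $\mu$ in mas yr$^{-1}$ and $d$ in kpc). The sketch below is only a unit-conversion aid; the function names and the example distance of 0.5 kpc are assumptions made for illustration.

```python
# 1 au per year expressed in km/s; this is the usual "4.74" factor.
KM_S_PER_AU_YR = 4.74047


def tangential_velocity(mu_mas_yr, distance_kpc):
    """Transverse velocity in km/s for a proper motion in mas/yr at a distance in kpc."""
    return KM_S_PER_AU_YR * mu_mas_yr * distance_kpc


def proper_motion(v_km_s, distance_kpc):
    """Proper motion in mas/yr produced by a transverse velocity in km/s."""
    return v_km_s / (KM_S_PER_AU_YR * distance_kpc)


# Illustrative case: a 3 km/s internal motion seen at an assumed 0.5 kpc.
print(f"{proper_motion(3.0, 0.5):.2f} mas/yr")
```

For an internal velocity of 3 km s$^{-1}$ at the assumed distance of 0.5 kpc, this gives a little over 1 mas yr$^{-1}$, in line with the level quoted above.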

As we have discussed in §6.1.11, the measurement of proper motions 
provides a valuable tool for studying the detailed orbital structure within a 
star cluster. Generally, open clusters seem to have rather simple kinemat- 
ics: none of the eight clusters studied by Sagar & Bhatt (1989) show any 
significant change in the amplitude of the stellar random motions with pro- 
jected distance from the cluster center, and so there is no evidence for the 
drop in velocity dispersion with radius that we see in globular clusters (see 
Figure 6.23). In one of the clusters in this sample (NGC 2516), the proper 
motions in the radial direction exceed those in the tangential direction at 
large radii, implying that the orbits in this system become radially-oriented 
at large radii. For the majority of clusters, however, there is no discernible 
difference between radial and tangential proper motions, and so the orbits 
seem to be more-or-less isotropic. Ultimately, limitations on such dynami- 
cal studies of open clusters arise from the relatively small numbers of stars 
in these systems, which are such that the Poisson uncertainty in kinematic 
measurements will always be quite large. Further, the lack of symmetry in 
open clusters means that simple spherical models are probably inappropri- 
ate. Finally, since the relaxation time of the poorer open clusters is not much 
longer than the time it takes a star to cross the system, the overall structure 
of these systems will be evolving quite rapidly; the sophisticated modeling 
required to include the effects of this secular evolution is beyond the scope 
of what can be achieved with the limited observational data. 
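
To give a feeling for the size of the sampling uncertainty mentioned above, the following sketch compares the standard Gaussian approximation for the fractional error on a dispersion, $1/\sqrt{2(N-1)}$, with a small Monte Carlo experiment. It assumes Gaussian velocities and negligible measurement errors; the sample size of 50 stars, the number of trials and the random seed are arbitrary illustrative choices.

```python
import math
import random
import statistics


def fractional_dispersion_error(n_stars):
    """Approximate 1-sigma fractional error on a dispersion from n Gaussian velocities."""
    return 1.0 / math.sqrt(2.0 * (n_stars - 1))


def simulated_scatter(n_stars, sigma=1.0, trials=2000, seed=1):
    """Scatter of the sample dispersion over many mock clusters, in units of sigma."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(trials):
        velocities = [rng.gauss(0.0, sigma) for _ in range(n_stars)]
        estimates.append(statistics.stdev(velocities))
    return statistics.stdev(estimates) / sigma


n = 50  # assumed number of members with measured velocities
print(f"analytic:  {fractional_dispersion_error(n):.3f}")
print(f"simulated: {simulated_scatter(n):.3f}")
```

Under these assumptions, even 50 member stars determine the dispersion only to about 10%, before any allowance for binaries or field-star contamination.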


6.2.4 Luminosity function 


Some fraction of the stars in the disk of the Milky Way began their lives in 
the systems that we identify as open clusters (see Problem 6.8), and some 
further fraction must have been born in associations that would have been 
too poor to be identified as clusters. The distribution of disk-star masses is 
therefore closely coupled to the initial mass functions of open clusters and 
associations. There is no a priori reason to assume that all open clusters 
should have the same IMF: the mass function could vary wildly from cluster 
to cluster, with the mass function of the disk reflecting the sum of all the 
cluster and association IMFs from which it was produced. We therefore now 
consider the form of the open cluster IMF, whether it varies between systems, 
and how closely it matches the IMF of field stars. 

As we have already seen in the case of globular clusters, we can only 
study the IMFs of clusters indirectly via their present-day luminosity func- 
tions. The situation is somewhat simpler for the open clusters, however, 
since we can look at young open clusters, where the effects of both stel- 
lar evolution and dynamical evolution are relatively small, so the observed 
luminosity function is closely related to the initial luminosity function and 
hence the IMF. In fact, the age window within which we can study the IMF 
directly is quite small. As we have seen in §6.2.2, high-mass stars form after 
the low-mass stars in open clusters. In order to study the complete IMF, 
we must therefore look at a cluster which has formed its O and B stars, so 
that the star formation process is essentially complete. However, if we are to 
study the high-mass end of the IMF, we must also catch the cluster before 
these O and B stars have exploded as supernovae and disappeared from the 
luminosity function. 

Studies of the high-mass end of the luminosity function have therefore 
concentrated on OB clusters and associations which, by their very nature, 
fulfill the above requirements. A study of 11 OB associations by Massey, 
Johnson & DeGioia-Eastwood (1995) found that, to within the errors due to 
finite populations, the stars in these systems with masses greater than $7\,M_\odot$ 
all seem to be drawn from a common power-law IMF with index $\alpha = 2.1 \pm 0.1$ 
[see equation (6.4)]. Studies of OB associations in the Magellanic Clouds, 
which differ in heavy-element abundances from those in the Milky Way by 
up to a factor of ten, reveal identical high-mass IMFs (Massey et al. 1995), 
and so this generic cluster IMF does not seem to vary with metallicity. Since 
the lifetimes of massive stars are very much less than the timescales on which 
open clusters dissolve, we do not expect many massive stars to escape into 
the field. Studies of the IMF of field stars do, indeed, show that it has a 
much steeper power law, $\alpha \sim 4.5$ [see equation (5.17)]: there are many fewer 
very massive stars in the field than there are in open clusters. 
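
The practical difference between the two slopes can be seen with a short calculation. The sketch below assumes a pure power law, ${\rm d}N/{\rm d}M \propto M^{-\alpha}$, between $7\,M_\odot$ and an upper cutoff of $120\,M_\odot$; the cutoff and the comparison mass of $20\,M_\odot$ are illustrative assumptions, not values taken from the studies cited.

```python
def number_fraction_above(m, alpha, m_min=7.0, m_max=120.0):
    """Fraction of stars in [m_min, m_max] more massive than m.

    Assumes dN/dM proportional to M**(-alpha) with alpha != 1; the mass
    limits (in solar masses) are illustrative defaults.
    """
    if alpha == 1.0:
        raise ValueError("alpha = 1 needs the logarithmic form")
    if not m_min <= m <= m_max:
        raise ValueError("m must lie between m_min and m_max")
    p = 1.0 - alpha
    return (m_max**p - m**p) / (m_max**p - m_min**p)


for label, alpha in (("OB associations", 2.1), ("field", 4.5)):
    frac = number_fraction_above(20.0, alpha)
    print(f"{label:16s} alpha = {alpha}: fraction above 20 Msun = {frac:.3f}")
```

With these assumed limits, the shallower cluster slope places roughly ten times as large a fraction of its massive stars above $20\,M_\odot$ as the steeper field slope does.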

Studies of the lower mass parts of the IMFs of open clusters are not 
easy. The already-faint low-mass stars are obscured by dust within the disk, 
and the high density of field stars can seriously contaminate the luminosity 
function. These difficulties can be minimized by looking at nearby clusters, 
for which obscuration will be smallest, and whose members can be selected 
on the basis of their large proper motions. However, nearby open clusters 
subtend large angles on the sky (see Table 6.2). Since luminosity segrega- 
tion causes low-luminosity stars to be found preferentially at large radii (see 
§6.2.3), observations must be made across the entire diameter of the cluster 
in order to obtain an unbiased sample of stars. For example, a study of 
the Pleiades by Meusinger, Schilbach & Souchay (1996) required plates that 
covered 16.5 square degrees and involved measuring proper motions for some 
49000 stars. Of these stars, only 628 turned out to be cluster members. The 
conclusion of this painstaking analysis, presented in Figure 6.31, is that the 
IMF of the Pleiades is indistinguishable from that of the field. 

The comparison between the field and cluster IMFs is somewhat uncertain 
because the present-day distribution of stellar luminosities in the 
field reflects the star formation over an extended period: stars with masses 
$\lesssim 0.8\,M_\odot$ have lifetimes longer than the age of the Galactic disk and so could have 
formed at any time during its existence. Since we have little idea of how 
the star formation rate has changed over this period, there is a significant 
uncertainty here: the plethora of low mass stars in the Galaxy could have 
formed recently, in which case the IMF is very steep, or they could have 
formed much earlier, in which case fewer such stars are forming now and the 
corresponding IMF must be much flatter. This uncertainty notwithstanding, 
there is a respectable agreement between the lower-mass IMF of young open 
clusters like the Pleiades and that inferred for the field. 

Using HST, it has become possible to probe the luminosity functions 
of open clusters to even fainter limits. Figure 6.16 shows the luminosity 
functions of the open clusters NGC 2420 and NGC 2477, both of which are 
fairly old ($\sim 3$ Gyr and $\sim 1$ Gyr respectively). These clusters show evidence 
for a turn-over in the luminosity function at faint magnitudes, similar to 
that observed in globular clusters and in the solar neighborhood. However, 
as we discussed in §6.1.7, this feature reflects the rapid variation in stellar 
luminosity with mass for such faint stars rather than a turn-over in the mass 
function itself. It is notable that the location of the peak in $\Phi(M)$ varies from 
system to system in this plot. von Hippel et al. (1996) pointed out that these 
systems have very different metallicities, so the conversion from luminosity 
function to mass function will be different in each case. Intriguingly, the 
peaks in $\Phi(M)$ all occur at stellar luminosities that correspond to masses 
of $\sim 0.27\,M_\odot$, with the shifts between systems entirely explicable by the 
differences in metallicity. 

The agreement between field and open-cluster IMFs at masses below 
$\sim 3\,M_\odot$ adds weight to the hypothesis that many, if not all, of the lower 
mass field stars were born in clusters and associations. Further support for 
this idea has come from studies of the luminosity functions of old clusters. 
As we have seen in §6.2.3, there is evidence that mass segregation occurs as 
an open cluster grows older, with the least massive stars ending up in an 
extended halo. These low-mass stars are the most likely to achieve escape 
velocity, so they should be the first cluster stars to join the field population. 
We would therefore expect the luminosity functions of old clusters to be 
depleted at the low-luminosity end. Such depletion is, indeed, observed, with 
the luminosity functions of many old clusters turning over and decreasing at 
faint magnitudes. The turn-over occurs at $\sim 1\,M_\odot$, well above the $\sim 0.2\,M_\odot$ 
where it might be attributed to the effects of a steep mass-luminosity relation 
(as discussed above), so it must represent a real deficit of low-mass stars in 
these systems. This phenomenon, which has been found to be widespread 
amongst old open clusters, is reviewed by Friel (1995). She also points to the 
extreme case of NGC 3680: a study of this old open cluster by Nordström, 
Andersen & Andersen (1996) found that only 13 of the 120 identified cluster 
members are single MS stars, with the remainder all more massive evolved 
objects. Apparently, while the higher-mass MS cluster stars have left the MS 
due to stellar evolution, the lower-mass MS stars have evaporated from the 
cluster to join the field. These two processes have almost met in the middle 
of NGC 3680’s MS, so this cluster is in the final stages of dissolution. 


Problems 


6.1 The color of a HB star can depend quite sensitively on the amount of 
mass $\Delta M$ it has lost. Given that the typical dispersion in mass lost is only 
$\sim 0.02\,M_\odot$, estimate the dependence, $\partial(B - V)/\partial(\Delta M/M_\odot)$, for M3 (see 
Figure 6.2). 


6.2 Use Figure 6.6 to estimate how rapidly the turn-off point of an evolving 
globular cluster shifts in $B - V$ with time. Hence, show that typical residual 
color errors in isochrone fits can produce age estimates that are systematically 
wrong by $\sim 2$ Gyr. 


6.3 By how much would the adopted value of [Fe/H] have to be wrong to 
produce a 10% error in the age of a globular cluster as estimated using the 
$\Delta V$ method, assuming that equation (6.3) is correct? 


6.4 Using Figures 6.14 and 6.15 and Table 6.1, estimate the variation in 
metallicity with IMF power-law index, ${\rm d[Fe/H]}/{\rm d}\alpha$, in globular clusters. Assume 
that high-mass stars ($M > 8\,M_\odot$) drawn from the appropriate IMF 
formed first in these systems, and all the material produced in the supernovae 
of such stars is recycled at solar metallicity into the lower mass stars 
($0.08\,M_\odot < M < 8\,M_\odot$), modifying the metallicity that we see today. Show that 
such recycling could go a long way toward explaining the observed variation 
of [Fe/H] with $\alpha$. Why is such efficient recycling difficult to envisage for 
globular clusters? 


6.5 Consider a globular cluster populated with stars drawn from an IMF 
of the Salpeter form [equation (5.14)] with an appropriate power law index. 
Allowing for the evolution that a globular cluster ~ 15 Gyr old will have 
undergone, show that ~ 10% of the original stellar population will be white 
dwarfs. 


6.6 Use BT Figures 4-9 and 4-10 to show how the core radius, $r_c$, compares 
to the King radius, $r_0$, for King models with different concentrations $c = 
\log(r_t/r_c)$. 


6.7 If $n$ points are drawn from a Gaussian distribution of dispersion $\sigma$, then 
the standard deviation of the sample, $S$, will not be exactly equal to $\sigma$. In 
fact, it will be distributed such that $nS^2/\sigma^2$ follows a $\chi^2$ distribution with 
$n$ degrees of freedom. Show that the velocity dispersion as inferred from 
3 globular cluster stars drawn from a Gaussian velocity distribution with 
$\sigma = 15$ km s$^{-1}$ will lie between 5 km s$^{-1}$ and 25 km s$^{-1}$ 90% of the time. How 
many stars would one have to observe to determine $\sigma$ to within 10% at 90% 
confidence? 


6.8 Assuming that the drop in the number of open clusters with age in 
Figure 6.29 arises from clusters dispersing into the field, and that each cluster 
initially contained ~ 10° stars, estimate the total number of stars per square 
parsec that these clusters have contributed to the field. If these stars end up 
distributed in a layer ~ 100 pc thick (comparable to the typical distances of 
open clusters from the Galactic plane), estimate the number density of these 
stars in the solar neighborhood. What fraction of nearby stars were born in 
such clusters? Why might this calculation significantly under-represent the 
total contribution made by clusters and associations to the field? 


7 


The Cosmic Distance Scale 


For several centuries, telescopes have provided us with images of the cosmos 
showing a wealth of patterns and structures. However, the absence of depth 
information in these two-dimensional images has proved a major limitation 
on our ability to understand these structures. As we saw in Chapter 1, the 
argument over the nature of “spiral nebulae” was only finally resolved when 
the large distances to some of these systems were measured. Similarly, the 
discovery of a class of faint radio-emitting star-like objects only assumed its 
true significance when it was realized that these quasars lie at vast distances 
from us (see §4.6.2). If we are to understand the physics behind an astro- 
nomical phenomenon, we need to quantify the basic properties of the system 
under observation, and many of these properties cannot be determined unless 
we know the distance to the object in question. 

Although we are not able to determine many of the absolute properties 
of astronomical phenomena without distance information, we can compare 
their relative properties. For example, the stars in any galaxy outside 
our own are all at essentially the same distance from us. Thus, if observa- 
tions of a nearby galaxy reveal that all the stars of a particular type have 
the same apparent brightness, then we know that this class of objects all 
have the same intrinsic luminosity. Furthermore, we can use this property to 
measure the relative distances to other galaxies: if we can identify the same 
type of star in these galaxies and measure their apparent brightnesses, then 
the ratio of distances follows directly from the ratio of apparent fluxes. In 
terms of the apparent magnitudes, $m_1$ and $m_2$, of the sources in two different 
galaxies, the distance ratio to the two galaxies, $D_1/D_2$, is given by 


$$\log_{10}(D_1/D_2) = \frac{1}{5}\left[(m_1 - m_2) - (A_1 - A_2) - (K_1 - K_2)\right], \tag{7.1}$$


where $A_1 - A_2$ corrects for possibly differing amounts of absorption towards 
the two sources, and $K_1 - K_2$ allows for possible differences in their Doppler 
shifts (see §2.3.3). Sources with identical intrinsic luminosities which can be 
used in this way are known as standard candles. Similarly, if a certain class 
of objects all have the same intrinsic size, then we can estimate the relative 
distances to objects containing these standard rulers by comparing their 
apparent angular sizes. 
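
The standard-candle argument amounts to a one-line calculation. The sketch below assumes the usual magnitude relation, in which a difference of five magnitudes corresponds to a factor of ten in distance, with the absorption and $K$-correction terms entering as described above; the function name and the example magnitudes are illustrative only.

```python
def distance_ratio(m1, m2, a1=0.0, a2=0.0, k1=0.0, k2=0.0):
    """Ratio D1/D2 for two standard candles of equal intrinsic luminosity.

    m1, m2: apparent magnitudes; a1, a2: absorptions in magnitudes;
    k1, k2: K-corrections in magnitudes. Assumes
    5 log10(D1/D2) = (m1 - m2) - (a1 - a2) - (k1 - k2).
    """
    delta = (m1 - m2) - (a1 - a2) - (k1 - k2)
    return 10.0 ** (0.2 * delta)


# Illustrative case: identical candles observed five magnitudes apart.
print(distance_ratio(20.0, 15.0))
# The same pair if the fainter source suffers 0.5 mag more absorption.
print(distance_ratio(20.0, 15.0, a1=0.5))
```

Neglecting half a magnitude of extra absorption toward the fainter source would overestimate its relative distance by about a quarter, which is why the correction terms matter.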

The classical procedure for estimating distances in the Universe typically 
uses a series of such relative distance estimators: the distance to a nearby 
galaxy might be determined by comparing the apparent brightnesses of its 
individual stars to those of similar stars in the Milky Way; the distance to 
a more remote galaxy whose individual stars cannot be resolved can then 
be measured by comparing the properties of the galaxy as a whole to the 
properties of the nearby galaxies, and so on. This bootstrapping approach to 
measuring distances has been dubbed the cosmic distance ladder, where 
each rung up the ladder takes us to greater distances. Notice, however, that 
this sequence of relative distance estimates must be calibrated by measuring 
at least one absolute distance using an absolute distance estimator. This 
absolute measurement provides the reference point that allows the series of 
relative measurements to be converted into absolute distances. 

Great efforts have been expended in making the wide range of obser- 
vations necessary to calibrate the distance ladder. Unfortunately, there are 
discrepancies in the derived distances: detailed reviews by Jacoby et al. 
(1992), van den Bergh (1992), Sandage (1993b) and Fukugita, Hogan & 
Peebles (1993) show that absolute distance determinations can still have an 
uncertainty of up to a factor of two. In this chapter, we discuss the various 
techniques and observations that have been used to define the cosmic distance 
scale. These methods combine the absolute distance measurements that are 
required to tie down the overall calibration of the distance scale (§7.2), and 
the relative distance measurements that provide most of the rungs from 
which the cosmic distance ladder is made up (§7.3). In §7.4, we show how 
the methods can be combined to obtain a reasonably coherent extragalactic 
distance scale, and where discrepancies between different analyses remain. 

The cosmic distance ladder allows us to measure distances out to a 
significant fraction of the size of the observable Universe. In order to make 
sense of observations that probe these vast scales, we must first present the 
cosmological framework within which such data can be interpreted. 


7.1 An introduction to cosmology 


One crucial cosmological result, which was originally obtained without good 
absolute distance calibration, is the Hubble law. Following on from the dis- 
covery that the characteristic lines in the spectra of almost all galaxies are 
shifted systematically toward the red (Slipher 1914), Hubble (1929) investigated 
the relationship between the distance, $D$, to a galaxy and the amount 
$\Delta\lambda$ that a line of intrinsic wavelength $\lambda$ is shifted in its spectrum. He found 
that these two quantities are proportional to one another: 


$$D = \frac{c}{H_0}\,z, \tag{7.2}$$


where $c$ is the speed of light and $z \equiv \Delta\lambda/\lambda$ is the redshift of the galaxy (§2.2.1). 
Without good absolute distance measurements, the Hubble constant, $H_0$, 
is an ill-determined constant of proportionality. As we shall see below, the 
Hubble constant is believed to have a value of somewhere between 50 and 
100 km s$^{-1}$ Mpc$^{-1}$, and so it is frequently written in the form 


$$H_0 = 100\,h\ {\rm km\,s^{-1}\,Mpc^{-1}}, \tag{7.3}$$


where the dimensionless quantity $h$ parameterizes our ignorance of the Hubble 
constant, and probably has a true value somewhere between 0.5 and 
1.0.$^1$ 

The simplest physical interpretation of the Hubble law is that the entire 
Universe is expanding, and the recession of the galaxies gives rise to their 
redshifts via the usual Doppler formula $v = cz$ ($v \ll c$). Running the clock 
backwards, we see that one Hubble time, 


$$t_{\rm H} = D/v = H_0^{-1}, \tag{7.4}$$


ago all the galaxies must have started from the same place. Thus, the discov- 
ery of the Hubble law revolutionized cosmology by introducing a Big Bang 
at a finite time in the past when the entire Universe was concentrated in 
a tiny volume. The fact that we can estimate the age of the Universe it- 
self by determining the Hubble constant provides a powerful motivation for 
obtaining a reliable absolute calibration for the extragalactic distance scale. 
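
To attach numbers to the Hubble time, the sketch below converts $H_0 = 100\,h$ km s$^{-1}$ Mpc$^{-1}$ into SI units and inverts it. The conversion constants (the length of a megaparsec and of a Julian year) are standard values supplied here; the function names are illustrative.

```python
KM_PER_MPC = 3.0856775814913673e19  # kilometres in one megaparsec
SECONDS_PER_GYR = 3.15576e16        # 1e9 Julian years in seconds


def hubble_constant_si(h):
    """H0 in s^-1 for H0 = 100 h km/s/Mpc."""
    return 100.0 * h / KM_PER_MPC


def hubble_time_gyr(h):
    """Hubble time 1/H0 in Gyr."""
    return 1.0 / hubble_constant_si(h) / SECONDS_PER_GYR


for h in (0.5, 0.75, 1.0):
    print(f"h = {h:4.2f}: H0 = {hubble_constant_si(h):.2e} s^-1, "
          f"t_H = {hubble_time_gyr(h):5.1f} Gyr")
```

The Hubble time is roughly $9.8\,h^{-1}$ Gyr, so the quoted range of $h$ between 0.5 and 1.0 corresponds to about 20 Gyr down to about 10 Gyr.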

The geometry of a general-relativistic expanding Universe is intrinsically 
different from Euclidean space. We therefore have to be a little careful to 
state our assumptions about the large-scale geometry of the Universe, and 
even exactly what we mean when we talk about the “distance” to an object. 
A complete discussion of the geometry of the Universe lies beyond the scope 
of this book; here we simply sketch out the results that we will need to 
interpret cosmological observations. More detailed treatments can be found 
in Weinberg (1972), Peebles (1993), or Longair (1992). 


$^1$ For SI purists, $H_0 = 3.2 \times 10^{-18}\,h$ s$^{-1}$ $= 3.2\,h$ aHz. 

The solutions to Einstein’s general relativistic field equations depend 
on the distribution of mass in the system under consideration. Clearly, at- 
tempting to solve these equations for the entire Universe is a formidable 
undertaking. Light emitted in the Universe is strongly inhomogeneous, in 
that it mostly comes from the small regions that we call galaxies. Galaxies, 
in turn, are arranged into clusters and still larger structures. If this distri- 
bution of light-emitting material reflects the distribution of mass, then the 
detailed geometry of the Universe is extremely complex. However, on the 
largest scales the Universe does appear to be reasonably homogeneous. We 
might therefore hope to be able to describe its gross properties by solving 
Einstein’s equations assuming a homogeneous mass distribution. Since there 
are no observations that seem to pick out preferred directions in the Universe 
on these largest scales, we might also assume that the Universe is expand- 
ing isotropically. Under these assumptions, the geometry of the Universe is 
greatly simplified, and can be described by the Robertson-Walker met- 
ric. The simplest solutions to the Einstein equations for such a geometry 
are known as Friedmann cosmological models. 

These simple homogeneous cosmological models are fully defined by 
specifying the current expansion rate of the Universe (as quantified by the 
Hubble constant) and the current mass density of the Universe, $\rho_0$. The future 
fate of the Universe hangs upon the value of $\rho_0$. The simplest solutions 
to the field equations$^2$ imply that if $\rho_0$ exceeds the critical density, 


$$\rho_c \equiv \frac{3H_0^2}{8\pi G} = 1.88 \times 10^{-26}\,h^2\ {\rm kg\,m^{-3}}, \tag{7.5}$$


then there is sufficient mass in the Universe for its gravitational influence to 
ultimately halt the expansion, and for the Universe to collapse back towards 
a final “Big Crunch.” Such a system is called a closed or bound model. 
If, on the other hand, $\rho_0 < \rho_c$, there is insufficient mass to ever halt the 
expansion, and we refer to such an eternally-expanding system as an open 
or unbound model. If $\rho_0$ is exactly equal to $\rho_c$, the Universe will just 
expand forever, but its expansion velocity will tend asymptotically toward 
zero; such a system is termed a critical or Einstein-de Sitter model. 
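
The numerical value in equation (7.5) is easy to check. The sketch below evaluates $3H_0^2/(8\pi G)$ in SI units and applies the closed/open/critical classification described above; the value adopted for $G$, the function names and the trial densities are assumptions made for illustration.

```python
import math

G_SI = 6.674e-11                   # gravitational constant, m^3 kg^-1 s^-2
M_PER_MPC = 3.0856775814913673e22  # metres in one megaparsec


def critical_density(h):
    """Critical density 3 H0^2 / (8 pi G) in kg m^-3 for H0 = 100 h km/s/Mpc."""
    h0_si = 100.0 * h * 1.0e3 / M_PER_MPC  # H0 in s^-1
    return 3.0 * h0_si**2 / (8.0 * math.pi * G_SI)


def classify(rho_0, h, rel_tol=1e-9):
    """Label a pressureless Friedmann model by its present density in kg m^-3."""
    rho_c = critical_density(h)
    if math.isclose(rho_0, rho_c, rel_tol=rel_tol):
        return "critical"
    return "closed" if rho_0 > rho_c else "open"


print(f"rho_c (h = 1) = {critical_density(1.0):.3e} kg m^-3")
print(classify(1.0e-27, 1.0))  # an assumed trial density below critical
```

Because $\rho_c$ scales as $h^2$, the uncertainty of a factor of two in $H_0$ translates into a factor of four in the density needed to close the Universe.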

There are two dimensionless forms in which the present density of the 
Universe, and hence its ultimate fate, are often cast. The cosmic density 
parameter, 


$$\Omega_0 \equiv \frac{\rho_0}{\rho_c} = \frac{8\pi G \rho_0}{3H_0^2}, \tag{7.6}$$


simply measures the density in units of the critical value. Thus, the fate 
of the Universe depends on whether or not $\Omega_0$ exceeds unity. The second, 
slightly more abstract, way in which the density of the Universe can be quantified 
is the cosmic deceleration parameter, $q_0$. This parameter specifies 
in dimensionless form the rate at which the expansion of the Universe is currently 
decelerating. Clearly, this quantity is closely tied to the value of the 
mass density responsible for this retardation. In fact, for the Friedmann cosmological 
models considered here, the relation takes the exceptionally simple 
form $q_0 = \Omega_0/2$, and so the critical value for the deceleration parameter is 
$q_0 = 0.5$. 


$^2$ These simple models assume that there is no pressure helping to support the Universe 
against collapse. Cosmologists sometimes refer to them as “dust” models to make the point 
that both gas and radiation produce pressure, so a Universe containing anything besides 
pressureless dust is not strictly represented by such a model. 

Once a particular cosmological model has been adopted by specifying 
values for $H_0$ and $q_0$, we can define what we mean by “distance.” If, for 
example, we define the distance to an object of intrinsic luminosity $L$ as the 
quantity $D_L$ such that the observed flux from the source is $f = L/(4\pi D_L^2)$, 
then in the standard Robertson-Walker metric this luminosity distance 
for a source at redshift $z$ is given by 


$$D_L = \frac{cz}{H_0}\left[1 + \frac{z(1 - q_0)}{(1 + 2q_0 z)^{1/2} + 1 + q_0 z}\right]. \tag{7.7}$$


Note that the luminosity referred to here is the bolometric value. Unfortunately, 
what we actually measure when we observe some standard candle at 
high redshift is its flux within a fixed bandpass. There is a serious danger 
that uncertainties in the K-correction needed to compensate for the object’s 
redshift (§2.3.3) will render its luminosity distance unmeasurable. 

Alternatively, we can define the angular diameter distance, $D_A$, such 
that a source of intrinsic size $d$ subtends an angle $\delta\theta = \tan^{-1}(d/D_A) \simeq d/D_A$ 
($D_A \gg d$). We then obtain a different distance measure, 


$$D_L = (1 + z)^2 D_A. \tag{7.8}$$


One counter-intuitive implication of equation (7.8) is that $D_A$ does not vary 
monotonically with $z$: if we imagine moving an object to progressively higher 
redshifts, its angular size at first decreases as we would expect, but ultimately 
it starts to increase with redshift! This magnification at high redshift occurs 
because light travelling from the source to us follows a bent path in the 
non-Euclidean geometry of the Universe, distorting the image of the source; it 
is, in fact, a grand-scale example of gravitational lensing (Appendix A). The 
departures in the various distance measures from the simple linear relation 
between $D$ and $z$ of equation (7.2) mean that we can use the observed relation 
between redshift and distance as estimated by apparent luminosity or angular 
size to measure $q_0$ (see §7.4.4). In principle, we can therefore use standard 
candles or rulers to derive the large-scale mass density of the Universe. Note, 
however, that equations (7.7) and (7.8) are only strictly valid for the Friedmann 
models in which mass density is distributed uniformly through space. 
In reality, we know that at least some fraction of the material in the Universe 
is concentrated into galaxies and other discrete structures. As we will see 
in §7.2.4, these discrete mass concentrations also bend the path travelled by 
light, distorting the brightnesses and sizes of background objects. If a significant 
fraction of the mass of the Universe is non-uniformly distributed, then 
the distorting effects of discrete objects may dominate the departures from 
the simple Hubble law caused by the large-scale geometry of the Universe, 
complicating the calculation of the total mass density. 
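
The turnover in angular size can be checked numerically. The sketch below combines equations (7.7) and (7.8) in units of $c/H_0$ and searches a redshift grid for the maximum of the angular diameter distance; the function names and the grid settings are assumptions of this example. For the critical model, $q_0 = 0.5$, the maximum can also be found analytically and lies at $z = 1.25$.

```python
import math


def luminosity_distance(z, q0=0.5):
    """Equation (7.7) expressed in units of c/H0."""
    root = math.sqrt(1.0 + 2.0 * q0 * z)
    return z * (1.0 + z * (1.0 - q0) / (root + 1.0 + q0 * z))


def angular_diameter_distance(z, q0=0.5):
    """Equation (7.8) rearranged: D_A = D_L / (1 + z)^2, in units of c/H0."""
    return luminosity_distance(z, q0) / (1.0 + z) ** 2


def turnover_redshift(q0=0.5, z_max=10.0, steps=100_000):
    """Grid redshift at which D_A is largest (angular size is smallest)."""
    grid = [z_max * k / steps for k in range(1, steps + 1)]
    return max(grid, key=lambda z: angular_diameter_distance(z, q0))
```

Beyond the turnover redshift a standard ruler subtends a larger angle the further away it is placed, which is the behavior described in the text.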


7.2 Absolute distance estimators 


We have already discussed absolute distance measurements for nearby stars 
using simple geometric techniques: the available methods include trigonometric 
parallax (§2.1.3), the moving cluster method (§2.2.2), secular parallax 
(§2.2.3) and statistical parallax (§2.2.4). The distances to these objects 
provide an absolute calibration for the relative distance estimators in the cosmic 
distance ladder, but the number of steps required to get from these local 
stellar measurements to extragalactic distances is undesirably large. In this 
section, we present a number of methods for calculating absolute distances 
which can be applied to objects at much greater distances. The successful 
application of these methods would allow us to tie down the absolute cali- 
bration of extragalactic distances on a variety of scales. In principle, these 
measurements offer a much more robust framework of distance determina- 
tion than is available when the cosmic distance ladder is anchored solely by 
absolute distances to nearby stars. 

As we will see, the major drawback to these absolute methods is that 
they suffer from a variety of potentially large systematic errors. If the dis- 
tance scale calibration is to be effective, we must characterize these system- 
atic effects as far as possible, and compare the results from all the different 
methods (which have different possible sources of systematic error) to check 
for consistency. 


7.2.1 The Baade—Wesselink method 


If we know a star’s radius, R, then it is straightforward to estimate its 
distance from us. From its optical color or spectral shape, we can measure 
the effective temperature of the star (§3.4); its intrinsic luminosity is then 
given by 

$$L = 4\pi R^2 \sigma T_{\rm eff}^4 \qquad (7.9)$$

[see equation (3.15)], or in terms of the star’s bolometric absolute magnitude, 

$$M_{\rm bol} = -10 \log T_{\rm eff} - 5 \log R + C, \qquad (7.10)$$


where C is a known constant. By comparing the apparent magnitude of the 
star to its absolute magnitude, we have a direct measurement of the star’s 
distance from us (§2.3.3). Unfortunately, as we saw in §3.2, it is not usually 
possible to measure stellar radii directly, and those measurements which have 
been made give the radii of nearby stars, which are not much help in setting 
the cosmic distance scale.³ 
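
To make the chain from radius to distance concrete, here is a minimal sketch of equation (7.10) followed by the distance-modulus step. It assumes that the constant $C$ is fixed by calibrating against the Sun, using the nominal solar values $M_{\rm bol,\odot} = 4.74$ and $T_{\rm eff,\odot} = 5772$ K, and that extinction has already been removed; the function names are illustrative.

```python
import math

M_BOL_SUN = 4.74     # assumed calibration: nominal solar bolometric magnitude
T_EFF_SUN = 5772.0   # assumed calibration: nominal solar effective temperature (K)


def absolute_bolometric_magnitude(radius_rsun, t_eff):
    """Equation (7.10), with the constant C fixed by the solar values above.

    radius_rsun is the stellar radius in solar radii, t_eff is in kelvin.
    """
    return (M_BOL_SUN
            - 5.0 * math.log10(radius_rsun)
            - 10.0 * math.log10(t_eff / T_EFF_SUN))


def distance_pc(m_bol, abs_m_bol):
    """Distance in parsecs from the distance modulus m - M = 5 log10(D / 10 pc)."""
    return 10.0 ** ((m_bol - abs_m_bol + 5.0) / 5.0)
```

For example, `distance_pc(m, absolute_bolometric_magnitude(R, T))` turns a measured radius, temperature and extinction-corrected apparent bolometric magnitude into a distance.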

This problem can be overcome through studies of variable stars such 
as Cepheids and RR Lyrae stars (§5.1.10). Lines in the spectrum of such a 
variable star show Doppler shifts that vary cyclically with the same period 
as the star’s brightness variations. As we have seen in §5.1.10, both the 
brightness and velocity variations arise because the atmosphere of the star is 
unstable and undergoes global expansion and contraction, changing the size 
of the star. The amount by which the size of the star changes between times 
$t_0$ and $t_1$ can be measured by simply integrating the line-of-sight velocity of 
the atmosphere (as measured from Doppler shifts in its spectra) over this 
time: 

$$\Delta r_1 = -p \int_{t_0}^{t_1} v_{\rm los}(t)\,{\rm d}t. \qquad (7.11)$$

The negative sign occurs because $\Delta r_1$ and $v_{\rm los}$ have opposite signs: an 
expanding atmosphere will blueshift the lines in the spectra, resulting in a 
negative value for $v_{\rm los}$. The factor of $p$ arises because the expansion of the 
star’s atmosphere does not all occur along the line of sight: at the outer 
limb of the star, for example, the expansion velocity is transverse to the line 
of sight. The observed mean line-of-sight velocity is therefore less than the 
true expansion velocity of the atmosphere (see Problem 7.2). 

Thus, if the star starts out with (unknown) radius $r_0$ at time $t_0$, then by 
time $t_1$ its size will have changed by a directly measurable amount, and its 
radius will now be $r_0 + \Delta r_1$. From equations (7.10) and (2.36), the change 
in the apparent magnitude of the source over this time will be 

$$m_{{\rm bol},1} - m_{{\rm bol},0} = M_{{\rm bol},1} - M_{{\rm bol},0} = -5\left[\log(r_0 + \Delta r_1) - \log r_0\right] - 10\left[\log T_{{\rm eff},1} - \log T_{{\rm eff},0}\right]. \qquad (7.12)$$

If we can measure the effective temperature of the source at times $t_0$ and $t_1$ 
as well as the change in its apparent bolometric magnitude over this interval, 
then we can solve for $r_0$, the only unknown in equation (7.12). A particularly 
simple solution can be obtained if we select the time $t_1$ for our second 
observation to be a moment at which the star’s effective temperature is the 
same as it was at time $t_0$. In this case, we find 

$$r_0 = \frac{H\,\Delta r_1}{1 - H}, \qquad \hbox{where} \quad H = 10^{(m_{{\rm bol},1} - m_{{\rm bol},0})/5}. \qquad (7.13)$$


3 One method for measuring stellar radii which can, in principle, be extended to large 
distances comes from observations of eclipsing binaries (see §3.2.5). Such systems are 
rare and rather faint when they lie at interestingly-large distances, but the microlensing 
studies monitoring large numbers of stars in nearby galaxies (§2.4) are now turning up 
these objects in usable quantities. 


7.2 Absolute methods 401 


A method for obtaining the distance to a pulsating star is now clear. 
By obtaining spectra throughout its period of oscillation and measuring the 
Doppler shifts in the spectral lines, we can derive $v_{\rm los}(t)$. By also making 
a series of measurements of the apparent magnitude of the star in several 
bandpasses, we can estimate the variation in apparent bolometric magnitude 
with time (§2.3.4) once we have corrected the observations for the effects of 
extinction (§3.7.1). For a star of a particular type (dwarf, giant, etc.), effective 
temperature varies monotonically with stellar color (see, for example, 
Table 3.6). Thus, by selecting data from two parts of the oscillation cycle at 
which the star has identical colors, we can pick out pairs of observations with 
the same effective temperature (without having to know anything about the 
details of the relationship between color and effective temperature). Calculating 
values for $\Delta r$ for each such pair using equation (7.11), and substituting 
these values together with the bolometric magnitudes into equation (7.13) 
allows us to obtain a series of estimates for $r_0$. Combining the best estimate 
for $r_0$ with equation (7.11) then tells us the radius of the star at any point 
during its oscillation cycle. Since we now know the radius of the star, it is 
straightforward to use the method outlined at the beginning of this section 
to calculate its distance from us. This approach, suggested by Baade (1926), 
was developed in the form described above by Wesselink (1946). All similar 
techniques for measuring distances based on variations in radial velocity and 
apparent magnitude are known generically as Baade–Wesselink methods; 
a review of this approach to measuring distances can be found in Gautschy 
(1987). 
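
The two computational steps of the classical method, equations (7.11) and (7.13), can be sketched as follows. The trapezoidal integration, the function names and the default projection factor `p = 1.36` are assumptions made for illustration (the appropriate value of $p$ depends on the star and on how the line velocities are measured), and the two epochs are assumed to have been chosen with identical colors.

```python
def radius_change(times, v_los, p=1.36):
    """Equation (7.11): integrate the line-of-sight velocity (trapezoidal rule).

    times and v_los are equal-length sequences in consistent units
    (e.g. seconds and km/s, giving a radius change in km).
    p is the projection factor; 1.36 is an assumed, illustrative value.
    """
    total = 0.0
    for k in range(1, len(times)):
        total += 0.5 * (v_los[k] + v_los[k - 1]) * (times[k] - times[k - 1])
    return -p * total


def initial_radius(delta_r, m_bol_0, m_bol_1):
    """Equation (7.13): radius at t_0 from one equal-temperature pair.

    Only valid when the effective temperature is the same at both epochs.
    """
    h = 10.0 ** ((m_bol_1 - m_bol_0) / 5.0)
    return h * delta_r / (1.0 - h)
```

Each equal-color pair of epochs yields one estimate of $r_0$; how these estimates should be weighted and combined is a separate question that this sketch does not address.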

The classical Baade–Wesselink method described above has a number 
of shortcomings. First, real data will have observational errors associated 
with them, and it is not clear how the different estimates for $r_0$ obtained 
from different pairs of observations should be weighted when combining them 
to derive an overall “best” estimate (Balona 1977). More fundamentally, 
some of the assumptions that went into deriving the method may not hold. 
Specifically: 

(1) We have assumed that the velocity derived from spectral line shifts can 
be used to characterize the overall expansion of the star. A real stellar 
atmosphere is not a single opaque surface, and since its opacity varies 
with wavelength, the depth into the atmosphere from which we observe 
most of the continuum emission may be different from the depth at 
which the spectral lines are formed. A non-uniform expansion in the 
atmosphere would mean that these two regions could be moving with 
different radial velocities. Since the structure of the atmosphere changes 
over the star’s oscillation cycle, it is unlikely that there is a simple 
relation between these two velocities. 

(2) The stellar oscillations may not be purely radial. Many types of variable 
star are believed to undergo oscillations which involve their shapes 
departing from spherical symmetry. The application of the Baade–Wesselink 
method to such stars is complicated because they can no 
longer be described by a single radius and a single radial expansion velocity 
at any given time. The Baade–Wesselink method can only be 
extended to allow for these non-radial effects if the shape of the stellar 
oscillation is known (Balona & Stobie 1979). 

(3) Finally, we have assumed that stellar color is a reliable measure of effec- 
tive temperature. In fact, observations of variable stars using a range of 
optical filters show that color indices centered on different wavelengths 
(e.g. U − B, V − R) do not all vary in the same way with time. Hence 
there is clearly not a universal one-to-one relationship between all these 
optical colors and effective temperature, and it is not obvious which 
measure of color best specifies the effective temperature. Given the 
non-standard colors of these variable stars, it is also unclear how one 
converts the apparent magnitudes in different wavebands into the req- 
uisite bolometric magnitude. 

Although these problems complicate the Baade—Wesselink analysis, they 
are not insurmountable. Recent applications of the method have concen- 
trated on modeling the atmospheres of pulsating stars in some detail. From 
these models, we can predict the observable photometric and spectral proper- 
ties of the star in order to quantify the systematic errors that are introduced 
into the Baade-Wesselink method by the detailed physics of variable stars. 
Ultimately, directly fitting fully time-dependent models of variable star at- 
mospheres to the observed spectra and photometry of these systems offers 
the most complete approach to calculating distances to variable stars. How- 
ever, the physical reason why such methods work is captured in the classical 
Baade—Wesselink analysis: Doppler velocities allow us to measure a change 
in the linear size of the source, and measurements of colors and apparent 
magnitude provide us with a measure of the fractional change in its size; 
putting these two constraints together yields the linear size of the source, 
and hence its absolute luminosity and distance. 


Application to supernovae Thus far, we have discussed the Baade- 
Wesselink method in terms of its application to variable stars. The same 
technique can be applied to any radiating system whose size is changing 
with time, and supernovae are obvious candidates for such studies. These 
exploding stars (§5.2.3) are enormously bright and so can be observed out 
to very large distances; obtaining a reliable measurement of their distances 
would thus provide a giant leap toward establishing the extragalactic distance 
scale. 

The colors of a supernova vary monotonically as it expands and cools, 
and so we cannot employ the Wesselink (1946) trick of selecting pairs of 
observations at different times which have the same colors. Instead, we must 
assume a particular relation between color and effective temperature in order 
to use pairs of observations to solve equation (7.12) for $r_0$ and hence the 
distance to the supernova. We might, for example, assume that the expanding 
supernova radiates as a black body, and hence relate measurements of its 
color directly to its temperature. By adopting this simple model, Branch 
& Patchett (1973) and Kirshner & Kwan (1974) made the first direct 
estimates for the distances to reasonably remote galaxies. Subsequent analyses 
used more complicated models for the expanding supernova atmospheres, 
and also started to include some of the physics of the supernova explosion 
via hydrodynamical models (Schurmann, Arnett & Falk 1979), and detailed 
model-atmosphere calculations (Schmidt, Kirshner & Eastman 1992). 
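
When the two epochs have different temperatures, equation (7.12) can still be inverted for $r_0$ once temperatures have been assigned. The sketch below does this algebra; it assumes that the effective temperatures have already been inferred from the colors through some adopted relation (a black body, for instance), which is exactly the model-dependent step described above. The function name is illustrative.

```python
import math


def initial_radius_general(delta_r, m_bol_0, m_bol_1, t_eff_0, t_eff_1):
    """Solve equation (7.12) for r_0 when the two temperatures differ.

    Rearranging (7.12):
        log[(r_0 + delta_r) / r_0] = -(m_1 - m_0)/5 - 2 log(T_1 / T_0),
    so r_0 = delta_r / (ratio - 1), where ratio = (r_0 + delta_r) / r_0.
    """
    log_ratio = (-(m_bol_1 - m_bol_0) / 5.0
                 - 2.0 * math.log10(t_eff_1 / t_eff_0))
    ratio = 10.0 ** log_ratio
    return delta_r / (ratio - 1.0)
```

With equal temperatures the temperature term vanishes and the expression reduces to equation (7.13). Any error in the adopted color-temperature relation feeds directly into `log_ratio`, and hence into the derived radius and distance.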

The detonation of Supernova 1987A in the Large Magellanic Cloud 
(LMC) provided an ideal opportunity for testing the Baade–Wesselink method, 
since the distance to the LMC is already well-calibrated by other methods 
(§7.4.1). Further, the quality of data available for such a nearby supernova is 
far superior to that for more distant supernovae, and we have access to 
information (such as the exact time of detonation and the type of the progenitor 
star) which is usually unobtainable. Approximating the atmosphere of the 
supernova by a black body, Branch (1987) used the Baade–Wesselink method 
to obtain a distance to the LMC of 55 ± 5 kpc, in good agreement with other 
estimates. However, Schmutz et al. (1990) have made a detailed model for 
this supernova explosion, and they find that this agreement is somewhat 
fortuitous, due to the chance cancellation of several systematic errors. They also 
discovered that if the progenitor star had been a red supergiant rather than 
a blue supergiant, then this cancellation would not have occurred, and the 
simple Baade–Wesselink analysis would have produced a distance estimate 
that was wrong by a factor of two. Since we are very unlikely to be able to 
identify the progenitors of more distant supernovae, these calculations cast 
some doubt on the suitability of the simplest Baade–Wesselink analyses for 
determining the extragalactic distance scale. 


7.2.2 The Sunyaev—Zel’dovich effect 


The observed cosmic microwave background arises from the highly-redshifted 
thermal emission from the time when, not long after the Big Bang, the Uni- 
verse became optically thin. Most observed background photons have trav- 
elled undisturbed since this time, and so we receive a very uniform illumina- 
tion by microwave radiation from all directions. However, a small fraction of 
this light will have interacted with other material in the Universe between its 
creation at very high redshift and its arrival on Earth. In particular, photons 
that have passed through rich clusters of galaxies may have been Compton- 
scattered by electrons in the hot gas that pervades these systems. Since the 
electrons are more energetic than the photons, the photons will usually have 
their energies increased by the interaction, and so the light will on average 
be shifted to higher frequencies. Thus, the location of the peak in the black 
body spectrum of the undisturbed background radiation will be shifted to 
higher frequencies. Observations of the background radiation seen through 
a rich cluster of galaxies should therefore reveal an excess of photons at 
frequencies higher than the peak in the undistorted background spectrum, and 
a deficit at lower frequencies (Zel’dovich & Sunyaev 1969). This distortion 
in the microwave background is known as the Sunyaev–Zel’dovich or SZ 
effect. The size of the distortion depends only on the probability that a 
given photon is Compton scattered, $P_{\rm SZ} = 1 - e^{-\tau_{\rm SZ}}$, where 

$$\tau_{\rm SZ} = \sigma_T \int_S {\rm d}s\, n_e(s) \qquad (7.14)$$

is the optical depth for scattering as the photon moves along the path $S$ 
through the cluster, $n_e$ is the electron density in the cluster’s plasma, and $\sigma_T$ 
is the Thomson scattering cross-section. Since the distance to the cluster does 
not appear in equation (7.14), the distortion of the microwave background 
enables us to obtain a distance-independent constraint on the distribution of 
electrons in the intervening cluster. 

We can exploit this phenomenon to measure the distance to the cluster 
by comparing the distance-independent observation to a distance-dependent 
measure of the electron density. Such a measure is provided by X-ray ob- 
servations: as electrons scatter off protons in the hot cluster plasma, they 
emit X-rays via the Bremsstrahlung process (§8.1.2). The Bremsstrahlung 
emissivity of the gas at frequency $\nu$ is 

$$\epsilon(\nu) = A\,n_e^2\,T_X^{-1/2} e^{-h\nu/kT_X}, \qquad (7.15)$$

where $T_X$ is the temperature of the plasma, and $A$ is a constant (see §8.1.2). 
By observing the X-ray emission at a range of frequencies, we can measure 
the spectrum produced by the hot gas, and thus obtain its temperature. The 
observed X-ray flux then depends only on the electron density and our dis- 
tance from the cluster. Thus, combining this observation with the distance- 
independent measure of electron density provided by the Sunyaev—Zel’dovich 
effect, we can solve for the only remaining unknown quantity, the distance 
to the cluster. 

To illustrate this method, consider a simple model for a cluster in which 
the cluster gas is isothermal and fills a uniform-density sphere of radius $r_c$, 
lying a distance $D$ from us. The optical depth measured by the SZ effect 
through the middle of such a cluster is $\tau_{\rm SZ} = 2\sigma_T r_c n_e$. The observed 
X-ray flux is simply $f_X(\nu) = \frac{4}{3}\pi r_c^3 \epsilon(\nu)/(4\pi D_L^2)$, and with an imaging 
X-ray telescope we can also measure the angular extent of the X-ray emission, 
$\delta\theta_X = 2r_c/D_A$ radians. Combining these observable quantities with 
equation (7.15) and using the definition of angular diameter distance 
[equation (7.8)], we find that a cluster at redshift $z$ lies at a luminosity distance 

$$D_L = \frac{A\,\tau_{\rm SZ}^2\,\delta\theta_X}{24\,\sigma_T^2\,f_X(\nu)\,(1 + z)^2}\,T_X^{-1/2} e^{-h\nu/kT_X}. \qquad (7.16)$$

Thus, once the redshift has been measured using optical spectra of galaxies 
in the cluster, we can combine the X-ray observations and the Sunyaev–Zel’dovich 
measurements to solve directly for the cluster distance. In reality, 
of course, the electron density and temperature will vary with position in the 
cluster, but Silk & White (1978) have shown how detailed spatially-resolved 
X-ray observations can be used to allow for this greater degree of generality, 
subject only to the assumption that the cluster is spherically symmetric. 
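
The uniform-sphere model lends itself to a simple round-trip check: generate the three observables from a cluster of known size, density and distance, then recover the distance from them. In the sketch below the function names are assumptions, `emiss_coeff` stands for the combination $A\,T_X^{-1/2} e^{-h\nu/kT_X}$ (the emissivity per unit $n_e^2$), and SI units are used throughout.

```python
import math

SIGMA_T = 6.6524587e-29  # Thomson scattering cross-section in m^2


def mock_observables(r_c, n_e, emiss_coeff, d_lum, z):
    """Observables for a uniform, isothermal sphere of gas.

    r_c in m, n_e in m^-3, d_lum (luminosity distance) in m.
    Returns (tau_sz, delta_theta in radians, X-ray flux f_x).
    """
    d_ang = d_lum / (1.0 + z) ** 2                      # equation (7.8)
    tau_sz = 2.0 * SIGMA_T * r_c * n_e                  # through the center
    volume = 4.0 / 3.0 * math.pi * r_c ** 3
    f_x = volume * emiss_coeff * n_e ** 2 / (4.0 * math.pi * d_lum ** 2)
    delta_theta = 2.0 * r_c / d_ang
    return tau_sz, delta_theta, f_x


def sz_distance(tau_sz, delta_theta, f_x, emiss_coeff, z):
    """Luminosity distance (m) from the observables, for the same model."""
    return (emiss_coeff * tau_sz ** 2 * delta_theta
            / (24.0 * SIGMA_T ** 2 * f_x * (1.0 + z) ** 2))
```

Because the unknown density is eliminated between the SZ and X-ray measurements, the recovered distance does not depend on the value of `n_e` used to generate the mock data.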

Here, then, is a simple method that enables us to calculate extragalactic 
distances directly. It relies on relatively straightforward observations and 
their interpretation using well-understood physics. The only dubious step 
in the process is the assumption of spherical symmetry for the cluster gas. 
This assumption is central to the method, since the distance is essentially 
determined by equating the length through the cluster (as measured by the 
SZ effect) to the angular scale across the cluster (as measured from X-ray 
images). Unfortunately, X-ray images of rich clusters reveal that they are 
far from spherically-symmetric: some have no clearly defined shape, while 
others appear in projection as reasonably regular ellipses. Where an elliptical 
shape does provide a reasonable fit, the mean ratio between the long and the 
short axes is $a/b \approx 1.5$ (Struble & Ftaclas 1994). If such an elliptical cluster 
were viewed from different angles, the path length through the cluster would 
change, leading to values of $\tau_{\rm SZ}$ that also vary by a factor of $a/b$. If we were to 
assume incorrectly that the cluster under scrutiny was spherically symmetric, 
then the distance estimate derived from the above analysis could also be off 
by a factor of up to $a/b$ (see Problem 7.3), and so there is considerable 
uncertainty inherent in this method for determining distances. 
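
The size of this bias can be illustrated with a toy calculation. The sketch below assumes a uniform spheroid viewed exactly along its symmetry axis, analyzes it as though it were a sphere, and returns the ratio of inferred to true distance. All quantities are in arbitrary units (the Thomson cross-section, emissivity coefficient and density are set to unity and the redshift to zero), and the function name is an assumption of this example.

```python
import math


def distance_bias(los_to_sky_axis_ratio):
    """Inferred/true distance when a spheroid is analyzed as a sphere.

    los_to_sky_axis_ratio is the semi-axis along the line of sight divided
    by the semi-axis in the plane of the sky.
    """
    b_sky = 1.0
    a_los = los_to_sky_axis_ratio * b_sky
    d_true = 1.0e3                       # true distance, much larger than b_sky

    tau = 2.0 * a_los                    # optical depth through the center
    volume = 4.0 / 3.0 * math.pi * b_sky ** 2 * a_los
    f_x = volume / (4.0 * math.pi * d_true ** 2)
    delta_theta = 2.0 * b_sky / d_true   # angular size seen on the sky

    d_inferred = tau ** 2 * delta_theta / (24.0 * f_x)   # spherical formula
    return d_inferred / d_true
```

In this toy model the bias equals the axis ratio itself: a cluster elongated along the line of sight by a factor of 1.5 is placed 1.5 times too far away, and one flattened along the line of sight is placed correspondingly too close.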

It is tempting to suggest that this difficulty could be overcome by ob- 
serving a random sample of clusters. Although the individual distance de- 
terminations would have large uncertainties due to the unknown asphericity 
of the cluster, it should be possible to obtain the correct answer on average, 
and thus calibrate the extragalactic distance scale. However, observational 
bias makes it difficult to select an appropriate random sample. For a sample 
of clusters of a given intrinsic richness and ellipticity, those that happen to lie 
with their longest axis along the line of sight will produce the most detectable 
X-ray emission, since their gas will appear concentrated in the smallest re- 
gion. Such end-on clusters will also have the largest SZ effects due to the 
long path length through these systems. When analyzing samples of clusters, 
great care must be taken to ensure that we are not preferentially selecting 
these end-on systems for which the estimated distances will be systematically 
too large. In this context, it is interesting to note that Sunyaev—Zel’dovich 
distance estimates tend to be larger than the values obtained using other 
methods (e.g. McHardy et al. 1990). 


7.2.3 Distances from time delays 


If an astrophysical source’s light output changes with time, then we may be 
able to estimate its distance by measuring a delay in the arrival of the 
time-varying signal. The details of the method depend on the physical reason for 
the time delay, but essentially all such methods use the observed delay, $\Delta t$, 
in the arrival of a signal traveling at the speed of light to obtain a measure 
of a physical dimension of the source, $d \sim c\Delta t$. If we can also measure the 
same characteristic scale in angular coordinates, $\theta$, derived from the size of 
the source on the sky, then we have a direct measure of the distance to the 
source, $D \sim c\Delta t/\theta$, where $\theta$ is measured in radians. 

Figure 7.1 Diagram showing the time delay in the signal received from SN 1987A due 
to reprocessing by material located in a ring of radius $r_{\rm ring}$ at an inclination $i$ to the line 
of sight. The Hubble Space Telescope direct image of the inclined ring is shown on the 
left of the figure; note also the two fainter rings which imply that the full geometry of the 
system is more complex. [HST image reproduced with permission from AURA/STScI] 


The ring around Supernova 1987A A simple example of distance de- 
termination via time delay is furnished by Supernova 1987A. Observations 
using the International Ultraviolet Explorer (IUE) satellite revealed narrow 
emission lines from highly ionized atoms in the spectrum of the supernova. 
These lines were first detected at a time $t_0 \approx 90$ days after the initial 
explosion, and their intensities rose to a maximum after $t_{\rm max} \approx 400$ days, before 
decaying slowly back down to very low flux levels. The simplest explanation 
for this delayed light curve is that the lines arise from radiation that has been 
reprocessed by material at some distance from the supernova, with the delay 
arising from the extra light travel time involved in this detour. Support for 
this hypothesis comes from a Hubble Space Telescope image of the super- 
nova obtained through a narrow-band filter which admits light produced in 
a line of ionized oxygen (see Figure 7.1). This image shows that the emission 
from this ion is concentrated in several rings, the brightest of which forms 
an ellipse centered on the supernova. An inclined circular ring of material 
around the supernova would have this elliptical appearance and might also 
explain the delayed light curve. 

The geometry of the proposed tilted ring is illustrated in Figure 7.1. 
The first reprocessed light to reach us would travel via path B, which, by 
simple geometry, would be delayed as compared to the direct path A by an 
amount 

$$t_0 = \frac{r_{\rm ring}}{c}(1 - \sin i), \qquad (7.17)$$

where $i$ is the inclination of the ring, and $r_{\rm ring}$ is its radius. The strength 
of the line emission would then continue to rise until the entire ring is 
contributing to the reprocessed light output. The last point to “light up” around 
the ring would be the point which is illuminated only via the longest path, 
C. The extra time delay to this point is 

$$t_{\rm max} = \frac{r_{\rm ring}}{c}(1 + \sin i). \qquad (7.18)$$

Combining equations (7.17) and (7.18) with the observed values of $t_0$ and 
$t_{\rm max}$, we can solve simultaneously for $i$ and $r_{\rm ring}$; this analysis yields 
$i = (42 \pm 5)^\circ$ and a ring diameter $2r_{\rm ring} = (0.42 \pm 0.03)$ pc (Panagia et al. 1991). 
An entirely independent 
measure of the ring’s inclination comes from its apparent ellipticity in the 
HST image, and these data yield an inclination of $i = (43 \pm 3)^\circ$ (Jakobsen 
et al. 1991); the close agreement between these two results provides some 
confidence that our assumed geometry of a simple circular ring is correct. 
The major axis of the elliptical ring in the HST image has an angular size 
of $\theta = 1.66 \pm 0.03$ arcsec; equating this value to the derived linear diameter 
of the ring, $2r_{\rm ring}$, implies that the distance to the supernova is 52 ± 3 kpc, in 
good agreement with other estimates (see §7.4.1). 
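
The arithmetic of the ring method is short enough to write out in full. The sketch below solves equations (7.17) and (7.18) for the inclination and radius, then converts the angular major axis into a distance; the function names are illustrative, and the rounded inputs quoted in the text (90 days, 400 days, 1.66 arcsec) are used in place of the detailed light-curve fit of the published analysis.

```python
import math

C_M_S = 299_792_458.0            # speed of light in m/s
DAY_S = 86_400.0                 # seconds per day
PC_M = 3.0856775814913673e16     # meters per parsec
ARCSEC_RAD = math.pi / 648_000.0


def solve_ring(t0_days, tmax_days):
    """Invert equations (7.17) and (7.18).

    Adding the two equations gives 2 r_ring / c = t0 + tmax; taking their
    ratio gives sin i = (tmax - t0) / (tmax + t0).
    Returns (inclination in degrees, ring radius in parsecs).
    """
    sin_i = (tmax_days - t0_days) / (tmax_days + t0_days)
    radius_m = 0.5 * C_M_S * (tmax_days + t0_days) * DAY_S
    return math.degrees(math.asin(sin_i)), radius_m / PC_M


def ring_distance_kpc(radius_pc, major_axis_arcsec):
    """Distance from the linear diameter and the angular major axis."""
    return 2.0 * radius_pc / (major_axis_arcsec * ARCSEC_RAD) / 1.0e3


inclination_deg, radius_pc = solve_ring(90.0, 400.0)
distance_kpc = ring_distance_kpc(radius_pc, 1.66)
```

With these rounded inputs the inclination comes out near 39°, the ring diameter near 0.41 pc and the distance near 51 kpc, all within the quoted uncertainties of the published values.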


Gravitational lens time delays The measurement of distances by time 
delay can be extended to very large scales by observing gravitationally-lensed 
quasars. As we have discussed in §2.4, when light passes a massive body, its 
path is bent by gravity. If the light in question comes from a distant quasar, 
and the mass is a galaxy or even a whole cluster of galaxies, then the effects 
of this gravitational lensing can be quite dramatic. Such large masses are 
sufficient to separate lens-generated multiple images by several arcseconds, 
allowing the individual images to be resolved (see Figure 7.2). Further, the 
more complicated geometry of a galaxy’s mass distribution (as compared to 
the point-mass lens discussed in §2.4) means that multiple images containing 
four components can also be produced.⁴ To date, around ten quasars have 
been found to be split into such double or quadruple images; a review of 
these and other cosmological gravitational lensing phenomena can be found 
in Blandford & Narayan (1992), and the entire subject is presented in great 
detail by Schneider, Ehlers & Falco (1992). 

The bending of the light path by the lens increases the distance that the 
light has to travel to reach us, so there is a time delay in the signals that we 
receive. If the image of a background source at redshift $z_s$ appears shifted 
by an angle $\alpha_i$ due to gravitational lensing by a deflecting mass at redshift 
$z_d$, then the extra delay can be shown to be 

$$\Delta t_{\rm lens}^{(i)} = (1 + z_d)\left[\frac{1}{2c}\frac{D_d D_s}{D_{ds}}\,\alpha_i^2 - \frac{2}{c^3}\int_{\gamma_i} {\rm d}s\,\Phi(s)\right] \qquad (7.19)$$

[Schneider, Ehlers & Falco (1992); see also Appendix A]. The quantities 
$D_d$ and $D_s$ are the angular diameter distances to the deflecting lens and the 
source respectively, and $D_{ds}$ is the angular diameter distance to the source as 
measured from the lens (see Figure 7.2). The integral of the Newtonian 
gravitational potential $\Phi$ is evaluated along the path $\gamma_i$ traversed by the light that 
produces this image. The first term on the right-hand side of equation (7.19) 
is simply the extra travel time due to the change in geometric distance that 
the gravitationally deflected light covers in getting from source to observer 
(see Figure 7.2); the second term arises from the general-relativistic time 
dilation caused by the distortion of space-time in the presence of a gravitational 
field. 

4 In fact, such quadruple images contain a fifth component, but this last, central image 
is heavily demagnified, so it is extremely hard to detect it. 

Figure 7.2 The basic ray geometry of a gravitational lens system, showing how the 
bending of light rays by an intervening mass can produce multiple images of a background 
object which are offset from the object’s true location. The inset image shows an example 
of this phenomenon, the “Twin Quasar” QSO 0957+561. [Image obtained by K. Kuijken 
and M. Merrifield] 

Since we cannot remove the gravitational lens to measure the undeflected 
light path, $\Delta t_{\rm lens}^{(i)}$ is not an observable quantity. However, if the quasar has 
been split into two images, then each of the images will be formed by light 
that has taken a different path and will have its own time delay, $\Delta t_{\rm lens}^{(i)}$ (see 
Figure 7.2). The intrinsic light output from quasars varies with time, so we 
can obtain the difference between the time delays for two different quasar 
images, 

$$\Delta t_{\rm lens}^{(1,2)} = \Delta t_{\rm lens}^{(1)} - \Delta t_{\rm lens}^{(2)}, \qquad (7.20)$$

by observing the brightnesses of the images as a function of time and 
measuring the delay by which one of these functions lags behind the other. The 
time by which one image lags the other is typically a number of years, so 
a lensed quasar must be monitored over a long period if this delay is to be 
measured. 
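
One simple way to measure such a lag is to slide one light curve past the other and find the shift that minimizes the mean squared difference between them. The sketch below does this for an idealized case: the light curves are assumed to be uniformly sampled, noise-free, equally magnified and unaffected by microlensing, none of which holds for real monitoring data. All function names are illustrative, and `intrinsic_signal` is a made-up variability pattern.

```python
import math


def intrinsic_signal(t):
    """Hypothetical smooth quasar variability (arbitrary flux units)."""
    return (math.sin(2.0 * math.pi * t / 730.0)
            + 0.6 * math.sin(2.0 * math.pi * t / 311.0 + 1.0)
            + 0.3 * math.sin(2.0 * math.pi * t / 97.0 + 2.0))


def interpolate(times, values, t):
    """Linear interpolation on a uniformly sampled light curve."""
    step = times[1] - times[0]
    k = int((t - times[0]) // step)
    k = max(0, min(k, len(times) - 2))
    w = (t - times[k]) / step
    return (1.0 - w) * values[k] + w * values[k + 1]


def estimate_lag(times, curve_a, curve_b, trial_lags, min_overlap=10):
    """Lag of image B behind image A that minimizes the squared mismatch."""
    best_lag, best_cost = None, math.inf
    for lag in trial_lags:
        # Compare B(t) with A(t - lag) wherever A was actually observed.
        pairs = [(interpolate(times, curve_a, t - lag), b)
                 for t, b in zip(times, curve_b)
                 if times[0] <= t - lag <= times[-1]]
        if len(pairs) < min_overlap:
            continue
        cost = sum((a - b) ** 2 for a, b in pairs) / len(pairs)
        if cost < best_cost:
            best_lag, best_cost = lag, cost
    return best_lag


# Mock monitoring campaign: one sample every 5 days for about 11 years,
# with image B lagging image A by 415 days.
times = [5.0 * k for k in range(800)]
curve_a = [intrinsic_signal(t) for t in times]
curve_b = [intrinsic_signal(t - 415.0) for t in times]
recovered_lag = estimate_lag(times, curve_a, curve_b, range(0, 801, 5))
```

In this idealized setting the input lag is recovered exactly. With real data, gaps in the sampling, photometric noise and features present in only one image make the minimum of the mismatch much less clear-cut.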

If we wish to use such relative time delays to obtain a quantitative 
estimate of distances, we require a detailed model for the distribution of mass 
in the gravitational lens. In general, this model will be quite complicated, 
since there is no reason why the lensing mass should be symmetrically 
arranged, and there may be several components (such as a single galaxy and 
a surrounding cluster of galaxies) which all contribute to the total mass 
distribution of the lens. Fortunately, we also have a number of observational 
constraints on the mass distribution: different mass models would produce 
very different arrangements on the sky for the multiple image locations and 
their relative brightnesses, so the range of possible lenses that could produce 
a given multiply-imaged quasar is relatively small (e.g. Falco, Gorenstein & 
Shapiro 1991). We may also be able to observe the lensing system directly. 
If the lens is a single galaxy, for example, then we can constrain our model 
by measuring its light distribution, and by using absorption line spectra to 
measure its kinematic properties (Chapter 11). 

Once we have a model for the lens, we can derive values for the angles 
by which the quasar images have been shifted, $\alpha_i$, and the path integral 
in equation (7.19). If we have also obtained a measurement of the time 
delay by monitoring the light output of two quasar images, then the only 
remaining unknowns in equation (7.20) are the angular diameter distances 
to the source and deflecting gravitational lens, $D_s$ and $D_d$. Since we can 
measure the redshifts of the source and lens, the only remaining unknowns in 
these distances are the Hubble constant, $H_0$, and the deceleration parameter, 
$q_0$ [see equations (7.7) and (7.8)]. Thus by assuming a density model for the 
Universe (and hence fixing $q_0$), we can use the observed properties of the lens 
to solve uniquely for $H_0$. Alternatively, if similar observations were made of 
a second gravitationally lensed system then we could pool the results to solve 
simultaneously for both $H_0$ and $q_0$. 
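
Because every cosmological distance scales as $c/H_0$, the predicted time delay for a fixed lens model scales as $1/H_0$, and a measured delay therefore fixes $H_0$. The sketch below illustrates this for the critical model with $q_0 = 0.5$, in which the angular diameter distance between two redshifts has a simple closed form. The parameter `lens_term` is a hypothetical dimensionless number standing for everything that the lens model supplies (the difference between the two images of the bracketed quantity in equation (7.19), with the potential term rescaled by $D_{ds}/D_d D_s$); the function names are likewise assumptions of this example.

```python
import math

C_KM_S = 299_792.458             # speed of light in km/s
MPC_KM = 3.0856775814913673e19   # kilometers per megaparsec
DAY_S = 86_400.0                 # seconds per day


def angular_diameter_distance(z1, z2, h0):
    """Angular diameter distance (Mpc) from z1 to z2 > z1 for q0 = 0.5."""
    def comoving(z):
        return (2.0 * C_KM_S / h0) * (1.0 - 1.0 / math.sqrt(1.0 + z))
    return (comoving(z2) - comoving(z1)) / (1.0 + z2)


def predicted_delay_days(z_d, z_s, h0, lens_term):
    """Relative delay between two images for a given lens model.

    lens_term is a hypothetical dimensionless (radian^2) model output.
    """
    d_d = angular_diameter_distance(0.0, z_d, h0)
    d_s = angular_diameter_distance(0.0, z_s, h0)
    d_ds = angular_diameter_distance(z_d, z_s, h0)
    seconds = (1.0 + z_d) * (d_d * d_s / d_ds) * MPC_KM / C_KM_S * lens_term
    return seconds / DAY_S


def hubble_constant_from_delay(z_d, z_s, observed_delay_days, lens_term,
                               h0_trial=70.0):
    """Rescale a trial H0 using the 1/H0 scaling of the predicted delay."""
    predicted = predicted_delay_days(z_d, z_s, h0_trial, lens_term)
    return h0_trial * predicted / observed_delay_days
```

The trial value `h0_trial` drops out of the answer, so the result depends only on the measured delay, the two redshifts, the assumed $q_0$ and the lens model. An error of a given fractional size in `lens_term` produces the same fractional error in $H_0$.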

In practice, of course, life is never so simple. Even obtaining the basic 
measurement on which the method depends — the difference in time delay 
between images — is not straightforward. For example, in the case of the best- 
studied multiply-imaged quasar, QSO 0957+561, which has been monitored 
fairly closely over a period of more than ten years, there has been a long- 
running debate as to the time delay between the two images. Estimates vary 
between 410 ± 20 days (Pelt et al. 1994) and 540 ± 10 days (Press, Rybicki
& Hewitt 1992). These discrepant estimates arise because the two images 
do not vary in the same way: although many of the variations observed in 
one image are also seen in the second image ~ 500 days later, other features 
occur in only one of the images. Clearly, these non-repeated features are not 
consistent with the simple gravitational lens model for such systems. One 
plausible explanation for the differences between the two images is that in- 
dividual stars in the lensing galaxy occasionally pass through the light path 
from one or other of the quasar images, producing an additional amplifica- 
tion in the brightness of the image due to the microlensing influence of the 
star (Kayser, Refsdal & Stabell 1986; see §2.4). Fortunately, continuing
observations of this source have now turned up a sharp spike in the lumi- 
nosity of the secondary image 415 days after an identical feature occurred in 
the primary image (Kundić et al. 1997). The absence of any similar feature
in the secondary image with a time lag of anywhere near 540 days all but 
rules out the longer time delay. From the amount of effort invested in this
single measurement, it is clear that obtaining gravitational lens time delays
is a difficult problem for even the simplest, best-studied systems. 

A further complication is that just measuring the positions and relative 
brightnesses of the images is not, in general, sufficient to determine the lens 
model uniquely. Kochanek (1991) has shown that plausible mass models for
the QSO 0957+561 lens, which all satisfactorily reproduce the basic image
properties, yield values of $H_0$ that are uncertain by a factor of ten. It is also
quite possible that small-scale inhomogeneities in the cosmological distribu- 
tion of mass along the line-of-sight to the quasar will produce additional 
gravitational lensing effects which are difficult to characterize in a simple 
lens model. Fortunately, additional constraints on the lens mass distribu- 
tion have come from subtler properties of the images: Garrett et al. (1994) 
have obtained very high resolution radio observations of the QSO 0957+561 
system, which resolve features in the individual images of the background 
quasar, showing a radio core and jet structure. By requiring the lens to 
reproduce the details of the individual distorted images, the range of pos- 
sible mass distributions is dramatically reduced. A similar increase in the 
number of constraints on the lens mass can come from studying more com- 
plex gravitationally-lensed systems, such as the quadruply-imaged “Einstein 
Cross”, QSO 2237+031 (Yee 1988). The positions and relative brightness of 
four images provide many more constraints on the lens model than does a 
two-component image. Moreover, in such a quadruple system there are three 
independent relative time delays that might be measured. 

Clearly, difficult observations and modeling are required in order to use 
gravitational lens time delays to measure distances. However, the underly- 
ing physics — the gravitational bending of light rays — is both simple and 
well-understood. Furthermore, these systems can be seen right across the 
Universe, and so they provide direct distance determinations on the largest 
possible scales. These potential benefits more than offset the difficulty in 
making the requisite measurements, so the investment of effort will be well 
rewarded. 


7.2.4 Water-maser proper-motions by VLBI 


In §2.2.4, we studied the method of statistical parallax in which the distance 
to a set of objects is estimated by comparing their average line-of-sight ve- 
locities with their average proper motions. In the stellar case, the difficulty 
of obtaining accurate proper motions restricts the application of this tech- 
nique to those stars which lie closer than ~ 500 pc. To extend this method 
to extragalactic distances, we have to meet the following requirements: we 
must study sources that are bright enough to be readily detectable at large 
distances; their line-of-sight velocities must be determinable with reasonable 
accuracy; and we must observe them using a technique that allows us to 
measure very small changes in their angular positions. 
All of these criteria can be met by studying H2O maser spots. These
sources, which occur preferentially in regions of active star formation, emit 
intense line radiation at a rest wavelength of $\lambda = 1.35$ cm. This emission
arises from a molecular transition in water molecules, which is excited by 
nearby newly-formed stars. The excitation produces an inversion in the 
population of the H2O molecular energy states. Any spontaneously-emitted 
photon from this material will encounter other excited molecules in its path, 
causing them to emit further photons at this wavelength; these photons, in 
turn, induce further emission in the same direction. The resulting cascade 
of coherent mono-energetic photons — a cosmic maser — produces a cone of 
tightly-beamed radio emission which is sufficiently bright to be observed at 
very large distances. Since the maser emission occurs in a narrow line, we 
can use the observed Doppler shift to measure the line-of-sight velocity of the 
emitting region with an accuracy of better than 1 km s$^{-1}$. Finally, because
the emission occurs at radio wavelengths, we can use the technique of very 
long baseline interferometry (VLBI) whereby simultaneous observations from 
radio telescopes across the earth (of diameter $D_{\rm earth} \sim 10^7$ m) are combined
to produce images with an angular resolution of $\sim \lambda/D_{\rm earth}$ radians $\sim 10^{-4}$
arcseconds. Images produced using this technique show that the maser emis- 
sion occurs in discrete spots, and the positions of the brighter spots can be 
obtained to an accuracy of $\sim 10^{-5}$ arcseconds by measuring the centroid of
the unresolved emission. 
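
The resolution estimate above is easy to check numerically. The short Python sketch below evaluates $\lambda/D$ for the maser wavelength and an Earth-sized baseline; the function name and the rounded baseline of $10^7$ m are illustrative assumptions, not the parameters of any particular VLBI array.

```python
import math

# Conversion factor from radians to arcseconds.
ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi


def diffraction_limit_arcsec(wavelength_m, baseline_m):
    """Approximate angular resolution, lambda / D, in arcseconds."""
    return (wavelength_m / baseline_m) * ARCSEC_PER_RADIAN


# H2O maser line (1.35 cm) observed with an Earth-diameter baseline (~1e7 m).
resolution = diffraction_limit_arcsec(1.35e-2, 1.0e7)
print(f"VLBI resolution: {resolution:.1e} arcsec")
```

The result is a few times $10^{-4}$ arcseconds, consistent with the order-of-magnitude figure quoted in the text.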

As we shall see in §7.4.1, studies of the statistical parallax of H2O masers
in star forming regions near the Galactic center provide the only absolute de- 
termination of the Sun’s radius in the Galaxy. Attempts have also been made 
to push this technique to extragalactic distances. Greenhill et al. (1993) have 
observed H2O masers in M33 at two epochs separated by t = 479 days. Some 
maser spots were detected in the data from both epochs, and they had moved 
on average by an RMS distance of $\delta\theta \sim 8 \times 10^{-6}$ arcseconds. The Doppler
velocities of the spots within each star-forming region imply that they have a
line-of-sight velocity dispersion of $\sigma_{\rm los} \sim 17$ km s$^{-1}$. Equating the transverse
RMS motion to the line-of-sight velocity dispersion, we obtain a distance
estimate for M33 of $D = \sigma_{\rm los}/[\delta\theta({\rm radians})/t] \sim 600$ kpc. This value is lower
than the accepted distance of ~ 740 kpc (M33 is a companion galaxy to M31, 
whose distance has been investigated extensively; see §7.4.1). However, as 
Greenhill et al. (1993) point out, there are a large number of uncertainties 
in this analysis. If the orbits of the masing clouds were not entirely random, 
then the average transverse and line-of-sight velocities would not be equal, 
and so the simple statistical parallax argument would break down. It is also 
quite plausible in a star-forming region that the emitting clouds might have a 
net outward streaming velocity, which would again invalidate the statistical 
parallax technique. By fitting a complete dynamical model to the observed 
motion, we can in principle correct for these systematic effects — see Genzel 
et al. (1981) for the application of such a model to a Milky Way H2O maser 
source. Unfortunately, the very limited quantity of data available from M33 
Figure 7.3 The properties of the H2O masers in the core of
NGC 4258, showing their spatial distribution (upper panel), and 
their line-of-sight velocities as a function of position along the X 
axis (lower panel). The line indicates the line-of-sight velocities 
expected for maser spots in a simple Keplerian disk which is only 
populated at radii greater than 4.1 mas. [After Miyoshi et al. (1995) 
using data kindly provided by L. Greenhill] 


does not justify this degree of complexity. 
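
The simple statistical-parallax estimate for M33 can be reproduced in a few lines. The sketch below assumes, as the text does, that the RMS transverse velocity equals the line-of-sight velocity dispersion; the function name and unit constants are our own choices for illustration.

```python
import math

KM_PER_KPC = 3.0857e16                      # kilometres in one kiloparsec
SECONDS_PER_DAY = 86400.0
RAD_PER_ARCSEC = math.pi / (180.0 * 3600.0)


def statistical_parallax_distance_kpc(sigma_los_kms, rms_shift_arcsec, baseline_days):
    """Distance at which a transverse speed sigma_los produces the observed RMS shift."""
    # Physical distance travelled across the sky during the time baseline.
    transverse_km = sigma_los_kms * baseline_days * SECONDS_PER_DAY
    # Small-angle relation: distance = physical shift / angular shift.
    return transverse_km / (rms_shift_arcsec * RAD_PER_ARCSEC) / KM_PER_KPC


# Values quoted in the text for the M33 masers.
d_m33 = statistical_parallax_distance_kpc(17.0, 8.0e-6, 479.0)
print(f"M33 distance: {d_m33:.0f} kpc")
```

With the quoted inputs this gives a little under 600 kpc. If the maser motions include ordered streaming, the equality assumed in the first step fails and the estimate is biased accordingly.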

VLBI observations of the core of the nearby spiral galaxy NGC 4258 
(Miyoshi et al. 1995) reveal that it contains a distribution of H2O maser spots 
with a remarkably simple geometry (see Figure 7.3). The emission spots 
are distributed in an almost-linear structure passing through the projected 
center of the galaxy, with the features on one side of the center redshifted 
relative to the galaxy’s systemic velocity, while those on the other side appear 
blueshifted. The natural explanation for these observations is that the maser 
spots lie in an edge-on disk orbiting around a large central mass in the galaxy. 
Radio jets emanate from the core of NGC 4258 which also shows up as an 
X-ray source, so this galaxy clearly contains an active galactic nucleus (see 
§4.6.2). The massive black hole believed to cause these phenomena explains 
the large derived central mass, and it also provides the energy source required 
to power the maser spots in the surrounding disk. 

Confirmation of this model has come from the measurements of the 
Doppler shifts of the individual maser spots. As Figure 7.3 shows, the
line-of-sight velocity of the maser spots at large radii drops off with the distance from
the center of rotation as $r^{-1/2}$, exactly as one would expect for Keplerian
motion around a central point mass. At small projected radii, the
line-of-sight velocities vary linearly with radius; this linear dependence can only
occur if these maser spots all lie on a single circular orbit. We therefore end
up with a picture of a disk around the central mass which is filled with masing
material outside a ring of maser spots that defines the disk’s inner edge.$^5$ We
can infer the radius of this inner edge by measuring the point at which the
extrapolation of the linear variation in line-of-sight velocity meets up with
the $r^{-1/2}$ dependence at large radii; as Figure 7.3 shows, this intersection
places the inner edge of the accretion disk at a radius of $\theta_{\rm in} = 4.1$ mas. The
orbital velocity of material at this inner radius is then simply the amplitude
of the line-of-sight velocity at the intersection point, $v_{\rm in} = 1080$ km s$^{-1}$ (see
Figure 7.3).

The individual maser spots follow their circular orbits because they are 
centripetally accelerated by the central mass. We can detect the line-of-sight 
component of this acceleration by measuring changes in the Doppler shifts 
of individual features with time. Since the acceleration is always directed 
toward the center of the disk, the largest line-of-sight accelerations should 
occur for maser spots that lie directly in line with the center of the accretion 
disk, where the full centripetal acceleration projects into the line of sight. 
VLBI observations spread over several years confirm that the velocities of 
spots which appear projected close to the center of the disk drift at a rate 
$\dot v = 9.5 \pm 1.1$ km s$^{-1}$ yr$^{-1}$ (Greenhill et al. 1995). As we have seen above,
the maser spots that appear close to the center of the system are all emitted
from close to the inner radius of the accretion disk, $r_{\rm in}$. We can therefore
equate the observed rate of drift in velocities to the centripetal acceleration,
$\dot v = v_{\rm in}^2/r_{\rm in}$. Substituting the observed values for $\dot v$ and $v_{\rm in}$, we find that
$r_{\rm in} = 0.12 \pm 0.01$ pc. Comparing this radius with the angular size of the inner
edge of the disk, $\theta_{\rm in} = 4.1$ mas, we find that NGC 4258 must lie at a distance
of $D = 6.2 \pm 0.7$ Mpc. Because of the simple geometry in this system, these
observations provide an elegant technique for measuring absolute distances 
with very little ambiguity. The observed angular scales in NGC 4258 are 
well above the limits achievable by VLBI; assuming that similar bright H2O
maser systems can be found in the nuclei of other galaxies, the same method 
could be applied to systems at much higher redshifts in order to provide a 
relatively error-free absolute calibration to the cosmic distance scale. 
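
The chain of reasoning for NGC 4258 (velocity drift, then physical radius, then distance) can be followed step by step in the sketch below. It uses only the three measured quantities quoted in the text; the function name, the unit constants and the use of a Julian year are assumptions made for the illustration.

```python
import math

KM_PER_PC = 3.0857e13                       # kilometres in one parsec
SECONDS_PER_YEAR = 3.15576e7                # Julian year
RAD_PER_ARCSEC = math.pi / (180.0 * 3600.0)


def maser_disk_distance(v_in_kms, drift_kms_per_yr, theta_in_mas):
    """Return (r_in in pc, distance in Mpc) for a Keplerian maser ring."""
    # Centripetal acceleration a = v^2 / r, so r = v^2 / a.
    # (km/s)^2 / (km/s/yr) has units of (km/s) * yr, hence the factor of seconds per year.
    r_in_km = v_in_kms ** 2 / drift_kms_per_yr * SECONDS_PER_YEAR
    r_in_pc = r_in_km / KM_PER_PC
    # Small-angle relation between the physical and angular radius of the ring.
    theta_rad = theta_in_mas * 1.0e-3 * RAD_PER_ARCSEC
    return r_in_pc, r_in_pc / theta_rad / 1.0e6


r_in, d_ngc4258 = maser_disk_distance(1080.0, 9.5, 4.1)
print(f"r_in = {r_in:.3f} pc, D = {d_ngc4258:.1f} Mpc")
```

The sketch returns a radius close to 0.13 pc and a distance close to 6.3 Mpc, which agree with the quoted values within their stated errors. The dominant uncertainty is the roughly 12% error on the velocity drift, which propagates directly into the distance.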


5 This disk geometry, and the requirement that maser emission can only occur along 
lines of sight where all the material has similar line-of-sight velocities, also explains the 
marked absence of detectable maser spots at intermediate radii — see Problem 7.4. 
7.3 Relative distance estimators


As we have seen in the last section, there exist a variety of methods for mea- 
suring absolute distances to particular classes of objects. We can estimate 
the distances to these objects directly because they possess unusually simple 
geometries, and so we can interpret observations of them on a straightfor- 
ward physical basis. Unfortunately, most astronomical phenomena are the 
result of much more complicated processes, so no such simple modeling is 
possible when we wish to estimate their distances. However, many of these 
more complex systems do seem to follow simple empirical relations between
their intrinsic properties, and these relations can be employed as distance 
estimators once they have been calibrated. 

We have already discussed one good example of such a calibrated dis- 
tance indicator for star clusters in §3.5.2: by comparing features in the CM 
diagrams of clusters at different distances, it is possible to derive the amount 
by which the apparent magnitudes of stars in the clusters differ, and thus 
derive their relative distances [equation (7.1)]. Once we have obtained an 
independent measure of the distance to one cluster (such as the Hyades, by 
using the moving cluster method; §2.2.2), we can convert all the relative 
measurements to an absolute scale. 

This example illustrates many of the features typical of relative distance 
estimators. First, the distance estimate depends on combining a distance- 
independent property of objects (the stars’ colors) with a property that does 
vary with distance (the stars’ apparent magnitudes) in order to estimate their 
distances. Second, it is empirically based: without having to understand the 
complex physics of stellar structure which dictates stars’ luminosities and 
colors, we can use the features that occur in the CM diagram to measure 
relative distances to clusters. Finally, since this distance estimator makes 
no use of the underlying physics, we have no a priori knowledge of the ab- 
solute calibration of the measure, and so we must obtain an independent 
measurement of the distance to at least one cluster in order to calibrate the 
relation. 

Once calibrated on to an absolute scale, the relative distance estimators 
hold a number of advantages over the absolute methods. First, relative 
estimators depend on empirically established properties of objects. They do 
not require that we make assumptions about the geometry or physics of the 
source, so they are not subject to the systematic uncertainties that arise if our 
assumptions are incorrect. Second, relative distances can be estimated using 
common sources. They can therefore be used to measure the distances to 
many more objects, allowing us to estimate distances over the wide range that 
we require if we are to calibrate the complete extragalactic distance scale. 
They are also useful for measuring subtle effects such as the dependence of 
measured distance on $q_0$ [equations (7.7) and (7.8)], where large numbers of
distances must be obtained in order to reduce the uncertainties which arise 
from random errors. A wide-ranging review of many of the relative distance 
estimators used in defining the cosmic distance scale can be found in Jacoby
et al. (1992). 


7.3.1 Luminosities of variable stars 


In §5.1.10 we saw that the luminosities of certain pulsating stars can be 
determined fairly accurately from their periods. Clearly, the distance mod- 
ulus of such a star can be estimated as the difference between its apparent 
magnitude and its predicted absolute magnitude. 

The two most important classes of pulsating stars are Cepheid variables 
and RR Lyrae stars. Cepheids have absolute magnitudes $M_V \sim -3$, so
they can be studied to distance moduli $m - M \sim 25$ from the ground and
$m - M \sim 28$ with HST. RR Lyraes are significantly fainter than Cepheids,
having absolute magnitudes $M_V \sim 0.6$. Hence they can be studied only
to distance moduli $m - M \sim 22$ from the ground or $m - M \sim 25$ with
HST. Consequently, whereas Cepheids can be studied in objects that lie near 
the center of the Virgo Cluster, even with HST it is impossible to find RR 
Lyrae stars outside the Local Group. Under optimal conditions, photometric 
distances to Cepheids and RR Lyrae stars can have errors that are as small 
as 10%. 

Cepheid variables have played a central role in defining the extragalac- 
tic distance scale. Their widespread usefulness arises from their powerful 
combination of properties: 

(1) They obey very well-defined relations such as equation (5.19), and so 
make excellent standard candles. 

(2) Their characteristic variability means that they can be identified unam- 
biguously. 

(3) They are bright, and thus can be observed to reasonably large distances. 

(4) They are plentiful, and so are found both near the Sun where their 
distances can be calibrated quite straightforwardly, and in moderately 
distant galaxies where they can be used as distance indicators. 

As we have discussed in Chapter 1, Cepheids have the historical honor 
of providing Hubble with the evidence needed to demonstrate that an ex- 
tragalactic distance scale existed. As we shall see in §7.4, even today these 
stars play a central role in refining the calibration of that scale. 


7.3.2 Luminosity functions 


Objects that have a large spread, $\sigma_M$, in the distribution of their absolute
magnitudes are not individually useful as standard candles. However, if
we observe a sample of $N$ such objects all at the same distance, the average
magnitude of the sample will have a smaller scatter of $\sigma_M/\sqrt{N}$. If we observe
a large enough sample, then the uncertainty in the average magnitude will 
become small enough for this quantity to serve as a standard candle. The 
success of this approach rests on the assumption that the distribution of the
sources’ luminosities — their luminosity function — does not vary from system 
to system, so that comparing the characteristic apparent luminosities of the 
distributions in different systems provides a direct measure of the relative 
distances to the systems. 
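
To see how quickly averaging helps, the following sketch evaluates the scatter of the mean magnitude and the sample size needed to reach a target precision. The 1 mag spread and 0.25 mag target are hypothetical numbers chosen only to make the arithmetic transparent.

```python
import math


def mean_magnitude_error(sigma_m, n_objects):
    """Scatter of the sample-averaged magnitude, sigma_M / sqrt(N)."""
    return sigma_m / math.sqrt(n_objects)


def sample_size_for_precision(sigma_m, target_error):
    """Smallest N for which sigma_M / sqrt(N) does not exceed target_error."""
    return math.ceil((sigma_m / target_error) ** 2)


# Hypothetical population with a 1 mag spread in absolute magnitude.
error_16 = mean_magnitude_error(1.0, 16)
n_needed = sample_size_for_precision(1.0, 0.25)
print(error_16, n_needed)
```

Because the error falls only as $1/\sqrt{N}$, halving the uncertainty requires four times as many objects.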


Globular clusters Shapley (1953) first noted that the average luminosi- 
ties of globular clusters in the Milky Way, the Large Magellanic Cloud, and 
M31 all appear very similar, suggesting that this quantity might be used as 
a standard candle. It is very difficult to detect globular clusters in other 
more distant spiral galaxies: unless the galaxy is very close to edge-on, it 
is hard to pick out the globular clusters against the uneven background of 
the emission from the galaxy’s disk. In addition, the corrections to the globular
cluster magnitudes due to extinction from dust in spiral galaxies are hard to 
characterize. However, it is straightforward to observe the globular clusters 
around a giant elliptical galaxy: these systems frequently have very large 
populations of globular clusters; the clusters are easy to detect against the 
smooth surface brightness distribution of their host elliptical; and the galaxy 
has little internal extinction. We are therefore faced with the unfortunate 
situation where the one system for which we have accurate measurements 
of the absolute magnitudes of globular clusters — the Milky Way — is very 
different from the giant elliptical galaxies for which we can obtain the best 
globular cluster data. Since there is no a priori reason why luminosity func- 
tions for the globular clusters in spiral and elliptical galaxies should be the 
same, there remains some doubt as to the absolute calibration of this distance 
estimator. 

Despite this caveat, the globular cluster luminosity functions (GCLFs) 
for other galaxies look remarkably similar. The number of clusters per unit 
apparent magnitude is well-characterized by a simple Gaussian luminosity 
function, 


$$\phi_{\rm GC}(m) \propto e^{-(m - \bar m_{\rm GC})^2/2\sigma_{\rm GC}^2}. \qquad (7.21)$$


By fitting this functional form to the observed luminosity distributions of 
globular clusters in elliptical galaxies, we can obtain a robust estimate for the 
apparent magnitude of the mean of the luminosity function, $\bar m_{\rm GC}$. Assuming
that there is a universal intrinsic distribution of luminosities, comparisons
between the values of $\bar m_{\rm GC}$ obtained for different cluster systems provide
a direct measure of their relative distances. In those cases where there are 
independent measures of the relative distances to these galaxies, these esti- 
mates agree well with the GCLF distances, suggesting that the assumption 
of a universal luminosity function is valid. A more complete discussion of 
the application of the GCLF distance estimator can be found in Jacoby et 
al. (1992) §4. 
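
As an illustration of how the GCLF yields relative distances, the sketch below evaluates the Gaussian of equation (7.21) and converts the offset between two fitted peak magnitudes into a distance ratio. The peak magnitudes and the width used here are hypothetical, and the function names are our own.

```python
import math


def gclf(m, m_peak, sigma):
    """Unnormalised Gaussian GCLF of equation (7.21)."""
    return math.exp(-((m - m_peak) ** 2) / (2.0 * sigma ** 2))


def distance_ratio(m_peak_a, m_peak_b):
    """Ratio D_b / D_a implied by the offset between two GCLF peak magnitudes."""
    # A difference of 5 mag in apparent magnitude corresponds to a factor 10 in distance.
    return 10.0 ** ((m_peak_b - m_peak_a) / 5.0)


# Hypothetical fitted peaks for two cluster systems.
ratio = distance_ratio(23.5, 25.0)
print(f"System b is {ratio:.2f} times farther than system a")
```

The conversion relies entirely on the assumption that both systems share the same intrinsic peak magnitude; any real difference between them enters the distance ratio directly.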

The above fitting process works effectively as long as the GCLF has been 
measured to magnitudes fainter than $\bar m_{\rm GC}$ so that the peak in the function
is well-defined. To date, the faintest globular cluster magnitudes that have
been measured come from Hubble Space Telescope images of NGC 4881 in
the Coma Cluster. These observations have derived the luminosity function
of this galaxy’s globular cluster population down to $m_V = 27.6$ (Baum et al.
1994). For the Milky Way, the peak of the GCLF is at a V-band absolute
magnitude of $\bar M_{\rm GC} = -7.4 \pm 0.1$ (Abraham & van den Bergh 1995). If we
assume that the peak in the luminosity function of giant ellipticals lies at a 
similar absolute magnitude, then the limiting magnitude of 27.6 means that 
we should be able to see the peak in the GCLF for galaxies out to distances 
of ~ 100 Mpc. 
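
The limiting distance quoted above follows directly from the distance modulus. A minimal check, using only the two magnitudes given in the text and neglecting extinction:

```python
def distance_from_modulus_mpc(modulus):
    """Distance in Mpc for a distance modulus m - M = 5 log10(D / 10 pc)."""
    return 10.0 ** (modulus / 5.0 + 1.0) / 1.0e6


# Limiting apparent magnitude 27.6 and GCLF peak at absolute magnitude -7.4.
max_modulus = 27.6 - (-7.4)
d_max = distance_from_modulus_mpc(max_modulus)
print(f"m - M = {max_modulus:.1f}, D = {d_max:.0f} Mpc")
```

In practice the peak must be sampled to somewhat fainter than $\bar m_{\rm GC}$ for a reliable fit, so the usable range is rather smaller than this upper limit.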


Planetary nebulae Young planetary nebulae (§5.1.2) provide a second 
population of objects whose luminosity function is well-suited for use as 
a standard candle. Since the emission from these systems occurs in lines, 
planetary nebulae can be detected even against the bright stellar continuum 
emission of a galaxy by obtaining images through narrow bandpass filters 
centered on these lines. Further, planetary nebulae occur in profusion in all 
types of galaxies, so their luminosity function can be accurately determined 
by measuring the luminosities of large samples in individual systems. 

Observations of nearby galaxies through narrow bandpass filters cen- 
tered on the 500.7 nm line of ionized oxygen show that the planetary nebu- 
lae luminosity function (PNLF) varies exponentially with magnitude in this 
bandpass. However, this relationship breaks down at high luminosities, and 
there are essentially no planetary nebulae brighter than a cut-off apparent 
magnitude, $m^{\rm cut}_{\rm PN}$. Ciardullo et al. (1989) have parameterized this
abruptly-curtailed luminosity function in the form


$$\phi_{\rm PN}(m) \propto e^{0.307m}\left(1 - e^{3(m^{\rm cut}_{\rm PN} - m)}\right). \qquad (7.22)$$
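
The behaviour of equation (7.22) is perhaps easiest to appreciate by evaluating it. The sketch below implements the Ciardullo et al. (1989) form, with the function set to zero brightward of the cut-off (where the expression would otherwise go negative); the function name and the choice of test magnitudes are illustrative assumptions.

```python
import math


def pnlf(m, m_cut):
    """Unnormalised planetary nebula luminosity function of equation (7.22)."""
    if m <= m_cut:
        # No planetary nebulae are expected brighter than the cut-off.
        return 0.0
    return math.exp(0.307 * m) * (1.0 - math.exp(3.0 * (m_cut - m)))


# Evaluate at a few magnitudes fainter than a cut-off at 19.77 (the M31 value).
for dm in (0.0, 0.1, 0.5, 1.0, 2.0):
    print(f"m - m_cut = {dm:.1f}: {pnlf(19.77 + dm, 19.77):.1f}")
```

The truncation factor rises from zero to about 95% of its asymptotic value within one magnitude of the cut-off, which is why the bright edge is so sharply defined even in small samples.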


The 500.7 nm line lies within the bandpass covered by the Johnson V filter, 
and the zero-point of the line emission intensity scale is customarily defined 
so that the derived magnitude is the same as would have been obtained if 
the line had been observed in the V-band. Numerically, if the flux collected 
from the line is $f_{500.7}$ W m$^{-2}$ then


Observations of the planetary nebula populations in a variety of types of 
galaxy reveal that all can be well-described by equation (7.22), suggesting 
that the PNLF is a universal function, and can hence be used as a standard 
candle. The particular functional form of the PNLF is also ideally suited 
for standard candle analysis: the sharp cut-off provides a distinctive feature 
in the luminosity function, and, even with only limited data, the cut-off 
magnitude can be measured quite accurately. Since this feature occurs at 
the bright end of the luminosity function, we only need observe the brightest 
planetary nebulae in a galaxy to measure this characteristic cut-off. We can 
Figure 7.4 The planetary nebula luminosity functions of M31 and
of a composite of six galaxies in the Virgo Cluster. The 500.7 nm
apparent magnitudes, $m_{500.7}$, have been corrected for extinction.
The line shows the adopted functional form of equation (7.22).
The cut-off at bright fluxes is apparent in both the M31 and Virgo 
Cluster data. [After Jacoby et al. (1992)] 


therefore use the PNLF as a standard candle out to large distances where 
only the brightest planetary nebulae are detectable. For example, Figure 7.4 
shows the observed planetary nebula luminosity functions for M31 and for 
galaxies in the Virgo Cluster. The sharp cut-off at bright magnitudes is 
apparent even in the distant Virgo Cluster data, and comparison between 
the values of $m^{\rm cut}_{\rm PN}$ yields an immediate estimate of the relative distances to
these two systems (see §7.4.2). 


Calibration of the absolute value of the cut-off luminosity is generally 
referred to observations of planetary nebulae in the bulge of M31. After 
correction for extinction, the maximum luminosity in the PNLF for this 
system occurs at a magnitude of $m^{\rm cut}_{\rm PN} = 19.77 \pm 0.04$ (Ciardullo et al. 1989).
If we adopt a distance to this system of 740 ± 40 kpc (§7.4.1), we obtain an
absolute magnitude for the cut-off of $M^{\rm cut}_{\rm PN} = -4.6 \pm 0.1$. With this absolute
calibration, the distance moduli of other galaxies can be readily estimated. 
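
The calibration arithmetic can be verified as follows. The sketch converts the adopted M31 distance into a distance modulus, subtracts it from the observed cut-off magnitude, and combines the two quoted errors in quadrature; treating the errors as independent and Gaussian is our assumption.

```python
import math


def distance_modulus(distance_pc):
    """Distance modulus m - M = 5 log10(D / 10 pc)."""
    return 5.0 * math.log10(distance_pc / 10.0)


m_cut, m_cut_err = 19.77, 0.04          # observed cut-off magnitude in M31
d_pc, d_err_pc = 740.0e3, 40.0e3        # adopted distance to M31

mu = distance_modulus(d_pc)
# A fractional distance error dD/D maps to (5 / ln 10) * dD/D magnitudes.
mu_err = 5.0 / math.log(10.0) * d_err_pc / d_pc
abs_cut = m_cut - mu
abs_cut_err = math.hypot(m_cut_err, mu_err)
print(f"M_cut = {abs_cut:.2f} +/- {abs_cut_err:.2f}")
```

The error budget is dominated by the uncertainty in the distance to M31 rather than by the photometry, so any revision of that distance shifts every PNLF distance by the same amount.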
7.3.3 Novae and supernovae


Novae, and especially supernovae, have large intrinsic luminosities. They 
have therefore always proved most attractive as possible standard candles 
for establishing the extragalactic distance scale. Indeed, part of the con- 
troversy in the original Shapley-Curtis debate on the existence of an extra- 
galactic distance scale (§1.2.2) arose because supernovae and novae had been 
confused. Now that this confusion has been resolved, both novae and super- 
novae provide very useful distance estimators. A thorough review of the use 
of these systems for distance estimation can be found in Jacoby et al. (1992) 
§5 (novae) and §6 (supernovae). 


Novae These transient systems, which result from the explosive ignition 
of material accreted on to the surfaces of white dwarf stars, have a number of 
desirable properties as standard candles. They are intrinsically bright ($M_V \sim -7$
at maximum brightness), and so can be observed out to large distances.
They are produced by old stellar populations, and thus occur in elliptical 
galaxies as well as spirals. Since ellipticals do not contain intrinsically-bright 
objects such as early-type stars, novae are relatively easy to pick out in 
these systems. Further, since ellipticals do not suffer much from internal 
extinction, it is straightforward to measure the photometric properties of the 
novae that they host. Finally, observations indicate that the brightnesses of 
novae do not depend significantly on the metallicity of their surroundings 
(van den Bergh & Pritchet 1986), and so novae from environments with very 
different heavy element abundances can be compared directly. 

For novae within the Milky Way, we can also determine distances by 
the simple absolute method of expansion parallax. The expanding shell 
of gas ejected by the explosion is thrown off with a typical velocity of
$v_{\rm exp} \sim 1000$ km s$^{-1}$; this quantity can be measured directly from the Doppler shifts
in spectral lines. After a time $t$, the shell will show up in images as a structure
of angular size $\delta\theta$, which we know corresponds to a physical dimension of
$v_{\rm exp}t$. We thus know that the source lies at a distance of $D = v_{\rm exp}t/\delta\theta$.
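
A numerical example may make the expansion-parallax relation concrete. In the sketch below the shell age and angular size are hypothetical values chosen for illustration (only the typical expansion velocity comes from the text), and the angular size is taken to be the angular radius corresponding to the physical radius $v_{\rm exp}t$.

```python
import math

KM_PER_PC = 3.0857e13
SECONDS_PER_YEAR = 3.15576e7
RAD_PER_ARCSEC = math.pi / (180.0 * 3600.0)


def expansion_parallax_distance_pc(v_exp_kms, age_yr, angular_radius_arcsec):
    """Distance D = v_exp * t / delta_theta, in parsecs."""
    shell_radius_km = v_exp_kms * age_yr * SECONDS_PER_YEAR
    return shell_radius_km / (angular_radius_arcsec * RAD_PER_ARCSEC) / KM_PER_PC


# Hypothetical nova: 1000 km/s shell, 10 yr after outburst, 1 arcsec angular radius.
d_nova = expansion_parallax_distance_pc(1000.0, 10.0, 1.0)
print(f"Nova distance: {d_nova / 1000.0:.1f} kpc")
```

For these inputs the distance is about 2 kpc. The estimate assumes a spherical shell expanding at constant speed, so that the line-of-sight velocity measured spectroscopically equals the transverse expansion speed.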

The angular diameters of extragalactic novae are too small for this 
method to be applicable, so we must return to the idea of using them as 
standard candles in order to determine their distances. Unfortunately, the 
intrinsic luminosities derived for the Galactic novae with measured distances 
show that these systems are not standard candles: their absolute magnitudes 
range from $M_V = -4.8$ to $M_V = -8.9$ (Cohen 1985). However, novae are
not identical in other properties, either: Zwicky (1936) noticed that the light 
output from faint novae decays more slowly than that from bright novae. 
Since the measured rate of decline does not depend on the distance to the 
nova, we can use this measurement to predict the intrinsic brightness of the 
nova. Cohen (1985) quantified this relation using the 11 Milky Way novae 
with well-determined distances. She discovered that they obey a relationship 


$$M_V({\rm max}) = -10.7 + 2.3\log(t_2/{\rm day}), \qquad (7.24)$$
Figure 7.5 Plot showing the absolute magnitudes of Milky Way novae
at maximum brightness, $M_V({\rm max})$, as a function of the time they took
to fade from this maximum by two magnitudes, $t_2$. The line shows the
adopted linear relation of equation (7.24). [After Cohen (1985)]


where $M_V({\rm max})$ is the absolute magnitude of the nova at its maximum, and
$t_2$ is the time that the nova took to decline in brightness by two magnitudes
from this maximum (see Figure 7.5). The dispersion about this best fit is
~ 0.5 magnitudes, and so by measuring $t_2$ for a nova, we can predict its
distance to within ~ 25%. By observing many novae in a single
galaxy, this statistical uncertainty can be reduced significantly. 
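
The use of equation (7.24) as a distance estimator, and the origin of the ~ 25% figure, can be sketched as follows. The decline time and apparent magnitude of the example nova are hypothetical, extinction is ignored, and the function names are our own.

```python
import math


def nova_peak_absolute_magnitude(t2_days):
    """Peak absolute V magnitude from the decline time t2, equation (7.24)."""
    return -10.7 + 2.3 * math.log10(t2_days)


def distance_mpc(apparent_mag, absolute_mag):
    """Distance in Mpc from the distance modulus, ignoring extinction."""
    return 10.0 ** ((apparent_mag - absolute_mag) / 5.0 + 1.0) / 1.0e6


def fractional_distance_error(sigma_mag):
    """Fractional distance error corresponding to a magnitude error sigma_mag."""
    return 10.0 ** (sigma_mag / 5.0) - 1.0


# Hypothetical nova: fades by two magnitudes in 10 days, peak apparent magnitude 17.0.
peak_abs = nova_peak_absolute_magnitude(10.0)
d_example = distance_mpc(17.0, peak_abs)
frac_err = fractional_distance_error(0.5)
print(f"M_V(max) = {peak_abs:.1f}, D = {d_example:.2f} Mpc, error ~ {100 * frac_err:.0f}%")
```

A scatter of 0.5 mag corresponds to a fractional distance error of about 26%. Averaging over $N$ novae in the same galaxy reduces the magnitude scatter by $\sqrt{N}$, so sixteen novae would bring the statistical error down to roughly 6%.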

Some caveats to this result come from the analysis of novae in M31 by 
Capaccioli et al. (1989). They showed that the brightest novae in M31 are 
fainter than predicted by equation (7.24). Since it is these brightest novae 
that are most likely to be detected in more distant galaxies, the application 
of this relation at large distances may not be appropriate. In addition, a few 
“anomalous” novae lie more than a magnitude brighter than predicted by equation (7.24).
These anomalous systems are easy enough to spot in M31 where they are 
well-separated from the majority of the data. However, in more distant 
systems where fewer novae have been observed, they would be much harder 
to exclude. 

As an alternative distance estimator based on novae in well-observed 
systems, we might use some characteristic of their luminosity function as a 
standard candle (see §7.3.2). The luminosity function of novae at maximum 
brightness appears to be a double-peaked function with a reasonably sharp 
central minimum (Capaccioli et al. 1989). It has therefore been suggested 
that the location of this minimum in the luminosity function might provide 
a useful standard candle. 


Type Ia supernovae It has been known for a long time that many of 
the brightest supernovae have very similar properties (Kowal 1968). In par- 
ticular, type Ia supernovae (see §5.2.3) follow remarkably similar evolution 
in their light output with time. Since it is believed that these supernovae 
all originated from essentially identical systems, this similarity is not
surprising. We might therefore predict that the luminosities of type Ia
supernovae are also all very similar, so these extremely bright sources
should provide ideal standard candles which can be observed at very large
distances.

A thorough review of the suitability of type Ia supernovae as standard 
candles can be found in Branch & Tammann (1992). One particularly im- 
pressive piece of evidence which supports their use comes from studies of the 
few galaxies in which two type Ia supernovae have been observed. In the three 
systems for which observations of two separate supernovae are available, the 
apparent V-band magnitudes of the supernovae at maximum brightness dif- 
fer by only 0.1 mag on average. Since the separate supernovae are likely to 
be viewed through different amounts of extinction in their host galaxies, the 
intrinsic scatter in maximum brightness is probably even smaller. 

The luminosities of type Ia supernovae have been calibrated against a 
number of other distance estimators. Most recently, Hubble Space Telescope 
images have been used to detect Cepheid variables in the galaxies that have 
hosted nearby type Ia supernovae (Sandage et al. 1992, Saha et al. 1995). 
Adopting the Cepheid distance scale of §7.3.1, these observations imply that
the supernovae have peak absolute magnitudes of $M_V = -19.52 \pm 0.07$
and $M_B = -19.48 \pm 0.07$.

A further technique for directly calibrating well-observed type Ia supernovae
is provided by the Baade–Wesselink method (§7.2.1). Application of
this method seems to produce somewhat brighter estimates for peak absolute
magnitudes (Branch, Drucker & Jeffery 1988), but given all the uncertainties
in the Baade–Wesselink analysis, the difference may not be significant.

A final check on the absolute magnitudes of these supernovae comes 
from direct modeling of the explosion. The physics of this process is suffi- 
ciently simple that it should be possible to model the explosion at the level 
required to predict the absolute magnitude of the resulting supernova. At 
present, there remain a number of uncertainties in the modeling process, but 
the values for unknown quantities can be derived by leaving them as free 
parameters and fitting to the observed detailed properties of the supernova. 
These properties include the exact spectrum of the supernova, and the vari- 
ation of its observed brightness with time (or “light curve”) in the various 
photometric bands. This fitting procedure yields a best fit which predicts an
absolute magnitude of $M_B = -19.4 \pm 0.3$ (Branch 1992), consistent with the
empirical calibrations.

Despite this reasonably coherent picture, some doubts have now been 
cast on the status of type Ia supernovae as standard candles. For example, 
Maza et al. (1994) have studied two apparently normal type Ia supernovae, 
one in a galaxy at a redshift of 0.020 and the other at a redshift of 0.019. 
If these supernovae were identical in intrinsic luminosity and lay at the dis- 
tances implied by the Hubble law for their redshifts, then their apparent 
brightnesses at maximum luminosity should differ by only 0.1 magnitudes. 
In fact, they differ by 0.8 ± 0.2 mag in the B band and 0.4 ± 0.2 mag in the
I band.
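
The expected difference of 0.1 magnitudes follows from the Hubble law, since at low redshift distance is proportional to redshift. The sketch below assumes exactly that proportionality and ignores peculiar velocities; the function name is illustrative only.

```python
import math

def hubble_mag_difference(z1, z2):
    """Apparent-magnitude difference m1 - m2 for identical sources at
    redshifts z1 and z2, assuming distance proportional to redshift."""
    return 5.0 * math.log10(z1 / z2)

expected = hubble_mag_difference(0.020, 0.019)

# Observed B-band difference and its quoted error, from the text above.
observed_b, error_b = 0.8, 0.2
excess_sigma = (observed_b - expected) / error_b

print(f"expected difference: {expected:.2f} mag")
print(f"B-band excess: {excess_sigma:.1f} times the quoted error")
```

The B-band discrepancy exceeds the Hubble-law expectation by more than three times its quoted error, which is why it is hard to dismiss as measurement noise.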




Calibration of nine well-studied type Ia supernovae against other dis- 
tance indicators has led Phillips (1993) to suggest that the colors of these 
systems vary systematically with the rate at which their luminosities decline. 
Work by Riess, Press & Kirshner (1995) has quantified this correction by fit- 
ting template light curves to the observed photometry of supernovae in order 
to measure the rate at which each declined. As for the novae discussed above, 
measurements of the shapes of light curves provide a mechanism for predict- 
ing accurately the intrinsic brightness of a supernova, and hence producing 
a distance estimator with a small scatter. 

Supernova simulations are also beginning to come to grips with the 
variations between explosions: attempts have been made to explain the dif- 
ferences in peak luminosity and light curve shape by allowing the details of 
the explosion to vary from system to system (e.g. Höflich & Khokhlov 1996).
Thus, as an alternative to the purely empirical luminosity calibration based 
on light curve shapes, one can fit complete theoretical models of type Ia su- 
pernova explosions to an observed light curve. An estimate of the intrinsic 
luminosity of the explosion is then provided by that of the best-fit model. 


7.3.4 Distances from galaxy kinematics 


Once we look beyond the Milky Way, the clearest structures that we see 
in the Universe are other galaxies. One obvious method for measuring the 
distance to these structures is by comparing their properties to those of 
nearby galaxies whose distances we can estimate using the methods discussed 
above. Unfortunately, galaxies are demonstrably not standard candles or 
standard rulers. Images of clusters of galaxies reveal that the members, 
which all lie at approximately the same distance, have an enormous range of 
apparent magnitudes and angular diameters. Clearly, we will have to look 
to more subtle properties if we are to use galaxies themselves as distance 
indicators. 

One promising line of investigation is to use measurements of the internal 
kinematics of a galaxy in order to estimate its intrinsic luminosity. Very 
broad arguments based around the virial theorem (BT §4.3) imply that the 
average speed at which material orbits in a gravitating system increases with 
the mass of the system. We might also make the reasonable — but unproven — 
assumption that the amount of luminous material in a galaxy increases with 
its mass. Taken together, these properties imply that the intrinsic luminosity 
of a galaxy should increase with the amplitude of the orbital velocities of its 
constituents. We can obtain a distance-independent measure of the orbital 
velocities in a galaxy by looking at the Doppler shifts of its spectral lines 
relative to the galaxy’s systemic velocity, and by measuring the broadening 
of the lines (see §8.2.4 and Chapter 11). We might then hope to use such 
kinematic observations to estimate the intrinsic luminosity of the galaxy, and 
thus, by comparison to its apparent magnitude, its distance. 


Spiral galaxies The first broadly successful implementation of this approach
was presented by Tully & Fisher (1977). For a sample of ten nearby
spiral galaxies, they showed that the galaxies’ rotation velocities (measured
using Doppler broadening in 21 cm emission) correlate quite tightly with
their absolute magnitudes (measured from apparent magnitudes and distances
calculated from Cepheid variables). This correlation is known as the
Tully–Fisher relation.

The single-dish radio telescope observations used in this analysis do not
spatially resolve the HI disk in each system, and so the shape of the spectrum
around 21 cm provides us with a picture of the Doppler shifts in the HI
throughout the galaxy. As we shall see in §8.2.4, most of the HI in the disks
of spiral galaxies follows approximately circular orbits at the same speed, $v_c$.
However, not all of this rotational motion will lie along the line of sight, so
different parts of the galaxy will have their HI emission Doppler shifted by
different amounts. If the galaxy is inclined at an angle $i$ to the line of sight,
then the observed line-of-sight velocity for the HI will lie between $-v_c \sin i$
and $+v_c \sin i$ relative to the galaxy’s systemic velocity. If one makes a spatially
unresolved observation of such a disk, one finds that the 21 cm emission
line is Doppler-broadened into a characteristic “two-horned” structure, with
sharp peaks at frequencies corresponding to Doppler shifts of $\pm v_c \sin i$. Thus,
measuring the line width gives a direct measure of $v_c$. Conventionally, this
width is measured by finding the points in the line’s wings at which the intensity
has dropped to 20% of the peak value in the two horns. The difference
in Doppler shift between these points in the high- and low-frequency wings,
$W_{20}$, then provides the measure of the galaxy’s kinematics.

To relate $W_{20}$ to the intrinsic rotation velocity, $v_c$, we need to correct
for two effects. First, as we shall see in §§8.1.4 and 9.2.6, HI does not
follow exactly circular orbits, but has a modest degree of random motion
superimposed. These random motions broaden the line profile slightly, so
$W_{20}$ overestimates the Doppler broadening due to rotational motion. This
effect can be corrected for by subtracting an estimate of the contribution
to $W_{20}$ arising from random motions: $W_R = W_{20} - W_{\rm rand}$.⁶ For gas with a
Gaussian random velocity distribution with dispersion $\sigma$, the correction to
$W_{20}$ is $W_{\rm rand} = 3.6\sigma$ (Bottinelli et al. 1983). Since $\sigma \sim 10$ km s$^{-1}$, and $W_{20}$
is only ~ 100 km s$^{-1}$ for faint galaxies, this correction can be highly
significant. The second effect we must correct for is inclination. Since the
velocity width that we measure is essentially $2v_c \sin i$, the effects of inclination
can be removed by dividing $W_R$ by $\sin i$. Thus,

$$W_R^i = (W_{20} - W_{\rm rand})/\sin i \tag{7.25}$$

provides an inclination-independent measure of the rotation rates of spiral
galaxies.
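
As a rough illustration of equation (7.25), the sketch below applies the two corrections in turn. The function name and the example inputs are hypothetical. The constant computed first is the full width of a Gaussian at 20% of its peak, which comes out close to the factor of 3.6 quoted above; the convention that 90 degrees is edge-on is an assumption made to match the $\sin i$ dependence.

```python
import math

# Full width of a Gaussian at 20% of peak, in units of sigma:
# exp(-v**2 / (2 sigma**2)) = 0.2  gives  width = 2 sqrt(2 ln 5) sigma.
GAUSSIAN_W20_FACTOR = 2.0 * math.sqrt(2.0 * math.log(5.0))

def corrected_width(w20, sigma, inclination_deg, rand_factor=3.6):
    """Equation (7.25): (W_20 - W_rand) / sin i, with W_rand = rand_factor * sigma.

    Velocities are in km/s; the inclination is in degrees (90 = edge-on).
    """
    sin_i = math.sin(math.radians(inclination_deg))
    if sin_i <= 0.0:
        raise ValueError("a face-on disk has no measurable rotation width")
    return (w20 - rand_factor * sigma) / sin_i

print(f"Gaussian 20% width factor: {GAUSSIAN_W20_FACTOR:.2f}")
print(f"corrected width: {corrected_width(100.0, 10.0, 60.0):.1f} km/s")
```

For a faint galaxy with a measured width of 100 km/s, the random-motion term removes about a third of the width, which shows why the correction matters most for slow rotators.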


6 More sophisticated formulations of this correction factor exist [e.g. Tully & Fouqué 
(1985)], but they do not produce significantly different answers. 




Figure 7.6 Plot of absolute magnitude in B- and H-bands as
a function of velocity width for galaxies with independently determined
distances. Each panel plots absolute magnitude against the logarithm of
the velocity width in km s$^{-1}$. [From the data published in Pierce & Tully
(1992)]


Corrections must also be applied to the absolute magnitudes before they
can be correlated with $W_R^i$. Unfortunately, a galaxy that contains sufficient
HI for a profile to be obtained also contains large amounts of dust, so some
of the galaxy’s luminosity will be internally absorbed. Since the dust is concentrated
in the disk of the galaxy, the amount of absorption will depend on
the inclination of the galaxy. Absorption will be maximized when the galaxy
is edge-on, as in this orientation the starlight from the disk of the galaxy
has to pass through the longest path length of obscuring dust before leaving
the galaxy. A number of methods have been developed that correct for
this effect by modeling the properties of the dust layer [e.g. Tully & Fouqué
(1985)].⁷ Alternatively, one can minimize the impact of the correction by
obtaining the requisite photometry at infrared wavelengths, where the effects
of extinction are small, an approach pioneered by Aaronson, Huchra
& Mould (1979). The infrared holds the further advantage that light in this
waveband is dominated by old stars rather than the young stars that dominate
bluer wavelengths. Thus, the infrared flux provides a measure of stellar
content that is less dependent on current star formation rates, so it better
reflects the overall stellar population of the galaxy. The total stellar content
is, presumably, closely tied to the total mass of the galaxy and hence its
kinematics. We would therefore expect $W_R^i$ to be most closely correlated
with the infrared luminosity of the galaxy.

Once the necessary corrections have been applied, the inclination-corrected
absolute magnitudes, $M^i$, can be determined for nearby “calibrator”
galaxies with good Cepheid distances. The Tully–Fisher relation between
$\log W_R^i$ and $M^i$ is remarkably tight (see Figure 7.6). As well as reflecting
an interesting underlying piece of astrophysics, the tightness of the relation
gives us some confidence in the correction factors that have been applied:
since galaxies with comparable values of $W_R^i$ also have comparable values of
$M^i$, even when their inclinations differ, it would seem that the correction
factors have been chosen appropriately.

7 The photometry must also, of course, be corrected for Galactic extinction; see §3.7.1.

Pierce & Tully (1992) have calibrated Tully–Fisher relations based on
photometry in B-, R-, I- and H-bands, using data from six calibrator galaxies.
They found that the Tully–Fisher relations in these bands are

$$
\begin{aligned}
M_B^i &= -7.48(\log W_R^i - 2.50) - 19.55 + \Delta_B \pm 0.14,\\
M_R^i &= -8.23(\log W_R^i - 2.50) - 20.46 + \Delta_R \pm 0.10,\\
M_I^i &= -8.72(\log W_R^i - 2.50) - 20.94 \pm 0.10,\\
M_H^i &= -9.50(\log W_R^i - 2.50) - 21.67 \pm 0.08.
\end{aligned}
\tag{7.26}
$$


As predicted above, the relations in the infrared show a somewhat smaller
scatter than those in the optical. The correction factors, $\Delta_B$ and $\Delta_R$, have
been introduced because Pierce & Tully (1992) found that different bands
produce inconsistent results. Using the local calibrations (for which $\Delta_B$ and
$\Delta_R$ are zero), they calculated the distance to the Ursa Major Cluster by comparing
members’ absolute magnitudes as predicted by these relations to their
apparent magnitudes. The distance modulus of this cluster was found to be
systematically larger in the B- and R-bands than in the infrared. Arbitrary
offsets of $\Delta_B = 0.25$ and $\Delta_R = 0.06$ were therefore introduced for cluster
galaxies (but not the calibrating field galaxies) to explain the discrepancy; it
would appear that cluster spiral galaxies are systematically redder than field
galaxies. The ad hoc nature of the applied correction means that distances
derived using these optical bands may well be subject to residual systematic
errors; future users of the Tully–Fisher relation are likely to concentrate on
the infrared bands where this uncertainty does not arise.
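
To make the use of these calibrations concrete, the following sketch turns a corrected velocity width and an apparent magnitude into a distance. The coefficients are those of equation (7.26); the function names, the `offset` argument (standing in for the cluster offsets in the B- and R-bands) and the example galaxy are hypothetical. The apparent magnitude is assumed to have already been corrected for inclination and extinction.

```python
import math

# Band: (slope, zero point, quoted scatter in mag), from equation (7.26).
TF_COEFFS = {
    "B": (-7.48, -19.55, 0.14),
    "R": (-8.23, -20.46, 0.10),
    "I": (-8.72, -20.94, 0.10),
    "H": (-9.50, -21.67, 0.08),
}

def tf_absolute_magnitude(w_ri, band, offset=0.0):
    """Predicted absolute magnitude for a corrected width w_ri in km/s."""
    slope, zero_point, _scatter = TF_COEFFS[band]
    return slope * (math.log10(w_ri) - 2.50) + zero_point + offset

def distance_mpc(apparent_mag, absolute_mag):
    """Distance from the distance modulus m - M = 5 log10(D / 10 pc)."""
    return 10.0 ** ((apparent_mag - absolute_mag + 5.0) / 5.0) / 1.0e6

# Hypothetical galaxy: log W = 2.50 and a corrected H-band magnitude of 9.33.
m_h = tf_absolute_magnitude(10.0 ** 2.50, "H")
print(f"M_H = {m_h:.2f}, D = {distance_mpc(9.33, m_h):.1f} Mpc")
```

The quoted scatter of 0.08 mag in the H-band corresponds to a distance uncertainty of only about 4% per galaxy, before any systematic errors in the calibration are considered.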


Elliptical galaxies The Tully–Fisher relation is only applicable to spiral
galaxies, so one obvious question is whether an analogous relationship holds
for elliptical galaxies. Ellipticals are free of obscuration, so the existence
of such a relation would be very useful for determining accurate distances.
Further, since most rich clusters of galaxies contain very few spirals, the
relation would be particularly useful for estimating the distances to these
systems.

Happily, as we have already seen in §4.3.4, analogous relations hold
for elliptical galaxies. Specifically, as Figure 4.43 illustrates, the luminosity
inside the effective radius of an elliptical galaxy, $L_e$, is quite tightly correlated
with its central velocity dispersion, $\sigma_0$ [the Faber–Jackson relation; see
equation (4.38)]. Thus, we can predict the absolute magnitude of an elliptical
galaxy on the basis of its central velocity dispersion, and derive its distance
by comparing this quantity to the apparent magnitude of the galaxy.

The scatter in the relation is considerable, and so its ability to predict
accurate distances is rather poor. As we also saw in §4.3.4, much of this
scatter arises because the properties of elliptical galaxies also depend on
their effective radii, $R_e$, and if we calculate $\log R_e$, $\langle I\rangle_e$ (the mean surface
brightness inside $R_e$), and $\log \sigma_0$, we find that galaxies lie in a single plane in
these coordinates [equation (4.39)]. Since both $\langle I\rangle_e$ and $\sigma_0$ are independent
of the distance to a galaxy, we can use the observed values of these quantities
to predict the intrinsic effective radius of the galaxy via equation (4.39), and
by comparing this size-scale to the observed angular effective radius of the
galaxy, we can predict its distance.

In practice, it is more common to use the approximation to the fundamental
plane provided by the $D_n$–$\sigma_0$ relation [equation (4.43)] to estimate
distances. This formulation has the advantage that the parameter $D_n$ (the
diameter within which the mean surface brightness is 20.75 mag arcsec$^{-2}$ in
the B band) can be easily and robustly estimated from quite poor quality
photometry. By comparing the observed angular size of $D_n$ to the scale
predicted by equation (4.43), one obtains the distance to a galaxy.

It should be borne in mind that the distances to individual galaxies 
derived in this way are not very good: the 15% scatter about the $D_n$–$\sigma_0$
relation of equation (4.43), for example, means that the distance to any 
single galaxy has an intrinsic uncertainty of 15%. However, by observing 
many galaxies within a single cluster, the uncertainty in the distance to 
the cluster can be beaten down to insignificance. Similarly, if one’s goal is 
to calibrate the Hubble constant (§7.4.3) or measure large-scale streaming 
(§7.4.2) by measuring distances to large samples of galaxies, then the large 
uncertainties in individual measurements become unimportant. 


7.3.5 Surface brightness fluctuations 


As we have already seen, if a galaxy is sufficiently close for its individual 
stars to be resolved, then we have access to a wealth of information that 
can be used to help determine its distance. Even in more distant galaxies 
where individual stars cannot be resolved, the discrete nature of the stars 
that make up the galaxy’s light still has observable consequences. 

As a simple illustration, consider a galaxy at a distance $D$ which is made
up from identical stars of luminosity $L$ distributed such that there are $n$ stars
per unit area across the observed face of the galaxy. If we make an image of
this galaxy with an angular resolution of $\delta\theta$, then the image will contain an
average of $N = n(D\,\delta\theta)^2$ unresolved stars in each $\delta\theta \times \delta\theta$ resolution element.
The flux that we observe from each star is $f = L/(4\pi D^2)$, so the average
total flux from the stars in each resolution element will be

$$F = Nf = \frac{nL\,\delta\theta^2}{4\pi}. \tag{7.27}$$

This average brightness is independent of $D$, and so clearly cannot be used as
a distance estimator. However, not all resolution elements will contain exactly
the same number of stars: Poisson fluctuations mean that this number will
have a dispersion of $N^{1/2}$. The corresponding root mean square fluctuations
in the observed flux per resolution element will be

$$\sigma_F = N^{1/2} f = \frac{n^{1/2} L\,\delta\theta}{4\pi D}. \tag{7.28}$$

Thus, the fluctuations in the surface brightness scale inversely with distance:
a galaxy at twice the distance would appear twice as smooth. Combining
equations (7.28) and (7.27), we find

$$\frac{\sigma_F^2}{F} = f = \frac{L}{4\pi D^2}; \tag{7.29}$$

by measuring the mean brightness per resolution element and the RMS fluctuations
between different elements, we can infer the flux from each individual
star. If two galaxies are made up from intrinsically identical stars, then
comparing the values of $f$ obtained by this method will immediately yield
their relative distances.
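
The chain of reasoning in equations (7.27) to (7.29) can be checked numerically. The sketch below is a toy calculation in arbitrary units, with hypothetical function names and input values: it builds the mean flux and RMS fluctuation for a galaxy of identical stars, then recovers the distance from those two observables alone.

```python
import math

def mean_flux(n, lum, dtheta):
    """Equation (7.27): mean flux per resolution element (independent of D)."""
    return n * lum * dtheta ** 2 / (4.0 * math.pi)

def rms_fluctuation(n, lum, dtheta, dist):
    """Equation (7.28): RMS flux fluctuation per element, proportional to 1/D."""
    return math.sqrt(n) * lum * dtheta / (4.0 * math.pi * dist)

def distance_from_fluctuations(mean, rms, lum):
    """Invert equation (7.29): rms**2 / mean = f = L / (4 pi D**2)."""
    flux_per_star = rms ** 2 / mean
    return math.sqrt(lum / (4.0 * math.pi * flux_per_star))

# Toy galaxy: n stars per unit area, unit luminosity, arbitrary units throughout.
n, lum, dtheta, dist = 1.0e6, 1.0, 1.0e-3, 50.0
flux = mean_flux(n, lum, dtheta)
rms = rms_fluctuation(n, lum, dtheta, dist)

print(f"fractional fluctuation: {rms / flux:.3f}")
print(f"recovered distance: {distance_from_fluctuations(flux, rms, lum):.1f}")
```

Note that the recovered distance depends on knowing $L$; without it, only the ratio of distances between two galaxies with identical stars can be obtained.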

In fact, of course, galaxies are not made up from stars with identical
luminosities. More realistically, a galaxy can be modeled as containing stars
from a variety of species with different intrinsic luminosities $L_i$. If there are
an average of $N_i$ stars per resolution element from species $i$, and each of these
stars contributes $f_i = L_i/(4\pi D^2)$ to the total brightness, then equation (7.29)
generalizes to

$$\frac{\sigma_F^2}{F} = \frac{\sum_{\rm species} N_i f_i^2}{\sum_{\rm species} N_i f_i} = \frac{\langle L\rangle}{4\pi D^2}, \tag{7.30}$$

where

$$\langle L\rangle = \frac{\sum_{\rm species} N_i L_i^2}{\sum_{\rm species} N_i L_i} \tag{7.31}$$

is a weighted mean stellar luminosity (Tonry & Schneider 1988). Thus, the
RMS fluctuations are the same as those which we would observe from a
galaxy populated entirely by stars of luminosity $\langle L\rangle$. If two galaxies contain
the same fractional mix of stellar species, then $\langle L\rangle$ is the same for both of
them, and so we can calculate their relative distances from surface brightness
fluctuations using equation (7.30).
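
A small numerical example shows how strongly the weighting in equation (7.31) favors luminous stars. The two-species population below is invented purely for illustration, and the function name is hypothetical.

```python
def weighted_mean_luminosity(counts, luminosities):
    """Equation (7.31): <L> = sum(N_i L_i**2) / sum(N_i L_i)."""
    numerator = sum(n * lum ** 2 for n, lum in zip(counts, luminosities))
    denominator = sum(n * lum for n, lum in zip(counts, luminosities))
    return numerator / denominator

# Hypothetical population per resolution element, in solar luminosities:
# a million faint dwarfs and a hundred luminous giants.
counts = [1.0e6, 1.0e2]
luminosities = [0.1, 1.0e3]

fluctuation_mean = weighted_mean_luminosity(counts, luminosities)
number_mean = sum(n * lum for n, lum in zip(counts, luminosities)) / sum(counts)

print(f"number-weighted mean luminosity: {number_mean:.2f}")
print(f"fluctuation-weighted mean <L>:  {fluctuation_mean:.2f}")
```

In this toy population the giants are only one star in ten thousand, yet the fluctuation-weighted mean lies within a factor of two of the giants' own luminosity.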

In order to use this method to measure the absolute distances to galaxies,
we need the value of $\langle L\rangle$. If we know the stellar luminosity function of
a galaxy, we can calculate $\langle L\rangle$ directly from equation (7.31).⁸ Alternatively,
measurements of the surface brightness fluctuations in a galaxy whose distance
has been measured using a different method can be used to calibrate
its value. This calibration can then be applied to determining the distances
to other galaxies if it is assumed that they have the same stellar luminosity
functions. Note, however, that the numerator in equation (7.31) depends
on $L^2$, so the average is heavily weighted toward the brightest stars in the
galaxy: for any reasonable luminosity function, $\langle L\rangle$ will be similar to the
luminosity of the brightest red giant stars, and does not change significantly
if the luminosity function of fainter stars varies from galaxy to galaxy.

8 For a continuous luminosity function, we replace the summations over discrete species
in equation (7.31) by integrals over luminosity.

Since the dominant giant stars are brightest in the red, the fluctuations
in surface brightness will be most readily detectable when observed in the
I-band. This choice of filter also has the advantage that the absolute magnitudes
of giant stars in this band do not vary much with the stars’ metallicity
and age. The value of $\langle L\rangle$ in the I-band should thus depend little on the
properties of the galaxy’s stellar population.

The method clearly works best when looking at the outer parts of galaxies
where the number of stars is low, and the Poisson fluctuations are relatively
large. However, the finite brightness of the night sky places a limit on
how far out it is possible to look. In the I-band, the sky at a dark site
has an intrinsic brightness of ~ 19 mag arcsec$^{-2}$, and it is difficult to observe
regions of galaxies very much fainter than this limit. A square arcsecond is
typical of the resolution possible with ground-based optical telescopes; with
this image quality, the sky brightness restricts accurate I-band photometry
to a limit of ~ 20 mag arcsec$^{-2}$.

For a galaxy at a distance of 20 Mpc, this surface brightness corresponds
to the I-band light from $N \sim 10^4$ giant stars per square arcsecond. In images
limited by seeing to a resolution element of 1 arcsec$^2$, the Poisson variations
in this number cause the smooth light profile of the galaxy to fluctuate at a
level of ~ 1%. Thus, fluctuations are detectable in galaxies at a distance of
~ 20 Mpc if we can measure the photometry of these systems to better than
1%.
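
These numbers can be reproduced with a short calculation. The sketch assumes that the light comes from identical giants with an I-band absolute magnitude of -1.5; that value is an assumption chosen for illustration, not a figure taken from the text, and the function name is hypothetical.

```python
import math

def stars_per_arcsec2(mu, dist_mpc, abs_mag_star):
    """Number of identical stars per square arcsecond needed to produce a
    surface brightness mu (mag arcsec^-2) at a distance dist_mpc."""
    distance_modulus = 5.0 * math.log10(dist_mpc * 1.0e6 / 10.0)
    abs_mag_per_arcsec2 = mu - distance_modulus
    return 10.0 ** (0.4 * (abs_mag_star - abs_mag_per_arcsec2))

# Assumed effective giant magnitude of -1.5 in the I-band (illustrative only).
n_stars = stars_per_arcsec2(mu=20.0, dist_mpc=20.0, abs_mag_star=-1.5)
fractional_rms = 1.0 / math.sqrt(n_stars)

print(f"stars per square arcsecond: {n_stars:.0f}")
print(f"Poisson fluctuation level: {100 * fractional_rms:.1f}%")
```

Because the fluctuation level scales as the inverse square root of the star count, moving the same galaxy to twice the distance would halve the signal to about 0.5%.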

Various sources of noise combine to make it difficult to do photometry to 
this level of accuracy. The simplest to deal with are the purely observational 
errors: Poisson statistics from the finite number of photons detected, and 
the intrinsic noise of the detector (such as the read noise in a CCD) both 
contribute fluctuations to the observed photometry. These sources of noise 
can be effectively eliminated by obtaining long-exposure images. The signal
arising from real features in a galaxy’s photometry, including surface brightness
fluctuations, will grow linearly with the integration time, $t_{\rm int}$. However,
the Poisson noise grows only as $t_{\rm int}^{1/2}$, and the intrinsic noise of the detector
will increase little, if at all, with integration time. In a long integration, these
sources of noise can thus be made negligible relative to the signal.

A rather more troublesome source of “noise” arises from other objects 
that give rise to fluctuations in the observed galaxy photometry. In partic- 
ular, unresolved faint background galaxies and globular clusters associated 
with the target galaxy will produce fluctuations of their own in the smooth 
brightness profile of the galaxy. These fluctuations must be characterized
by measuring individual luminosities for those contaminating objects which
are bright enough to be identified. The luminosity function for these con- 
taminants can be modeled using the observed brighter objects; extrapolating 
the luminosity function to fainter luminosities then allows us to predict how 
the faint undetected contaminants will contribute to the observed brightness 
fluctuations. After these effects have been allowed for, any residual fluctua- 
tions can be attributed to the intrinsic surface brightness fluctuations of the 
galaxy. 

This process is described in detail by Tonry & Schneider (1988), who
present the first detection of intrinsic surface-brightness fluctuations using
CCD photometry of two nearby galaxies. Tonry (1991) has since calibrated
the value of $\langle L\rangle$ using galaxies with Cepheid distances. By looking at a
number of galaxies in the same cluster, he also established that $\langle L\rangle$ varies
somewhat with the color of the observed galaxy, and derived the empirical
relation

$$\langle M_I\rangle = 3.0(V - I) - 4.8, \tag{7.32}$$

where $\langle M_I\rangle$ is the I-band absolute magnitude corresponding to $\langle L\rangle$. His
comparisons between the distances to galaxies obtained using this effective
standard candle and those derived from planetary-nebulae luminosity functions
(§7.3.2) revealed satisfyingly identical results.
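
In practice, equation (7.32) is combined with the apparent magnitude that corresponds to the measured fluctuation flux. The sketch below assumes that such an apparent fluctuation magnitude is available; the function names and the example galaxy are hypothetical.

```python
def sbf_absolute_magnitude(v_minus_i):
    """Equation (7.32): <M_I> = 3.0 (V - I) - 4.8."""
    return 3.0 * v_minus_i - 4.8

def sbf_distance_mpc(fluct_app_mag_i, v_minus_i):
    """Distance in Mpc from the I-band apparent fluctuation magnitude
    and the galaxy color, via the distance modulus."""
    modulus = fluct_app_mag_i - sbf_absolute_magnitude(v_minus_i)
    return 10.0 ** (modulus / 5.0 + 1.0) / 1.0e6

# Hypothetical galaxy with V - I = 1.15 and fluctuation magnitude 29.65.
print(f"<M_I> = {sbf_absolute_magnitude(1.15):.2f}")
print(f"D = {sbf_distance_mpc(29.65, 1.15):.1f} Mpc")
```

The steep color coefficient means that an error of 0.05 mag in the measured color shifts the inferred absolute magnitude by 0.15 mag, so accurate colors matter as much as the fluctuation measurement itself.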


7.4 Results 


As illustrated in Figure 7.7, there is now a wide array of different techniques 
that can be used to measure distances. As we discussed at the beginning of 
this chapter, the most secure absolute distance estimates have traditionally 
been obtained for nearby objects. In order to bridge the gap to objects at 
large distances, we have to work our way outwards using a whole series of 
techniques for estimating relative distances, in order to construct the com- 
plete cosmic distance ladder. The ultimate goal of this process is to reach 
out to distances where redshifts are dominated by the expansion of the Uni- 
verse, so that we can derive an unambiguous value for the Hubble constant. 
It is clear from Figure 7.7 that there are many ways of combining the dif- 
ferent overlapping distance estimators in order to connect the known local 
distances to the largest cosmic scales. Unfortunately, different combinations
of distance estimators have produced radically different answers: the Hubble
constant derived from these analyses takes values from ~ 40 km s$^{-1}$ Mpc$^{-1}$
(e.g. Sandage 1993b) up to ~ 80 km s$^{-1}$ Mpc$^{-1}$ (e.g. van den Bergh 1994). As
the distance to an object of given redshift scales as $H_0^{-1}$ [see equation (7.2)],
these extremes are known as the long distance scale and short distance
scale, respectively. All of the methods for deriving distances are prone to
sizeable systematic errors, and so it is not surprising that such a wide range
of values for $H_0$ can be derived by employing different techniques.

Figure 7.7 Summary of the cosmic distance scale. The upper half of the plot shows
absolute and relative distance measures: solid bars indicate the range of distances over
which the measure has been applied; dashed bars indicate that the method is “anchored”
by an absolute calibration. The lower half of the figure lists objects whose distances we
seek. Bars indicate the approximate range of distances over which these objects can be
found.

What is rather more disturbing is the fact that different authors using
approximately the same set of steps also fail to agree on distances
by significant factors. For example, de Vaucouleurs & Peters (1986) have
analyzed a sample of spiral galaxies using Tully–Fisher distances (§7.3.4)
to obtain $H_0 = 110 \pm 10$ km s$^{-1}$ Mpc$^{-1}$, whereas Sandage (1988) has argued
that the true Hubble constant derived from these data should be
$56 \pm 13$ km s$^{-1}$ Mpc$^{-1}$. Clearly, the errors quoted on these two estimates
are not mutually consistent, and at least one of the analyses must be wrong.
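
The size of the disagreement can be put in numbers. The sketch below assumes the simple low-redshift Hubble law, in which distance is the recession velocity divided by $H_0$, and treats the two quoted errors as independent and Gaussian. That second assumption is only a rough guide here, since both results come from the same data. The function names and the example redshift are hypothetical.

```python
import math

C_KM_S = 299792.458  # speed of light in km/s

def hubble_distance_mpc(z, h0):
    """Low-redshift distance D = c z / H0 in Mpc (h0 in km/s/Mpc)."""
    return C_KM_S * z / h0

def discrepancy_sigma(value_a, err_a, value_b, err_b):
    """Separation of two estimates in units of their errors added in quadrature."""
    return abs(value_a - value_b) / math.hypot(err_a, err_b)

z = 0.02  # hypothetical redshift
for h0 in (40.0, 56.0, 80.0, 110.0):
    print(f"H0 = {h0:5.1f} gives D = {hubble_distance_mpc(z, h0):6.1f} Mpc")

tension = discrepancy_sigma(110.0, 10.0, 56.0, 13.0)
print(f"110 +/- 10 versus 56 +/- 13: {tension:.1f} combined errors apart")
```

Halving $H_0$ doubles every distance inferred from a redshift, which is why the two camps are described in terms of long and short distance scales.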

In fact, historically, the derived distance scale has been much more 
closely correlated with the author of the work than it is with the particular 
choice of distance estimator. The advocates for the long and short distance 
scales are firmly set in their respective views, and have found support for 
their preferred answer using a wide range of techniques. It is only the com- 
plexity of the cosmic distance ladder that allows this dichotomy in scale to 
persist. Each step on the ladder is accompanied by a degree of uncertainty. 
Part of this uncertainty is made up of the usual random errors due to finite
number statistics, etc., but part arises from ill-constrained systematic effects.
As we have seen, different techniques for calibrating a standard candle can
easily give results that differ systematically by a few tenths of a magnitude.
Since the origins of these differences are not well understood, it is impossible
to decide objectively which calibration to adopt, and so a subjective choice
must be made. By the time that three or four such choices have been made in 
the construction of the cosmic distance ladder, it is easy to build up distance 
estimates that differ by a factor of two (see Problem 7.6). Individuals nat- 
urally tend to give more weight to the particular distance calibrations that 
fit in with their own previous work, causing the observed strong correlation 
between author and result. 
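
The way in which modest calibration differences compound into a factor of two can be illustrated directly. The sketch assumes the worst case in which every subjective choice pushes the distance scale in the same direction; the offsets used are hypothetical values of "a few tenths of a magnitude".

```python
def distance_ratio(mag_offsets):
    """Ratio of two distance scales whose calibrations differ by the
    summed magnitude offsets (all assumed to act in the same sense)."""
    return 10.0 ** (sum(mag_offsets) / 5.0)

three_steps = distance_ratio([0.3, 0.3, 0.3])
four_steps = distance_ratio([0.4, 0.4, 0.4, 0.4])

print(f"three steps of 0.3 mag: factor {three_steps:.2f}")
print(f"four steps of 0.4 mag:  factor {four_steps:.2f}")
```

If the choices were made at random rather than consistently in one direction, the offsets would partly cancel, which is why the correlation between author and result is the telling feature.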

As we discuss below, this conflict is finally beginning to be resolved. 
The improved quality of data available from instruments such as HST means 
that distance-measuring techniques can be pushed to fainter magnitudes, and 
correspondingly greater distances. It is thus possible to reduce the number 
of steps that one has to make in order to define the cosmic distance ladder. 
By reducing the number of points in the analysis at which small systematic 
errors can creep in, the freedom to combine these uncertainties to produce 
a pre-ordained result is removed. Taking this approach to its logical conclu- 
sion, the best determination of the distance scale would be one based on a 
technique that measures the absolute distance to remote extragalactic ob- 
jects in a single step. Although the systematic uncertainties in techniques 
for measuring large absolute distances are still too large to provide an un- 
equivocal answer, advances are also being made in beating these errors down 
to manageable levels. 

The growing profusion of techniques for measuring relative distances is 
also helping to clarify the situation, since it is possible to detect systematic 
errors by comparing the distances predicted by different methods in the 
range where they overlap. For example, if we derive distances to a sample of
galaxies using two different methods such as the Tully–Fisher relation and the
surface brightness fluctuation technique, then a plot of one distance estimate 
versus the other should give a straight line of unit slope. Any significant 
departure from this linear relation implies that at least one of the methods 
is compromised. By carrying out such comparisons for a number of distance 
estimators, we can isolate those methods that fail. 
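
A minimal version of such a comparison is a straight-line fit of one set of distances against the other. The sketch below uses ordinary least squares, which ignores the measurement errors on both axes and so is only a first check; the data are invented for illustration and the function name is hypothetical.

```python
def fit_line(x, y):
    """Ordinary least-squares slope and intercept of y against x."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Hypothetical distances (Mpc) to the same five galaxies from two methods.
method_a = [10.2, 14.8, 17.1, 21.5, 30.3]
method_b = [10.6, 14.1, 17.9, 20.8, 31.0]

slope, intercept = fit_line(method_a, method_b)
print(f"slope = {slope:.2f}, intercept = {intercept:.2f} Mpc")
```

A slope far from unity would point to a distance-dependent bias in one method, while a non-zero intercept with unit slope would suggest a constant offset.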

As we shall see below, these advances mean that we are now close to 
being in the remarkable position of being able to measure the distances to 
objects right across the Universe to within an accuracy of ~ 10%. 


7.4.1 Distances within the Local Group 


Historically, distances to objects within our own Local Group have played a 
major role in determining the extragalactic distance scale. As we have seen, 
it was the analysis of Cepheids in the Magellanic Clouds which determined 
their usefulness as distance indicators, and observations of these stars in M31 
which established the existence of extragalactic scales. Reviews of distance 
determinations to other galaxies within the Local Group can be found in van 
den Bergh (1989) and van den Bergh (1994). In this section, we discuss the 
measurement of distances both to the center of our own Galaxy and to two
Local Group members. As some of the more reliable distance determinations,
these measurements still provide a crucial anchor point for establishing the 
extragalactic distance scale. 


Distance to the Galactic center Measurements of the distance to the
Galactic center, $R_0$, have recently been reviewed by Reid (1993). This relatively
short distance is disproportionately difficult to measure, mainly because
our view of the Galactic center is so heavily obscured. One must
therefore either observe in one of the few relatively unobscured lines of sight
that pass near the Galactic center (such as “Baade’s Window”), or use
distance estimates that do not require optical observations.

The only useful absolute distance estimator for Ro is provided by H2O 
masers in Sgr B2 and other star-forming regions [see §7.2.4, H2O maser 
emission in the Milky Way has been reviewed by Reid & Moran (1988)]. 
Statistical parallax applied to the line-of-sight velocities and proper motions 
of these sources yield Galactocentric distances that range from Rpg = 6.5+ 
1.5 kpc (Reid et al. 1988) to Ro = 8.1+1.1kpc (Gwinn, Moran & Reid 1992). 

A variety of calibrated relative distance estimators have also been 
employed in the estimation of R_0. Classically, the best candidates are provided 
by populations of halo objects: as we shall see in §10.5, the density of halo 
objects peaks at the Galactic center, so R_0 can be estimated by determining 
the distance to this density peak. As we have seen in Chapter 1, the vast 
scale of the Milky Way was first established by Shapley (1921), who applied 
this approach using globular clusters as standard candles and rulers. Reid 
(1993) reviews the numerous attempts to refine his analysis. One major 
source of systematic error in these studies is uncertainty in the distances 
to individual clusters. These distances are usually determined from the 
apparent magnitudes of horizontal branch stars, including RR Lyrae variables 
(§6.1.2). Hence any uncertainty in the absolute magnitudes of RR Lyraes 
will inevitably show up as an error in the value of R_0 that one obtains from 
globular clusters. In these circumstances, the best procedure is to 
determine R_0 directly from field RR Lyraes, of which enormous samples are now 
available (Figure 5.14). 


Table 7.1 Galactic center distance estimates 

Method               Distance/kpc 
H2O masers           7.2 ± 0.7 
RR Lyrae stars       7.8 ± 0.4 
Globular clusters    8.0 ± 0.8 
Cepheids             8.0 ± 0.5 
Red clump stars      8.4 ± 0.4 

The two biggest problems to be overcome when determining R_0 from RR 
Lyraes or any other halo population are: (i) the effects of extinction; and (ii) 
uncertainties in the absolute magnitudes of the stars observed. Both of these 
problems are substantially mitigated if one observes in the near-IR (Fernley 
et al. 1987). In the fields studied near the Galactic center, extinction in the 
visual band amounts to A_V ~ 1.6 magnitudes, while in the near infrared, 
it is only A_K ~ 0.17 magnitudes. Moreover, the absolute magnitudes of 
RR Lyrae stars seem to vary less with [Fe/H] in the near-IR than they do 
in the visual band. Hence, the uncertain metallicities of RR Lyraes near 
the Galactic center have less impact on distance determinations made in the 
infrared than on those based on the optical distance modulus. 
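The benefit of the smaller infrared extinction can be illustrated with a few lines of Python. The sketch assumes, purely for illustration, that the extinction in each band is uncertain by 20%, and converts the resulting magnitude error into a fractional distance error through the distance modulus. The 20% figure and the function name are assumptions, not values from the studies cited.

```python
def fractional_distance_error(delta_mag):
    """Fractional distance error produced by a magnitude error delta_mag.

    From m - M = 5 log10(d / 10 pc) + A, an error delta_mag in the
    extinction A rescales the inferred distance by 10**(delta_mag / 5).
    """
    return 10.0 ** (delta_mag / 5.0) - 1.0

A_V = 1.6    # visual extinction quoted in the text (mag)
A_K = 0.17   # near-infrared extinction quoted in the text (mag)
ASSUMED_FRACTIONAL_UNCERTAINTY = 0.2  # hypothetical 20% error on each extinction

err_V = fractional_distance_error(ASSUMED_FRACTIONAL_UNCERTAINTY * A_V)
err_K = fractional_distance_error(ASSUMED_FRACTIONAL_UNCERTAINTY * A_K)
print(f"distance error from visual extinction:   {100 * err_V:.1f}%")
print(f"distance error from infrared extinction: {100 * err_K:.1f}%")
```

Under this assumption the extinction-induced distance error falls by roughly an order of magnitude on moving from the visual to the near-infrared.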

By fitting the observed apparent-magnitude distributions of RR Lyraes 
in two fields to those predicted if the density of stars is given by 


$$ \rho(R, z) \propto \frac{1}{\left(R^2 + z^2/q^2\right)^{\lambda/2}} \tag{7.33} $$


Carney et al. (1995) concluded that q ~ 0.8, λ ~ 2.3 and R_0 = 7.8 ± 0.4 kpc. 

A problem with this measurement of R_0 is that it is based on an 
absolute-magnitude calibration of RR Lyraes that appears to conflict with the 
generally accepted absolute-magnitude calibration of the Cepheids, in that it 
yields a distance modulus to the LMC that is 0.2 to 0.3 magnitudes smaller than 
that obtained from Cepheids. When Carney et al. take the distance to the 
LMC to be that given by the Cepheids and then determine M_K(RR Lyrae) 
from stars in the LMC, R_0 increases to 8.9 ± 0.5 kpc. This simple example 
illustrates the difficulty in determining the distance scale discussed above. 
One is often confronted by mutually inconsistent data sets, and the derived 
distances depend critically on which data one chooses to trust, which 
calibration of a relative distance estimator one adopts, etc. The sample estimates 
for R_0 listed in Table 7.1 should therefore be viewed with a healthy degree 
of skepticism. 
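The sensitivity of the result to the adopted calibration can be checked with a short calculation. The sketch below assumes that a shift in the RR Lyrae zero-point, expressed as a change in the LMC distance modulus, carries over unchanged to the stars near the Galactic center. This is a simplification, and the function name is ours.

```python
def rescale_distance(distance, delta_modulus):
    """Rescale a distance when the adopted distance modulus grows by delta_modulus."""
    return distance * 10.0 ** (delta_modulus / 5.0)

R0_RR_LYRAE = 7.8  # kpc, the RR Lyrae value quoted in the text

for delta in (0.2, 0.3):  # range of the LMC modulus discrepancy (mag)
    rescaled = rescale_distance(R0_RR_LYRAE, delta)
    print(f"modulus shift of {delta} mag -> R_0 = {rescaled:.2f} kpc")
```

A shift at the upper end of the quoted 0.2 to 0.3 magnitude range reproduces the increase to roughly 8.9 kpc.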

When comparing different estimates for R_0, one must take care to 
consider the “band-wagon effect,” whereby new published values show a 
tendency to agree with existing estimates, even if a completely different method 
has been employed. In fact, such a collective phenomenon is difficult to avoid: 
if a new method returns a value for R_0 in line with conventional wisdom, 
one is likely to write up and publish the result with little further thought; 
if, on the other hand, the method returns a discrepant value, one will search 
carefully for possible explanations, and will probably not produce a paper 
until the discrepancy can be explained. Thus, results in the literature will 
always tend to cluster around a current “accepted” value for R_0. Reid (1993) 
found clear evidence for this effect by plotting estimates for R_0 as a function 
of their publication date; he showed that there was a clear trend in this plot, 
with estimates for R_0 decreasing systematically with time from R_0 ~ 9 kpc in 
1975 to R_0 ~ 7.5 kpc in 1985. More recently, the published values for R_0 have 
recovered slightly to cluster around a best current estimate of R_0 ~ 8 kpc. 


Distance to the Large Magellanic Cloud The Large Magellanic Cloud 
(LMC) is sufficiently nearby that a range of stellar calibrators can be used to 
determine its distance. Since we can resolve individual stars in the globular 
clusters surrounding the LMC, we can estimate the distance to the system 
by main-sequence fitting (§3.5.2). For example, Walker (1985) observed the 
young globular cluster NGC 1866, and used the position of its main sequence 
in the CM diagram to determine a distance modulus for the cluster of 
m − M = 18.5 ± 0.2, which corresponds to a distance of 50 ± 5 kpc. 
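The conversion from distance modulus to distance, together with its uncertainty, can be sketched as follows. The function names are ours, and the error estimate is a first-order propagation that assumes the uncertainty in the modulus is small.

```python
import math

def modulus_to_distance_kpc(modulus):
    """Convert a distance modulus m - M into a distance in kpc."""
    return 10.0 ** (modulus / 5.0 + 1.0) / 1000.0

def distance_uncertainty_kpc(modulus, sigma_modulus):
    """First-order error: sigma_d = d * ln(10) / 5 * sigma_modulus."""
    distance = modulus_to_distance_kpc(modulus)
    return distance * math.log(10.0) / 5.0 * sigma_modulus

d = modulus_to_distance_kpc(18.5)
sigma_d = distance_uncertainty_kpc(18.5, 0.2)
print(f"d = {d:.1f} +/- {sigma_d:.1f} kpc")
```

An uncertainty of 0.2 magnitudes thus corresponds to an error of a little under 10% in distance.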

Analyses of variable stars (§7.3.1) yield consistent results with somewhat 
smaller uncertainties. In their review of the use of Cepheids as distance 
indicators, Feast & Walker (1987) find that the period-luminosity relation 
of equation (5.19) places the LMC Cepheids at a mean distance of 50.3 kpc. 
Interestingly, the derived distance depends on which part of the LMC is 
observed: Cepheids at the eastern edge of the LMC are 5% closer than those 
at the western edge (Gascoigne & Shobbrook 1978). This differential arises 
because the Cepheids populate the plane of the highly-flattened LMC, which 
is tilted with respect to the line of sight. Our ability to measure this small 
difference illustrates once again the power of relative distance indicators when 
absolute calibration is not required. 

Table 7.2 LMC distance estimates 

Method                             Distance/kpc 
Main-sequence fitting              50 ± 5 
Cepheids                           50 ± 2 
RR Lyrae                           44 ± 2 
SN1987a time delay                 52 ± 3 
SN1987a Baade-Wesselink method     55 ± 5 


The properties of fainter variable stars provide a further check on the 
distance to the LMC. A compilation of the RR Lyrae stars detected in 6 LMC 
globular clusters by Walker (1992a) reveals that on average they have mean 
apparent magnitudes of 18.9, with a dispersion from cluster to cluster of 
~0.1 mag. The average metallicities of these RR Lyrae stars were measured 
to be [Fe/H] ~ −1.9, and so adopting relation (5.24) we find that their mean 
absolute magnitudes are ⟨M_V⟩ = 0.7. The clusters must thus lie at distances 
in the range 44 ± 2 kpc. Since the LMC subtends an angle of some 10 degrees 
on the sky, it must have a radius of ~5 kpc, and so the scatter in distance 
estimates to globular clusters can all be explained if they are distributed 
in a halo around the LMC. The difference between the average RR Lyrae 
distance to the LMC and that derived from the Cepheid variables and 
main-sequence fitting led Walker (1992a) to suggest that the absolute calibration 
of equation (5.24) should be adjusted by ~0.2 magnitudes. 
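The numbers in this argument are easy to reproduce. The sketch below assumes that the quoted mean apparent magnitude is in the same band as the absolute magnitude, takes the 10 degrees to be the full angular diameter of the LMC, and interprets the suggested adjustment as making the RR Lyraes 0.2 magnitudes brighter. That is the direction needed to move the RR Lyrae distance toward the Cepheid value. The function names are ours.

```python
import math

def distance_kpc(apparent_mag, absolute_mag):
    """Distance in kpc from apparent and absolute magnitudes (no extinction)."""
    return 10.0 ** ((apparent_mag - absolute_mag) / 5.0 + 1.0) / 1000.0

def projected_radius_kpc(angular_diameter_deg, distance):
    """Physical radius of an object of given angular diameter at a given distance."""
    half_angle = math.radians(angular_diameter_deg / 2.0)
    return distance * math.tan(half_angle)

d_rr = distance_kpc(18.9, 0.7)            # calibration of relation (5.24)
d_rr_brighter = distance_kpc(18.9, 0.5)   # calibration brightened by 0.2 mag
radius = projected_radius_kpc(10.0, 50.0)

print(f"RR Lyrae distance: {d_rr:.1f} kpc")
print(f"with 0.2 mag brighter calibration: {d_rr_brighter:.1f} kpc")
print(f"LMC radius: {radius:.1f} kpc")
```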

The detonation of Supernova 1987A provided independent methods 
for determining the distance to the LMC. The application of the 
Baade-Wesselink method to the supernova’s expanding photosphere returned a 
distance of 55 ± 5 kpc (see §7.2.1), while the circum-supernova ring time delay 
gave a value of 52 ± 3 kpc (see §7.2.3). These absolute distance 
determinations are not dependent on any intermediate calibration, but, as we have 
discussed, they are subject to various systematic errors. Since the sources 
of uncertainty in the calibrated and absolute distance determinations are so 
different, their close agreement as to the distance to the LMC suggests that 
neither approach has been unduly compromised by errors. 

A summary of the results from the various methods for determining 
the distance to the LMC can be found in Table 7.2. There is a reasonable 
consensus from these disparate measurements that the center of the LMC 
lies approximately 50 kpc away from us. Averaging together the estimates in 
Table 7.2, we obtain a best estimate for the distance to the LMC of 50 ± 2 kpc, 
or a distance modulus of m − M = 18.5 ± 0.1. We can thus determine this 
distance to better than ~ 5%. 
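As a rough check on this average, the sketch below takes the straight mean of the five entries in Table 7.2 and estimates its uncertainty from the scatter between methods. Using the scatter in this way is a choice made here for illustration; the text does not specify how the quoted uncertainty was derived.

```python
import math
import statistics

# Distance estimates to the LMC from Table 7.2 (kpc).
lmc_distances_kpc = {
    "main-sequence fitting": 50.0,
    "Cepheids": 50.0,
    "RR Lyrae": 44.0,
    "SN1987a time delay": 52.0,
    "SN1987a Baade-Wesselink": 55.0,
}

values = list(lmc_distances_kpc.values())
mean_distance = statistics.mean(values)
# Standard error of the mean, estimated from the method-to-method scatter.
std_error = statistics.stdev(values) / math.sqrt(len(values))

print(f"mean = {mean_distance:.1f} kpc, standard error = {std_error:.1f} kpc")
```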


Distance to M31 The Andromeda Galaxy, M31, provided the first proof 
of the existence of an extragalactic distance scale through the detection by 
Hubble (1922) of its Cepheids. As the nearest large spiral galaxy (besides 
the Milky Way), it still provides an important calibration point on the extra- 
galactic scale. Since it lies significantly further than the LMC, several of the 
fainter classes of object used as distance estimators are no longer applicable. 
There is, however, still a respectable range of applicable distance estimators. 

Table 7.3 Andromeda Galaxy (M31) distance estimates 

Method                                   Distance/kpc 
Cepheids                                 760 ± 50 
RR Lyrae                                 750 ± 50 
Planetary Nebula Luminosity Function     750 ± 50 
Globular Cluster Luminosity Function     700 ± 60 
Novae                                    710 ± 80 


Recent observations of Cepheids in M31 have tied down the relation 
between their periods and apparent magnitudes quite closely. The data are 
now so good, in fact, that the error in the derived distance is dominated by 
the uncertainty in the zero-point calibration. Distances are therefore often 
quoted with respect to the LMC, so that there is no contribution to the error 
in distance from the zero-point; a recalibration of the scale only affects the 
distance to the LMC, and all other distances scale up or down accordingly. 
For example, recent comparisons between Cepheids in the LMC and those 
in M31 imply that M31 is 15.3 ± 0.8 times as far away (Freedman & Madore 
1990). If we adopt a distance of 50 ± 2 kpc for the LMC, this places M31 
at 760 ± 50 kpc. Similarly, observations of the apparent magnitudes of RR 
Lyrae stars in M31 compared with those in the LMC imply that the ratio 
of distances is 15 ± 1. 
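The step from a distance ratio to an absolute distance, including the propagation of both uncertainties, can be sketched as follows. The function name is ours, and the fractional errors are combined in quadrature on the assumption that the two uncertainties are independent.

```python
import math

def scaled_distance(ratio, sigma_ratio, base, sigma_base):
    """Distance = ratio * base, with fractional errors added in quadrature."""
    distance = ratio * base
    fractional_error = math.hypot(sigma_ratio / ratio, sigma_base / base)
    return distance, distance * fractional_error

# Cepheid distance ratio M31/LMC and the adopted LMC distance (kpc).
d_m31, sigma_m31 = scaled_distance(15.3, 0.8, 50.0, 2.0)
print(f"M31 distance = {d_m31:.0f} +/- {sigma_m31:.0f} kpc")
```

Most of the error budget comes from the ratio itself; the LMC distance contributes the remainder and is common to every LMC-based entry in Table 7.3.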

The distance estimates that are based on the properties of luminosity 
functions (§7.3.2) have also been applied to M31. Jacoby, Walker & 
Ciardullo (1990) have compared the planetary-nebula luminosity function of 
the LMC with that of M31, and find that the distributions of apparent 
brightness for planetary nebulae are consistent with a single intrinsic luminosity 
function if the distance to M31 is 15 ± 1 times the distance to the LMC, in 
excellent agreement with the other estimators. Similar results have also been 
obtained from globular clusters. The peak of the globular cluster luminosity 
function in M31 lies at an apparent V-band magnitude (corrected for reddening) 
of 16.8 ± 0.2 (Racine & Harris 1992). Assuming that this luminosity function is 
intrinsically the same as the Milky Way’s, which peaks at an absolute 
magnitude of −7.4 ± 0.1 (see §7.3.2), we obtain a distance modulus for M31 of 
24.2 ± 0.2, which corresponds to a distance of 700 ± 60 kpc. This last value, 
which is consistent with the other cited estimates, is calibrated directly with 
respect to the Milky Way rather than by way of the LMC. The good 
agreement between the values adds confidence not only to the determination of 
the distance to M31, but also to the adopted distance to the LMC and the 
assumption that the globular cluster luminosity function is universal. 

A summary of recent determinations of the distance to M31 is presented 
in Table 7.3. As these results show, there is excellent agreement between the 
different methods. Combining the results weighted appropriately by their 
errors, we obtain a best estimate of 740 ± 40 kpc, with a corresponding 
distance modulus of m − M = 24.3 ± 0.1. Where appropriate, the errors also 
include the contribution from the uncertainty in the distance to the LMC. 
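An inverse-variance weighted mean of the entries in Table 7.3 can be computed as in the sketch below. It treats the five estimates as statistically independent, which is an assumption made for simplicity. Several of them share the uncertainty in the LMC distance, so the formal error returned here is smaller than the uncertainty quoted in the text.

```python
import math

def weighted_mean(values, sigmas):
    """Inverse-variance weighted mean and its formal uncertainty."""
    weights = [1.0 / s ** 2 for s in sigmas]
    mean = sum(w * v for w, v in zip(weights, values)) / sum(weights)
    return mean, 1.0 / math.sqrt(sum(weights))

# Distances and errors from Table 7.3 (kpc), in table order.
distances = [760.0, 750.0, 750.0, 700.0, 710.0]
errors = [50.0, 50.0, 50.0, 60.0, 80.0]

m31_mean, m31_formal_error = weighted_mean(distances, errors)
print(f"weighted mean = {m31_mean:.0f} kpc (formal error {m31_formal_error:.0f} kpc)")
```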

Due to its proximity, M31 has the best-determined distance of any large 
galaxy. It therefore plays a key role in calibrating distance estimators that 
are based on such galaxies. For example, for the surface-brightness 
fluctuation method (§7.3.5) we can calibrate the variation with V − I color in 
equation (7.32) using galaxies in clusters at unknown distances, but the 
absolute calibration of the relation (the constant on the right-hand side of 
this formula) requires surface brightness fluctuation analysis for a galaxy at 
known distance. The bulge of M31 was found by Tonry (1991) to have 
surface brightness fluctuations [as defined by equation (7.30)] with an amplitude 
in magnitudes of ⟨m_I⟩ = 23.16. Assuming that M31 lies at a distance of 740 
kpc, this value corresponds to an absolute magnitude of ⟨M_I⟩ = −1.2. Since 
the bulge of M31 has a color of V − I = 1.19, the constant in equation (7.32) 
follows. 
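The conversion of the observed fluctuation magnitude into an absolute magnitude is a one-line application of the distance modulus, sketched below. The variable names are ours, and no extinction correction is applied.

```python
import math

def distance_modulus(distance_kpc):
    """Distance modulus m - M for a distance given in kpc."""
    return 5.0 * math.log10(distance_kpc * 1000.0) - 5.0

m_fluctuation = 23.16  # observed fluctuation magnitude for the bulge of M31
M_fluctuation = m_fluctuation - distance_modulus(740.0)
print(f"absolute fluctuation magnitude = {M_fluctuation:.2f}")
```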


7.4.2 Distances beyond the Local Group 


As can be seen from Figure 7.7, a good range of techniques is available for 
measuring distances to galaxies immediately beyond the Local Group. Some 
of these methods are based on the properties of the galaxies themselves, while 
others use their brightest inhabitants. When more than one method can be 
used to calculate the distances to galaxies, the results for the different tech- 
niques give respectable agreement out to distances beyond 20 Mpc (Jacoby et 
al. 1992). Figure 7.8 shows the comparison between distances to clusters of 
galaxies obtained by using the surface brightness fluctuation method on their 
elliptical members (§7.3.5) and those obtained by a number of other methods. 
The agreement with distances obtained by analyzing planetary nebula and 
globular cluster luminosity functions (§7.3.2) is very good. The distances 
derived using the Tully-Fisher relation (§7.3.4) for spiral galaxies in these 
clusters show significantly poorer agreement. Since there are also systematic 
differences in redshift between ellipticals and spirals in the clusters for which 
the agreement is poorest (Jacoby et al. 1992), it seems likely that some of 
the disagreement arises because the ellipticals from which surface-brightness 
distances are derived and the spirals are not intermixed in the same physical 
association. This disagreement suggests that the Tully-Fisher relation is not 
suitable for obtaining the distances to clusters of galaxies. The distances 
to elliptical galaxies obtained via the D_n-σ relation also show a degree of 
scatter relative to the surface-brightness distances, although the larger errors 
on individual D_n-σ relation distances are at least partially responsible. 


Distance to the Virgo Cluster The Virgo Cluster is the nearest 
moderately large cluster of galaxies. Since the cluster contains both spiral and 
elliptical galaxies in large numbers, and has also been the site of a number of 
well-observed supernovae, it provides an ideal calibration point for 
comparing the results obtained using different distance estimators. Moreover, once 
its distance has been determined, we can also infer the absolute properties 
of what is presumably a fairly average cluster of galaxies; such properties 
might then form the calibration point for techniques to derive the distances 
to other, more distant clusters. 


Figure 7.8 Comparison between the distances to nearby galaxies derived by the surface 
brightness fluctuation method, and those obtained using the planetary nebula and globular 
cluster luminosity functions, the Tully-Fisher relation for spiral galaxies, and the D_n-σ 
relation for ellipticals. [From data kindly provided by J. Tonry] 

A wide range of techniques has been applied to measuring the distance 
to the Virgo Cluster. In their extensive review of the subject, Jacoby et 
al. (1992) showed that the three methods with the smallest uncertainties 
(surface-brightness fluctuations, planetary-nebula luminosity function, and 
the Tully-Fisher relation) all provided consistent distance estimates of 
~16 ± 1 Mpc (m − M = 31.0 ± 0.1). 

The one seriously conflicting measurement comes from the analysis of 
type Ia supernovae. The Cepheid calibration of the brightnesses of nearby 
supernovae implies that their absolute magnitudes are M_B = −19.7 ± 0.1 at 
maximum light (§7.3.3), which implies that at a Virgo Cluster distance of 
16 Mpc, their brightest apparent magnitudes should be m_B = 11.3 ± 0.1. In 
fact, they should appear somewhat fainter than this value due to absorption 
both in the Milky Way and in their host galaxies. Capaccioli et al. (1990) 
have compiled a sample of 10 type Ia supernovae in Virgo Cluster galaxies. 
Using the colors of the supernovae to measure the amount of reddening, they 
find that the extinction-corrected apparent magnitudes of these objects have 
a mean of m_B = 12.1 ± 0.1. This value is significantly fainter than the 
prediction. In order to reconcile the observed apparent magnitudes with the 
calibrated absolute magnitudes of the supernovae, the Virgo Cluster would 
have to lie at a distance of 23 ± 2 Mpc (m − M = 31.8 ± 0.2). As we have seen 
in §7.3.3, some doubt has now been cast on the role of type Ia supernovae as 
standard candles. Non-standard luminosities for the few supernovae used to 
calibrate their absolute magnitudes could well explain the discrepancy, but 
this suggestion has yet to be explored in detail. 
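The size of the supernova discrepancy can be reproduced directly from the distance modulus, as in the sketch below. The function and variable names are ours; the magnitudes are those quoted in the text.

```python
import math

def distance_modulus(distance_mpc):
    """Distance modulus m - M for a distance given in Mpc."""
    return 5.0 * math.log10(distance_mpc * 1.0e6) - 5.0

def distance_mpc(modulus):
    """Distance in Mpc corresponding to a distance modulus m - M."""
    return 10.0 ** (modulus / 5.0 + 1.0) / 1.0e6

M_B = -19.7            # Cepheid-calibrated peak absolute magnitude
observed_m_B = 12.1    # mean extinction-corrected peak magnitude in Virgo

predicted_m_B = M_B + distance_modulus(16.0)
implied_distance = distance_mpc(observed_m_B - M_B)

print(f"predicted peak magnitude at 16 Mpc: {predicted_m_B:.1f}")
print(f"distance implied by the observed magnitude: {implied_distance:.0f} Mpc")
```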

In an effort to resolve this conflict once and for all, enormous efforts have 
been expended to detect Cepheid variable stars in the Virgo Cluster. 
Meticulous observations of the Virgo galaxy NGC 4571 using the 
Canada-France-Hawaii Telescope by Pierce et al. (1994) revealed three candidate 
Cepheids at apparent magnitudes of ~24. The periods derived for these very 
faint stars consistently follow the expected Cepheid period-luminosity 
relation if the galaxy is placed at a distance of 14.9 ± 1.2 Mpc. Subsequently, the 
unparalleled imaging quality of the Hubble Space Telescope enabled 
Freedman et al. (1994) to identify 20 Cepheids in the Virgo galaxy M100, and the 
period-luminosity relation for these stars implies that this galaxy is at a 
distance of 17.1 ± 1.8 Mpc. Given the errors on these estimates and the intrinsic 
size of the Virgo Cluster (a few megaparsecs in depth), these measurements 
are entirely consistent with a Virgo Cluster distance of 16 ± 1 Mpc. 

A summary of the recent determinations of the distance to the Virgo 
Cluster is presented in Table 7.4. Note that the Cepheid determination 
involves fewer steps to establish the distance scale, since it goes directly from 
Galactic measurements of Cepheid variables to observations of these stars in 
the Virgo Cluster; all of the other methods involve calibrations using objects 
at intermediate distances. Since systematic errors at each calibration point 
generate the worst uncertainties in the cosmic distance scale, correspondingly 
greater weight should be given to the Cepheid measurement. 


Table 7.4 Virgo Cluster distance estimates 

Method                                   Distance/Mpc 
Surface Brightness Fluctuations          16 ± 1 
Planetary Nebula Luminosity Function     15 ± 1 
Cepheids                                 16 ± 1 
Tully-Fisher Relation                    16 ± 2 
D_n-σ Relation                           17 ± 2 
Type Ia Supernovae                       23 ± 2 
Globular Cluster Luminosity Function     19 ± 4 
Novae                                    21 ± 4 

SOURCE: adapted from Jacoby et al. (1992) 


Peculiar velocity field We can use the various distance indicators to 
search for local departures from the Hubble law. If the Hubble law is obeyed 
exactly, then a plot of the derived distance versus redshift for a sample 
of galaxies should obey the straight-line relation of equation (7.2). Any 
departures from this straight line can be attributed to the contribution to 
the redshift from local peculiar velocities. Happily, this procedure is not 
critically dependent on the uncertain absolute calibration of the distance 
indicators: a change in the calibration will change the slope of the 
distance-redshift relation, but any departures from the straight-line relation will still 
be apparent. The best available distance estimators have uncertainties of 
~10%, so only relatively large disturbances to the Hubble flow are measurable. 
The large errors also mean that Malmquist effects can significantly distort 
the apparent departures from the Hubble law, and steps must be taken to 
minimize these biases (see §3.6.1). 
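A minimal sketch of this procedure is given below. It assumes that equation (7.2) has the usual linear form, recession velocity = H_0 × distance, and takes the residual of each galaxy from that line as its line-of-sight peculiar velocity. The galaxy data and the adopted value of H_0 are hypothetical and serve only to show the arithmetic.

```python
def peculiar_velocity(cz_km_s, distance_mpc, hubble_constant):
    """Residual of the observed velocity cz from the Hubble-law prediction."""
    return cz_km_s - hubble_constant * distance_mpc

H0_ASSUMED = 65.0  # km/s/Mpc, illustrative value only

# Hypothetical galaxies: (name, observed cz in km/s, measured distance in Mpc).
galaxies = [("A", 1300.0, 20.0), ("B", 1700.0, 30.0), ("C", 2900.0, 40.0)]

for name, cz, distance in galaxies:
    v_pec = peculiar_velocity(cz, distance, H0_ASSUMED)
    print(f"galaxy {name}: peculiar velocity = {v_pec:+.0f} km/s")
```

With 10% distance errors, the uncertainty in the predicted velocity of a galaxy at 30 Mpc is already about 200 km/s for this value of H_0, which is why only large disturbances can be detected.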


The best-established large-scale departure from the Hubble flow is the 
Virgocentric infall, which has been investigated by many authors. By 
comparing distances to 300 spiral galaxies derived from the infrared Tully-Fisher 
relation (§7.3.4) with those predicted from the galaxies’ redshifts, Aaronson et 
al. (1982) found systematic residuals, which can be understood if there is a 
large-scale motion of galaxies towards the Virgo Cluster. Since this cluster is 
the closest large concentration of visible material, it should not be surprising 
if it also represents a local maximum in mass density which gravitation- 
ally attracts surrounding material, causing the observed departures from the 
Hubble flow. 


Using the D_n-σ relation (§7.3.4) for a sample of 400 elliptical galaxies, 
Lynden-Bell et al. (1988) looked at the distribution of peculiar velocities out 
to higher redshifts. In addition to the Virgocentric infall, they found strong 
evidence for a bulk flow of galaxies towards a point in the constellation of 
Centaurus, which has been dubbed the Great Attractor. This region is 
heavily obscured since it lies close to the Galactic plane, so it is difficult to 
observe the attractor itself in any detail. However, there is certainly a large 
concentration of clusters of galaxies in this part of the sky, and so it seems 
likely that the Great Attractor represents the center of this concentration, 
the Local Supercluster. To give some idea of the gravitational importance 
of this supercluster on the Hubble flow, the influence of the Virgo cluster has 
induced an infall of ~250 km s⁻¹, whereas the motion of the Local Group 
caused by the much more distant Great Attractor is close to 600 km s⁻¹. 




7.4.3 The asymptotic Hubble constant 


The ultimate aim in establishing the cosmic distance scale was to reach to 
sufficient distances to measure the Hubble flow directly. Not only does the 
Hubble law set the ultimate scale of the Universe via equation (7.2), but it 
also provides us with an estimate of the age of the Universe via equation (7.4). 

Unfortunately, as we have seen in §7.4.2, even the Virgo Cluster is not 
sufficiently distant for its motion to be dominated by the Hubble flow, and so 
we cannot estimate the Hubble constant simply by substituting this cluster’s 
distance and redshift into equation (7.2). We could use a model of the pecu- 
liar velocity field in order to try to remove this contaminant from the Virgo 
Cluster’s cosmological redshift. However, the size of the correction required 
is rather uncertain: estimates of the size of the Local Group’s peculiar infall 
velocity towards Virgo have ranged from −90 km s⁻¹ to 520 km s⁻¹ (Davis & 
Peebles 1983), and there is still an uncertainty of several hundred kilometers 
per second in the size of this motion. Since the total redshift of the Virgo 
Cluster is only 1035 km s⁻¹ (Huchra 1984), the uncertainty in the infall 
correction produces a considerable uncertainty in the cluster’s cosmic redshift, 
so it is not possible to obtain an unambiguous determination of the Hubble 
constant from Virgo Cluster data. 
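The effect of the uncertain infall correction can be made concrete with the sketch below. It assumes that the cosmological velocity of the cluster is obtained by adding the Local Group's infall velocity to the observed redshift velocity, and it adopts a Virgo distance of 16 Mpc. Both the sign convention and the function name are our assumptions.

```python
def hubble_constant(observed_velocity, infall_velocity, distance_mpc):
    """H_0 in km/s/Mpc from an observed velocity corrected for infall."""
    return (observed_velocity + infall_velocity) / distance_mpc

VIRGO_VELOCITY = 1035.0   # km/s, observed redshift velocity of the cluster
VIRGO_DISTANCE = 16.0     # Mpc, adopted distance

h0_low = hubble_constant(VIRGO_VELOCITY, -90.0, VIRGO_DISTANCE)
h0_high = hubble_constant(VIRGO_VELOCITY, 520.0, VIRGO_DISTANCE)
print(f"H_0 ranges from {h0_low:.0f} to {h0_high:.0f} km/s/Mpc")
```

The full range of published infall velocities therefore spans far more than the difference between the long and short distance scales.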

We must therefore look to more distant objects. One obvious target is 
the Coma Cluster. This rich cluster of galaxies is the closest such system 
and has therefore been studied in great detail. With a heliocentric velocity 
of v_C = 6925 km s⁻¹ (Fukugita et al. 1991), the uncertain correction for the 
Local Group’s infall towards Virgo is not a significant factor when deriving 
the Hubble constant. The Coma Cluster is sufficiently distant that many of 
the techniques which have been used to measure distances beyond the Local 
Group can no longer be applied (see Figure 7.7). For example, although 
globular clusters have now been detected around Coma Cluster galaxies (see 
§7.3.2), the observations do not yet reach faint enough magnitudes to measure 
the peak of their luminosity function, so this standard candle is not available 
to us. 

Since rich galaxy clusters like Coma are mostly populated by elliptical 
galaxies, the various elliptical kinematic estimators such as the D_n-σ relation 
(see §7.3.4) provide the best constraints on the Coma Cluster’s distance. The 
poor absolute calibration of these relations means that they are best suited to 
measuring the Coma Cluster’s distance relative to that of the Virgo Cluster. 
This comparison, based on the D_n-σ relation, yields the result that the Coma 
Cluster is ~5.5 times more distant than the Virgo Cluster (Dressler et al. 
1987). In fact, because relative distance measurements such as this one are 
not dependent on uncertain zero-point calibrations, there is generally good 
agreement between all the different methods for obtaining this distance ratio. 
A compilation by van den Bergh (1992) showed that 8 out of 12 distance 
determinations based on cluster galaxy properties returned relative distances 
to these two clusters in the range 5.3 to 5.8. A standard candle analysis of the 
type Ia supernovae that have exploded in both clusters yielded a consistent 
value for the ratio of 5.6 (Capaccioli et al. 1990). Combining these various 
analyses, we obtain a best estimate for the Coma-Virgo distance ratio of 
5.5 ± 0.2. 

Taking this relative distance measure in conjunction with the observed 
Hubble-flow velocity of the Coma Cluster, we can parameterize the Hubble 
constant in terms of the distance to the Virgo Cluster, D_V, as follows: 


$$ H_0 = (63 \pm 2)\left(\frac{D_V}{20\,\mathrm{Mpc}}\right)^{-1}\ \mathrm{km\,s^{-1}\,Mpc^{-1}} \tag{7.34} $$


Thus, if we adopt D_V = 16 ± 1 Mpc as the distance to the Virgo Cluster 
then H_0 = 79 ± 6 km s⁻¹ Mpc⁻¹. If, on the other hand, the Virgo distance 
determination based on type Ia supernovae, D_V = 23 ± 2 Mpc, is correct, 
then H_0 = 55 ± 5 km s⁻¹ Mpc⁻¹. 
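Equation (7.34) and the two values derived from it can be checked numerically. The sketch below assumes that the Hubble-flow velocity of Coma may be approximated by the heliocentric velocity of 6925 km/s quoted earlier, so the coefficient it recovers is only approximately the one in equation (7.34). The function names are ours.

```python
def coefficient_from_coma(coma_velocity, distance_ratio, reference_mpc=20.0):
    """H_0 (km/s/Mpc) implied if Virgo lies at the reference distance."""
    return coma_velocity / (distance_ratio * reference_mpc)

def hubble_from_virgo_distance(d_virgo_mpc, coefficient=63.0, reference_mpc=20.0):
    """Evaluate equation (7.34) for a given Virgo Cluster distance."""
    return coefficient * reference_mpc / d_virgo_mpc

coefficient = coefficient_from_coma(6925.0, 5.5)
h0_short = hubble_from_virgo_distance(16.0)   # short distance scale
h0_long = hubble_from_virgo_distance(23.0)    # supernova-based distance

print(f"coefficient = {coefficient:.1f} km/s/Mpc")
print(f"H_0 = {h0_short:.0f} (D_V = 16 Mpc), {h0_long:.0f} (D_V = 23 Mpc) km/s/Mpc")
```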

Type Ia supernovae are sufficiently bright that they can be observed 
directly out to distances where their redshifts are dominated by the Hubble 
flow (see Figure 7.7). Since all methods agree fairly closely on relative dis- 
tances beyond the Virgo Cluster, the calibration of type Ia supernovae which 
returns a large distance to the Virgo Cluster also produces larger estimates 
than other methods for more distant objects. The Hubble constant derived 
directly from type Ia supernovae thus also lies at the low end of the range, 
with a typical estimate of H_0 = 47 ± 5 km s⁻¹ Mpc⁻¹ (Sandage & Tammann 
1993). 

Here, then, is where most of the residual discrepancy between the long 
and short distance scales rests. The advocates for the short distance scale 
point out the close agreement between a wide range of distance estimators 
which all imply a small distance to the Virgo Cluster, and a correspondingly 
large Hubble constant. Those who favor the long distance scale use the 
supernova type Ia distance scale as the primary evidence for a low value 
of the Hubble constant, and emphasize the systematic errors in less direct 
standard candles to explain their discrepant distance estimates to the Virgo 
Cluster and beyond. 

There are, however, distinct signs of convergence between these two 
deeply entrenched camps. Recalibration of the absolute magnitudes of seven 
type Ia supernovae using HST observations of Cepheids in their host galaxies 
led Saha et al. (1995), staunch advocates for the long distance scale, to 
estimate that H_0 = 58 km s⁻¹ Mpc⁻¹. One can, therefore, now quote a value 
of 

$$ H_0 = (65 \pm 10)\ \mathrm{km\,s^{-1}\,Mpc^{-1}} \tag{7.35} $$


without upsetting anyone unduly. 

It is also notable that the absolute distance measures are starting to 
converge toward similar values. In the case of the SZ effect (see §7.2.2), better 
observations of both the X-ray properties of clusters and the distortions in the 
microwave background are leading to larger inferred values of the Hubble 
constant. Thus, instead of obtaining estimates like H_0 = 24 km s⁻¹ Mpc⁻¹ 
(McHardy et al. 1990), more recent SZ measurements result in values of 
between 60 and 80 km s⁻¹ Mpc⁻¹ (Holzapfel et al. 1997). Similarly, Kundić et 
al. (1997) have found that the gravitational lens time delay method (§7.2.3) 
applied to QSO 0957+561 yields H_0 = (64 ± 13) km s⁻¹ Mpc⁻¹. The 
skeptical reader might invoke the band-wagon effect discussed above in the context 
of determining R_0 (see §7.4.1) to explain this convergence. Nevertheless, it is 
heartening that such a broad variety of methods can return values consistent 
with equation (7.35). 

However, there remains one factor which strongly suggests that we 
should favor values for H_0 toward the lower end of the range in 
equation (7.35). For any adopted value of the Hubble constant, one can 
estimate the age of the Universe using equation (7.4). If we adopt H_0 = 
65 km s⁻¹ Mpc⁻¹ then t_0 = 1/H_0 = 1.5 × 10¹⁰ years. This value is alarmingly 
close to the ages of the oldest globular clusters, which stellar evolution 
calculations find to be also ~1.5 × 10¹⁰ years (§6.1.4). If both these numbers 
are correct, then globular clusters must have formed surprisingly early in the 
evolution of the Universe. In fact, as we discuss below, the Hubble time t_0 
provides only an upper limit to the age of the Universe, so even this marginal 
consistency between cosmological and stellar ages breaks down. As we have 
seen in Chapter 6, future progress in reconciling these numbers requires that 
we look carefully at the globular cluster age determinations as well as refining 
the more direct determinations of H_0. 
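The conversion from a Hubble constant to a Hubble time is a matter of units, as the sketch below shows. The conversion constants are standard rounded values and the function name is ours.

```python
KM_PER_MPC = 3.0857e19       # kilometers in one megaparsec
SECONDS_PER_YEAR = 3.156e7   # seconds in one year

def hubble_time_years(hubble_constant):
    """Hubble time 1/H_0 in years, for H_0 in km/s/Mpc."""
    return KM_PER_MPC / hubble_constant / SECONDS_PER_YEAR

for h0 in (55.0, 65.0, 80.0):
    print(f"H_0 = {h0:.0f} km/s/Mpc -> Hubble time = {hubble_time_years(h0):.2e} yr")
```

Because the Hubble time scales inversely with H_0, values toward the lower end of the range leave more room for the oldest globular clusters.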


One remarkably simple error, which would go a long way toward ex- 
plaining the conflict between globular cluster age estimates and the age of 
the Universe, is that we may have adopted too faint an absolute magnitude 
calibration for the main sequence. Such an error would have two effects. 
First, it would mean that we have underestimated the distances to clusters 
obtained by MS fitting. Since the cosmic distance scale has traditionally 
been anchored by such MS fits, this error will propagate out to large scales, 
so that all distances will be similarly underestimated. Thus, the current 
measurements for Hp will be overestimated, and the calculated age of the 
Universe, to x H+ [equation (7.4)| will be too small. Some of the age es- 
timates for globular clusters will also be affected by adopting too faint an 
absolute magnitude scale for the MS. Specifically, the adopted absolute mag- 
nitudes of the turn-off points from the MS will be too faint, and thus the 
ages calculated from equation (6.2) will be systematically too small. Hence, 
if the absolute magnitude calibration for the MS is made brighter, then the 
estimates for the age of the Universe become larger and the ages of globular 
clusters become smaller. A change of only $-0.2$ magnitudes in this calibration
would be enough to reconcile the difference (see Problem 7.7), and there
are already indications that an alteration of this order may be appropriate 
(Feast & Catchpole 1997). 
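
The size of this effect can be sketched numerically using the fractional sensitivities quoted in Problem 7.7 (a change of -0.46 per magnitude for the age of the Universe and +0.85 per magnitude for cluster ages) together with the starting ages given there. The function name and the linear treatment are illustrative assumptions, not a substitute for working the problem.

```python
def shifted_ages(delta_m, t_universe=12.0, t_cluster=15.0):
    """Apply a calibration shift delta_m (magnitudes) to both ages (Gyr).

    Linear sensitivities are those quoted in Problem 7.7:
    fractional change -0.46*delta_m for the Universe,
    fractional change +0.85*delta_m for globular clusters.
    """
    new_universe = t_universe * (1.0 - 0.46 * delta_m)
    new_cluster = t_cluster * (1.0 + 0.85 * delta_m)
    return new_universe, new_cluster

new_t0, new_tc = shifted_ages(-0.2)
print(f"Universe: {new_t0:.2f} Gyr, clusters: {new_tc:.2f} Gyr")
```

In this linear approximation a shift of -0.2 magnitudes raises the age of the Universe above the cluster ages.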
7.4.4 The deceleration parameter and cosmic density 


As we have seen in §7.1, the value of the deceleration parameter, $q_0$, is as
intriguing a cosmological quantity as the Hubble constant. Just as $H_0$ can
be used to estimate the age of the Universe, so the value of $q_0$ predicts its
ultimate fate: if $q_0 < 0.5$ then the Universe will expand forever, whereas
if $q_0 > 0.5$ then the Universe will ultimately stop expanding and collapse
back on itself. The value of $q_0$ also has implications for the exact age of the
Universe. Gravitational deceleration means that the Universe is currently
expanding at a lower velocity than was the case in the past. In using the
Hubble time, $t_0$, as an estimate for the age of the Universe, we neglect this
effect and overestimate the true value. For example, if $q_0 = 0.5$ then the
actual age of the Universe is only $\frac{2}{3}t_0$ (Problem 7.1).

We already know that the deceleration parameter cannot differ from the
critical value of $q_0 = 0.5$ by orders of magnitude. If $q_0$ were very large,
then the expansion of the Universe would already have been halted and the
Universe would have collapsed back on itself. Conversely, a very small value
of $q_0$ would imply that there was almost no gravitating matter in the
Universe, and this implication conflicts with the large amount of material in
galaxies, clusters of galaxies, etc., that we observe. Is it just a coincidence
that $q_0$ lies reasonably close to the critical value? We can appeal to
anthropocentric arguments to explain the coincidence: if $q_0$ had a more
extreme value, then there would either be no Universe or no galaxies at the
current time, and hence no human life could have evolved to ponder the
question. However, there may also be fundamental physical reasons why the
density of the Universe should be close to the critical value, $\rho_c$, and hence
why $q_0 \approx 0.5$. In particular, inflationary theories of cosmology imply that
the early Universe underwent a period of enormously rapid expansion which
should have driven its average density very close to $\rho_c$ [see Guth (1989) for
a non-mathematical description of this theory]. The observational
determination of $q_0$ would enable us to test these theories directly and see
whether the Universe really is critically balanced between eternal expansion
and ultimate collapse.


Standard candles and rulers Given the current uncertainties in the
determination of the Hubble constant, it seems unlikely that the subtler
cosmological effects of the deceleration parameter will be detectable in
observations of standard candles and rulers. In one sense, however, $q_0$ is
easier to measure than $H_0$. The Hubble constant can only be calculated from
the apparent brightness of a standard candle via equation (7.7) if the source’s
intrinsic luminosity is known. However, $q_0$ can in principle be derived by
observing a sample of standard candles of unknown luminosity. If we
produce a plot of distance modulus versus the logarithm of redshift (known as
a Hubble diagram) for a set of standard candles, then at low redshifts the
Hubble law implies that we should see a straight line. However, at high
redshifts the departure from the linear Hubble law in equation (7.7) means that
the relation will curve in a way which depends solely on $q_0$. If the absolute
magnitude of the standard candles is unknown, then there will be an
indeterminate offset in the vertical position of the locus occupied by the
standard candles in the Hubble diagram, but the characteristic curvature in
the locus due to $q_0$ will be preserved. We should thus be able to solve directly
for $q_0$ from the shape of this relation without knowing the intrinsic
luminosity of the standard candles.
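
To see how the curvature depends on the deceleration parameter alone, the following sketch evaluates the departure of the distance modulus from the linear Hubble law. It assumes the standard Mattig relation for the luminosity distance in a matter-dominated Friedmann model with zero cosmological constant; this may differ in form from equation (7.7), and the function names are ours.

```python
import math

def dl_in_hubble_units(z, q0):
    """Luminosity distance in units of c/H0 (Mattig relation, Lambda = 0)."""
    if q0 == 0.0:
        return z * (1.0 + 0.5 * z)           # limiting form for an empty model
    root = math.sqrt(1.0 + 2.0 * q0 * z)
    return (q0 * z + (q0 - 1.0) * (root - 1.0)) / q0 ** 2

def modulus_offset(z, q0):
    """Distance modulus minus its linear Hubble-law value, in magnitudes.

    Both H0 and the candle's absolute magnitude cancel in this difference,
    so only the shape of the relation remains.
    """
    return 5.0 * math.log10(dl_in_hubble_units(z, q0) / z)

for z in (0.01, 0.1, 0.5, 1.0):
    gap = modulus_offset(z, 0.0) - modulus_offset(z, 0.5)
    print(f"z = {z:4.2f}: difference between q0 = 0 and q0 = 0.5 is {gap:.3f} mag")
```

The two models are indistinguishable at low redshift and separate by roughly half a magnitude by a redshift of unity, which sets the photometric accuracy a real measurement would need.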

Unfortunately, such analyses are plagued by a deep-rooted problem. 
Specifically, the light travel time from putative standard candles at high red- 
shift is a significant fraction of the age of the Universe, and almost all objects 
will have evolved significantly over this timescale. The unknown variation in 
intrinsic luminosity during this evolutionary process is likely to dominate the 
departure from a straight line in the Hubble diagram, swamping the subtler 
cosmological effects. For example, Hoessel, Gunn & Thuan (1980) have found 
that if the brightest galaxies in rich clusters are assumed to be standard
candles, then the cosmological model which provides formally the best fit to
the observed Hubble diagram of these objects would imply that $q_0 \sim -0.5$;
that is, the Universe appears to be expanding at an accelerating rate. This unphysical
result can be explained straightforwardly if brightest cluster galaxies are not 
standard candles, but are intrinsically fainter at high redshifts. This lumi- 
nosity evolution is exactly what we would expect if such galaxies grow over 
time by “cannibalizing” some of their neighbors (Hausman & Ostriker 1978). 
Since we do not understand the details of this evolutionary process, it is not 
possible to disentangle its effects from those of the cosmological model, and 
so $q_0$ cannot be estimated from these data. 

Recently, some progress has been made toward measuring $q_0$ by focusing
on objects for which we expect the evolutionary effects to be small. One
appealing class of object for such a study is the type Ia supernova. The 
simplicity of the explosion producing such objects suggests that evolution- 
ary effects should be small, and they are bright enough to be seen right 
across the Universe. Garnavich et al. (1998) and Perlmutter et al. (1998) 
have made very high quality observations with HST and ground-based in- 
struments, which have enabled them to follow the light curves of several type 
Ia supernovae at high redshift. Allowing for the possible variations in su- 
pernova luminosity with color (see §7.3.3), these collaborations were able to 
calculate the distance moduli for these supernovae. As Figure 7.9 illustrates, 
the remarkably tight linear relation at low redshift in the resulting Hubble 
diagram shows what good standard candles type Ia supernovae can provide. 
Although the data are still rather sparse, there are strong indications that
the departures from the linear Hubble law at large redshifts follow what one
would expect if $q_0$ is small; the data in Figure 7.9 rule out a value of
$q_0 = 0.5$ at the 95% confidence level.

A conflicting result, which favors a high value of $q_0$, has been obtained
by Kellerman (1993). He has examined the angular extents of compact active 
galaxy radio sources using very high resolution imaging techniques. These 
sources have typical angular extents of ~ 0.01 arcseconds, and it has been 




Figure 7.9 Hubble diagram, showing the distance moduli of type 
Ia supernovae as a function of their redshifts. The solid line shows 
the predicted relation for $q_0 = 0$ ($\Omega = 0$), while the dotted line
shows the relation for $q_0 = 0.5$ ($\Omega = 1$). The bottom panel shows
the difference between data and models. [After Garnavich et al. 
(1998) courtesy of P. Garnavich] 


suggested that their small size means that their properties do not depend on
the large-scale structure of their surroundings. These sources might,
therefore, make useful standard rulers which are relatively unaffected by
large-scale evolution in the surrounding environment. Figure 7.10 shows the
angular diameters for a sample of 82 such radio sources as a function of their
redshifts. Although the angular extents of these sources initially decrease
with redshift, they seem to stop getting smaller at a redshift of around unity.
In fact, within the uncertainties the plot seems to follow the relation
predicted by equation (7.8) for $q_0 = 0.5$. Either unexpected evolutionary
effects have somehow conspired to produce a variation in apparent angular size
with redshift which mimics the effects of a critically-bound Universe, or these
results imply that $q_0 \sim 0.5$.
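
The turnover in angular size near a redshift of unity can be illustrated with a short calculation. The sketch below assumes the Mattig relation for a matter-dominated Friedmann model with zero cosmological constant and the usual relation between angular-diameter and luminosity distance; the exact form of equation (7.8) may differ, and the function name and redshift grid are ours.

```python
import math

def angular_size(z, q0):
    """Angular size of a fixed-length ruler, in units of (length * H0 / c)."""
    if q0 == 0.0:
        d_lum = z * (1.0 + 0.5 * z)
    else:
        root = math.sqrt(1.0 + 2.0 * q0 * z)
        d_lum = (q0 * z + (q0 - 1.0) * (root - 1.0)) / q0 ** 2
    d_ang = d_lum / (1.0 + z) ** 2      # angular-diameter distance
    return 1.0 / d_ang

# Search a grid of redshifts for the smallest apparent size when q0 = 0.5.
redshifts = [0.05 * i for i in range(1, 81)]        # z = 0.05 ... 4.0
z_min = min(redshifts, key=lambda z: angular_size(z, 0.5))
print(f"q0 = 0.5: smallest angular size near z = {z_min:.2f}")
```

For $q_0 = 0.5$ the minimum falls at a redshift a little above unity, whereas for $q_0 = 0$ the angular size in this model keeps decreasing over the same range.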
Figure 7.10 Angular size versus redshift for a sample of compact
radio sources. Solid lines indicate the expected variation in angular
size of a standard ruler with redshift for different values of $q_0$. The
dashed line shows the “Euclidean” result for which angular size
varies as $z^{-1}$. [After Kellerman (1993)]


Clearly, we are only just beginning to make measurements of the quality 
required to tie down $q_0$. If we are to determine whether we live in a
critically-bound Universe with $q_0 = 0.5$, as many theorists and aesthetes would like,
we really require other measures of the mean density of the Universe. 


Peculiar velocity field In the above analysis, we have described how $q_0$
might be measured using properties of the global geometry of the Universe. 
We can also approach this problem using more local observations of the 
peculiar velocity field (§7.4.2). Since these departures from the Hubble flow 
are believed to be induced gravitationally by large-scale variations in density, 
galaxies’ peculiar motions provide a direct probe of the cosmic density and 
hence the deceleration parameter. 

As we have seen in §7.4.2, major components of the peculiar velocity 
field involve large-scale streaming towards the Virgo Cluster and the Great 
Attractor, and these motions can be explained by large mass concentra- 
tions at these locations. However, attempts to model the peculiar velocity 
field using isolated gravitationally-attractive mass concentrations are clearly 
over-simplifications. In reality, the cosmic density distribution will be a
continuously fluctuating function containing regions of over- and under-density
on many different scales. Recent analyses have therefore concentrated on us- 
ing non-parametric techniques for recovering the complete three-dimensional 
distribution of density from the observed three-dimensional distribution of 
peculiar line-of-sight velocities. A description of the practical implementa- 
tion of such techniques and their results can be found in the review by Dekel 
(1994). These methods confirm the presence of large mass concentrations 
in the direction of the Virgo Cluster and the Great Attractor, but they also 
detect the influence of other gravitational perturbations such as the Perseus- 
Pisces supercluster. 

These analyses are very effective at measuring the fluctuations in the 
density distribution. Unfortunately, they are insensitive to the contribution 
to the cosmic density from any uniformly-distributed component, as such a 
mass component does not produce local peculiar velocities. Thus, we cannot 
measure the total density directly using this approach. However, we can 
infer the cosmic density if we assume that the density distribution is related 
to the distribution of galaxies. If we compare the variations in the number 
density of galaxies to the fluctuations in the density distribution derived from 
the peculiar velocities, we find that the over- or under-density of galaxies is 
approximately proportional to the density fluctuations (Dekel et al. 1993). 
It therefore seems that the distribution of galaxies traces the mass in the 
Universe quite closely, so we might legitimately calculate a cosmic mass-per- 
galaxy ratio by comparing these two distributions. As long as the same ratio 
applies to the fraction of galaxies that are more uniformly distributed, we 
can calculate the total cosmic mass density at any point by multiplying the 
total number density of galaxies by this mass-per-galaxy ratio. Calculations 
along these lines have been made by a number of authors; see Dekel (1994) 
for a compilation of the results. The consensus from these analyses is that
the cosmic density lies quite close to $\rho_c$, consistent with a value of $q_0$
equal to, or perhaps somewhat below, a half.

If the value of $q_0$ is really as high as 0.5, then the conflict between the
age of the oldest globular clusters ($\sim 1.5 \times 10^{10}$ years) and the age
implied by $H_0$ (§7.4.3) is exacerbated. For $q_0 = 0.5$, the age of the Universe
is only $\frac{2}{3}H_0^{-1}$; if $H_0 = 65 \pm 10$ km s$^{-1}$ Mpc$^{-1}$, the implied age of
the Universe is a mere $(1.0 \pm 0.2) \times 10^{10}$ years. In order to make the
Universe older than the globular clusters, we require a long distance scale
with $H_0 < 45$ km s$^{-1}$ Mpc$^{-1}$.
We must therefore either adopt this long distance scale (and be prepared to
explain why so many observations favor a shorter distance scale), or we must
look critically at the arguments for a high value of $q_0$ and the stellar evolution
calculations which give the ages of globular clusters.® 
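
The numbers in this argument can be checked with a few lines of arithmetic. The sketch below takes the age of a critical-density ($q_0 = 0.5$) Universe to be two-thirds of the Hubble time, as in Problem 7.1; the conversion constant and function names are our own.

```python
HUBBLE_TIME_GYR = 977.8   # 1/H0 in Gyr when H0 = 1 km/s/Mpc (rounded)

def age_gyr(h0_km_s_mpc, factor=2.0 / 3.0):
    """Age in Gyr; factor = 2/3 is the critical-density (q0 = 0.5) case."""
    return factor * HUBBLE_TIME_GYR / h0_km_s_mpc

def h0_for_age(age_in_gyr, factor=2.0 / 3.0):
    """Largest H0 (km/s/Mpc) for which the model is at least this old."""
    return factor * HUBBLE_TIME_GYR / age_in_gyr

for h0 in (55.0, 65.0, 75.0):
    print(f"H0 = {h0:.0f}: age = {age_gyr(h0):.1f} Gyr")
print(f"H0 needed for a 15 Gyr Universe: {h0_for_age(15.0):.1f} km/s/Mpc")
```

The limiting Hubble constant for a 15 Gyr age comes out in the low 40s, consistent with the bound quoted above.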


° The Friedmann cosmological models that we have adopted in this chapter neglect 
the possibility of a non-zero cosmological constant, $\Lambda$. If this extra term is added into
the equations governing the expansion of the Universe, then the deceleration parameter
is replaced by $q_0 = 0.5\rho_0/\rho_c - \Lambda/(3H_0^2)$. Setting the arbitrary constant $\Lambda$ to a positive




Problems 


7.1 Many of the properties of an isotropic cosmological model can be derived 
using a simple Newtonian formalism. In particular, consider a spherical 
region in the Universe of radius r and mass M. An object of mass m lying at 
the edge of this sphere only feels the gravitational influence of the material 
within the sphere [Birkhoff’s theorem, the relativistic analog of Newton’s 
first theorem (BT§2.1.1)]. If the Universe is critically-bound, the total energy 
of the object will be zero, and we can write 


$$\frac{1}{2}mv^2 - \frac{GMm}{r} = 0. \qquad (7.36)$$


By integrating this equation forward in time from the Big Bang ($r = 0$,
$t = 0$) to the present day when $t = t_{\rm now}$ and $v = H_0 r$, show that
$t_{\rm now} = (2/3)H_0^{-1} = (2/3)t_0$.


7.2 A star expands in a spherically-symmetric manner with radial velocity 
$v_r$. Defining a spherical coordinate system on the surface of a star with the
polar axis aligned along the line of sight, show that the measurable flux- 
weighted mean line-of-sight velocity will be 


$$v_{\rm los} = v_r\,\frac{\int_0^{\pi/2} I(\theta)\cos^2\theta\,\sin\theta\,d\theta}{\int_0^{\pi/2} I(\theta)\cos\theta\,\sin\theta\,d\theta}. \qquad (7.37)$$

Hence show that, for a star of uniform brightness, $p = v_r/v_{\rm los} = 1.5$. In
reality, a star will not appear uniformly bright: its opacity means that near
the edge of the star (its “limb”) one cannot peer so far into its atmosphere,
so one sees the less bright outer layers. A reasonable analytic approximation
to this limb darkening is given by $I(\theta) = I(0)(0.4 + 0.6\cos\theta)$. In this
approximation, show that $p = 24/17$.


7.3 If one were to make X-ray and S-Z decrement observations along the 
long axis of a prolate cluster of galaxies with axis ratio a: b: b, but were 
to assume erroneously that the cluster is a sphere of radius a, show that the 
derived distance to the cluster would be wrong by a factor of b/a. 


7.4 Consider a disk of potentially-masing material following circular Kep- 
lerian orbits about its center. Assume that the disk is viewed edge-on, and 
consider a line of sight that passes through the disk a distance R from its 
center. Measuring the distance along this line of sight, S, such that center 


value can decrease the rate of deceleration in the cosmic expansion, and perhaps reconcile
the age of the Universe with a high value for $H_0$. As yet, there is no compelling reason to
complicate the geometry of the Universe by setting $\Lambda$ to a non-zero value [see Carroll, Press
& Turner (1992) for a review of the effects of this extra term and the current observational
constraints on its amplitude].
of the disk lies in the plane S = 0, show that the line-of-sight velocity of 
material at position S along the line of sight is given by 


$$v_{\rm los} \propto R(R^2 + S^2)^{-3/4}. \qquad (7.38)$$


The production of maser emission requires a long path length of material 
all travelling with the same line-of-sight velocity (so that the emission occurs 
at a single frequency). Hence, show that emission will preferentially be seen 
from the region R ~ 0 and from the region S ~ 0. If the disk is only 
populated with material at radii r > rjy, at what values of yj, and R will 
we detect maser emission? 


7.5 From the statistical parallaxes of 162 RR Lyrae stars of mean metallicity 
[Fe/H] = $-1.61$, Layden et al. (1996) find $\langle M_V \rangle = 0.71 \pm 0.12$. From this
calibration they derive $R_0 = 7.66 \pm 0.4$ kpc and a distance modulus to the
LMC of $18.28 \pm 0.13$. By comparing these values with ones given in §7.4,
determine by how much the value of $\langle M_V \rangle$ for RR Lyraes given above differs
from the more conventional value. 


7.6 Suppose that a series of four different standard candles are used to step 
out along the cosmic distance ladder as far as the Hubble flow, and that the 
calibration of each standard candle carries an uncertainty of 0.2 magnitudes. 
Show that, by changing the calibration for each step within the range allowed 
by its uncertainty, it is possible to derive values for the Hubble constant that 
lie anywhere between ~ 0.7 and ~ 1.4 of the nominal value. 


7.7 Current estimates suggest that the Universe has an age of $t_0 \approx 12$ Gyr,
while globular clusters have ages of $t_c \sim 15$ Gyr. Show that changing the
absolute magnitude calibration of the MS by $\Delta M$ magnitudes has the effects
of: (a) changing the age estimate for the Universe (derived using MS-fitting as
one of the steps on the distance ladder) by $\Delta t_0 = -0.46\,\Delta M\,t_0$; (b) changing
the age estimate for globular clusters [derived from the MS turnoff point,
equation (6.2)] by $\Delta t_c = 0.85\,\Delta M\,t_c$. Hence show that the age of the Universe
and the ages of globular clusters can be reconciled by a change of
$\Delta M = -0.2$.


8 
The Interstellar Media of Galaxies 


The space between the stars is by no means empty. It contains rarefied 
gas, dust particles, a magnetic field, and relativistically moving electrons, 
protons and other atomic nuclei. These various forms of interstellar energy 
density prove to be quite tightly coupled together, with the result that for 
many purposes they together form a single dynamical entity. The terms 
interstellar medium and interstellar matter, or ISM for short, are 
useful short-hands for this composite dynamical entity. 

Although the ISM accounts for only a small fraction of the mass in 
galaxies, one of the most fundamental differences between galaxies of different 
Hubble types concerns the distribution of their interstellar media. Moreover, 
differences in the stellar contents and light distributions of early- and late- 
type galaxies can in large measure be traced to differences in their interstellar 
media. Hence a study of the ISM is a prerequisite for understanding why 
galaxies are as we observe them. 

Until the early 1980s it was thought that only late-type galaxies had 
significant interstellar media. Advances in X-ray and microwave astronomy 
have now demonstrated that many early-type galaxies also have rich inter- 
stellar media. Hence the difference between early- and late-type galaxies lies 
not so much in the quantity of interstellar matter, as in how the ISM is 
structured. 

Spitzer (1982) provides an exceptionally lucid and non-technical account 
of the theory and observations of the Milky Way’s ISM, while Spitzer (1978) 
Figure 8.1 Absorption by interstellar Na atoms in spectrum of ε
Ori. [After Hobbs (1968)]


provides a wealth of technical detail on the physics of the ISM. An outstand- 
ing collection of reviews of every aspect of the ISM will be found in Thronson 
& Shull (1992). 


8.1 How interstellar matter is detected 


We start with a discussion of how interstellar matter is detected and its tem- 
perature, density and so on are measured. Besides its obvious observational 
significance, this review will explain how our understanding of interstellar 
matter has developed gradually as technological advances have enabled us to 
probe more of the many different forms which interstellar matter can take. 
Judging by the number of poorly understood observations of interstellar mat- 
ter in both the Milky Way and external galaxies, our understanding of what 
is going on between the stars is still far from complete and remains liable to 
significant revision. 


8.1.1 Absorption of starlight 


In §3.7 we discussed the absorption and scattering of starlight by dust grains. 
We saw that dust absorbs and scatters blue light more than red, but it gen- 
erally does not discriminate strongly between photons of similar wavelength. 
Interstellar dust is always associated with interstellar gas, which betrays its 
presence by giving rise to sharp absorption lines in stellar spectra. 

Figure 8.1 shows the effect of absorption by interstellar sodium atoms
on the optical spectrum of ε Ori. About six different V-shaped features,
each about 0.01 nm wide, are apparent. Each such feature is thought to
arise from a different interstellar cloud: neutral sodium atoms absorb light
of wavelength 589.0 nm in their rest frame, and the differences in the central
wavelengths of the various features are due to differences of $\sim 10$ km s$^{-1}$ in
the line-of-sight velocities of the clouds. This interpretation is confirmed by
the fact that a very similar pattern of absorption occurs near 589.6 nm, which
is the wavelength of the other line of the sodium D line-doublet.
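
The link between cloud velocities and the spacing of the absorption features can be sketched with the first-order Doppler formula. The function name and the rounded speed of light are our own choices.

```python
SPEED_OF_LIGHT_KM_S = 299792.458

def doppler_shift_nm(rest_wavelength_nm, velocity_km_s):
    """First-order Doppler shift, (v/c) * wavelength, in nm."""
    return rest_wavelength_nm * velocity_km_s / SPEED_OF_LIGHT_KM_S

shift = doppler_shift_nm(589.0, 10.0)
print(f"A 10 km/s velocity difference shifts the 589.0 nm line by {shift:.4f} nm")
```

A velocity difference of 10 km/s therefore moves the line by about 0.02 nm, comparable to the 0.01 nm width of each feature, which is why the individual clouds can be separated in the spectrum.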
A line-doublet at 393.3 nm of once-ionized calcium atoms (Ca$^+$) gives
rise to a similar phenomenon, and our understanding of interstellar space
took a great stride forward during World War II, when W.S. Adams studied
Ca$^+$ absorption in the spectra of about 300 stars. This study revealed for
the first time the cloudy nature of the ISM.

Unfortunately, calcium and sodium are the only elements that give rise 
to readily measurable interstellar absorption features in optical spectra and 
their lines do not provide reliable diagnostics of the density and temperature 
of the interstellar medium. The problem is two-fold. Most obviously, one 
expects hydrogen and helium to dominate the ISM, and any extrapolation 
from the mass of trace elements such as Na and Ca to the total mass of the 
ISM is liable to be hazardous. Worse still, only a small fraction of all Na 
and Ca atoms are expected to be in the ionization states that give rise to the 
line-doublets described above; most free sodium atoms will be at least once 
ionized, and others will be locked up in dust grains. Similarly, only a small 
fraction of Ca atoms will be exactly once ionized. Little can be inferred 
about the ISM as a whole from measured column densities of neutral Na 
atoms and Ca$^+$ ions unless the fractions of all Na and Ca atoms that these
measured species represent can be estimated. A reliable technique for doing 
this has not been found. 

As Spitzer (1946) early appreciated, ultraviolet spectra are required to
resolve this problem. The essential point is that, for most atoms and ions,
the energy required to promote an electron from the ground state to the
lowest-lying excited state is characteristic of ultraviolet rather than optical
photons.¹ Observations sufficiently far into the ultraviolet ($\lambda < 300$ nm)
cannot be made from the ground, and it was only in the 1970s that they
could be made from space. In the eight years from its launch in August
1972, the 32-inch Copernicus orbiting telescope obtained ultraviolet spectra
for over 100 stars, mostly in the spectral region 90 nm $< \lambda <$ 115 nm. In
January 1978 the International Ultraviolet Explorer (IUE) was launched. It
had lower spectral resolution than Copernicus and was confined to longer
wavelengths ($\lambda > 120$ nm), but it had high sensitivity and could therefore
study fainter objects.

The strength of an absorption feature is quantified by the feature’s
equivalent width $W$ (see Figure 3.3). Obviously, as the column density $N$ of
absorbing atoms along a line of sight to a star increases, the equivalent width
of any absorption feature to which they give rise increases. A plot of $\log(W)$
against $\log(N)$ is called the curve of growth for the line (see Figure 8.2).
Near the origin, the slope of the curve of growth is always unity; at small 
column densities the number of absorbed photons is simply proportional to 


1 We are thinking here of the gross structure of atoms. When fine and hyperfine 
splittings are taken into account, much lower-lying excited states often exist. Not only do 
the relevant energies then lie far below the energies of optical electrons, but the transitions 
are invariably forbidden. The 21-cm line of hydrogen is a case in point. 
the number of atoms available to absorb them. At higher column densities, 
atoms begin to ‘shadow’ one another. Thus once most photons of the right 
energy have been absorbed, adding further atoms to the column does not 
appreciably increase the number of photons absorbed because atoms which 
are added near the observer are hardly exposed to absorbable photons, while 
atoms added near the source star cast the rest of the column into the shade. 
When most absorbable photons are absorbed, one says that the line is sat- 
urated. In a well-resolved spectrum, a saturated line reaches almost to zero 
intensity, while in the ($\log(N)$, $\log(W)$) plane it lies on the almost horizontal
or ‘flat’ portion of the curve of growth, as shown in Figure 8.2. Since the 
measured quantity W is very insensitive to N on the flat part of the curve 
of growth, column densities cannot normally be reliably estimated from sat- 
urated absorption lines. Fortunately, a given species is usually capable of 
absorbing photons at a series of different energies, and the cross-section for 
absorption generally decreases strongly with increasing energy. Hence if one
spectral line is saturated, another at shorter wavelength will not be. In this 
way the abundances of many elements have been determined along the lines 
of sight to several bright stars. 
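
The linear and flat portions of the curve of growth can be reproduced with a simple numerical model. The sketch below assumes a purely Gaussian optical-depth profile whose central optical depth is proportional to the column density $N$, and it ignores the natural wings of the line; the function names, the profile width and the integration grid are illustrative choices of ours.

```python
import math

def equivalent_width(tau0, sigma=1.0, half_range=10.0, steps=4000):
    """Integrate 1 - exp(-tau(x)) over wavelength offset x (midpoint rule).

    tau0 is the optical depth at line centre (proportional to N);
    sigma is the Gaussian width of the line in the same units as x.
    """
    dx = 2.0 * half_range / steps
    total = 0.0
    for i in range(steps):
        x = -half_range + (i + 0.5) * dx
        tau = tau0 * math.exp(-0.5 * (x / sigma) ** 2)
        total += (1.0 - math.exp(-tau)) * dx
    return total

def log_slope(tau0, factor=1.1):
    """Local slope d log(W) / d log(N) of the curve of growth."""
    ratio = equivalent_width(tau0 * factor) / equivalent_width(tau0)
    return math.log(ratio) / math.log(factor)

for tau0 in (0.01, 1.0, 100.0, 1000.0):
    print(f"central optical depth {tau0:7.2f}: slope = {log_slope(tau0):.3f}")
```

The slope is close to unity for weak lines and falls well below it once the line center saturates, which is why $W$ carries little information about $N$ on the flat part of the curve.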

Figure 8.3 displays the results for one particular star as a plot of the 
ratio of the abundance of each element to the cosmic abundance² versus
the temperature at which that element is expected to condense onto dust 
grains. Nearly all elements are less abundant in interstellar gas than in the 
solar system; one says that the elements are depleted in interstellar gas. 
Moreover, the degree of depletion is larger for elements that can condense 
onto grains at high temperature than for more volatile elements. 

Although the curve of growth cannot be used to determine N when 
a line is strongly saturated, in favorable cases accurate values of N can 
nevertheless be extracted from saturated lines. The general idea is to focus 
attention on the ‘wings’ of the line, that is, wavelengths far enough from 
line-center that only a small fraction of the available photons are absorbed. 
In effect, the wings are always unsaturated, and therefore exhibit absorption 
that is proportional to the column density. 

Clearly, to exploit this idea one has to be in a position to determine
theoretically the relationship between $N$ and the strength of the absorption
far from line-center. This is possible in the case of the Lyα transition of
neutral hydrogen at $\lambda_0 = 121.6$ nm. Since hydrogen is so abundant and the
cross-section per atom for absorption of Lyα photons is large ($\sim 4000$ nm$^2$),
the feature at 121.6 nm is generally extremely strongly saturated. In fact,
the saturation is such that there can be appreciable absorption up to one nm
from line-center.

Two factors determine how far from line-center a line will extend. The
first is Doppler broadening; atoms moving at different speeds absorb photons
of different energies. The velocity spread of hydrogen atoms in the interstellar
medium is of order 10 km s$^{-1}$, and this dispersion gives rise to a wavelength
spread $\Delta_v \sim (v/c)\lambda \approx 0.004$ nm.³ Since the distribution of atomic velocities
should be Gaussian, whether because they are thermal velocities, or because
they arise from the random motions of clouds, Doppler broadening yields
a net absorption cross-section for photons with wavelengths in the interval
$(\lambda + d\lambda, \lambda)$ proportional to

$$p(\lambda)\,d\lambda = \exp\left[-\frac{(\lambda - \lambda_0)^2}{2\Delta_v^2}\right] d\lambda. \qquad (8.1)$$

Unfortunately, one cannot accurately predict $\Delta_v$ because it depends both on
the unknown temperature of the absorbing gas and on the random velocities
of the clouds that lie along the given line of sight.

The second factor that determines how far from line-center absorption 
extends, is the line’s ‘natural width’. The excited state of the transition is 
not a stationary state of a real atom but only of an idealized atom which does 
not interact with the ambient electromagnetic field. When this interaction is 


2 The cosmic abundance is defined to be the abundance in the early solar system. For 
most elements this is determined from meteorites. 

3 Throughout this chapter we approximate the relativistic Doppler-shift formula by its 
value to first order in $v/c$.
taken into account, the excited state is not a stationary state and therefore
not a state of well-defined energy. By Heisenberg’s uncertainty principle, the
excited state’s energy is uncertain by an amount $\delta E = h/\tau$, where $\tau$ is of
order of the lifetime of the excited state. Since the energy of the excited state
is ill-determined, the wavelengths of the photons that excite the transition
are uncertain too, by an amount $\Delta_n$, the natural width of the line. By the
uncertainty principle, $\Delta_n/\lambda_0 = \lambda_0/c\tau$. The fraction of photons absorbed by
an atom at rest that have wavelength in the interval $(\lambda + d\lambda, \lambda)$ is given by
the Lorentzian profile
(8.2)


The natural width of the Lyα line, $\Delta_n$ = 6 x 10-* nm, is significantly
smaller than the width $\Delta_v \sim 4 \times 10^{-3}$ nm typical of Doppler broadening.
Consequently, for $|\lambda - \lambda_0| \lesssim \Delta_v$ the frequency dependence of the net
absorption cross-section along a typical line of sight is essentially determined
by Doppler broadening. Sufficiently far from line-center the sharp decline of
equation (8.1) for $|\lambda - \lambda_0| > \Delta_v$ will give way to the more gradual asymptotic
form $\propto |\lambda - \lambda_0|^{-2}$ of equation (8.2). Consequently, sufficiently far from line
center the absorption cross section will always follow the readily-calculated
Lorentzian profile. Since $\Delta_n \ll \Delta_v$, $dp$ will be small in this region and the
absorption can be readily measured only if the line is strongly saturated.
The Lyα line generally satisfies this criterion and has been used to
determine the column density of hydrogen atoms required, for example, for the
construction of Figure 8.3.
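
The handover from the Doppler core to the Lorentzian wings can be illustrated by comparing the two profile shapes directly. The sketch below uses unit-area Gaussian and Lorentzian functions and an illustrative width ratio of 1:1000; both the normalizations and the ratio are our assumptions rather than values from the text, and a full treatment would convolve the two profiles instead of comparing them.

```python
import math

def gaussian(x, width):
    """Unit-area Gaussian with dispersion `width`."""
    return math.exp(-0.5 * (x / width) ** 2) / (width * math.sqrt(2.0 * math.pi))

def lorentzian(x, width):
    """Unit-area Lorentzian with half-width `width`; falls as x**-2 far out."""
    return (width / math.pi) / (x * x + width * width)

def crossover(doppler_width, natural_width):
    """Offset beyond which the Lorentzian wing exceeds the Gaussian core.

    Found by bisection between 1 and 20 Doppler widths, where the Gaussian
    dominates at the inner bound and the Lorentzian at the outer bound.
    """
    lo, hi = doppler_width, 20.0 * doppler_width
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if gaussian(mid, doppler_width) > lorentzian(mid, natural_width):
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

x_cross = crossover(1.0, 1.0e-3)
print(f"wings take over beyond about {x_cross:.2f} Doppler widths")
```

For this illustrative ratio the wings dominate only beyond four to five Doppler widths, where either profile is already very small, which is why the wings become measurable only in strongly saturated lines.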

Much of the mass of the ISM turns out to be in molecular rather than 
atomic form. Simple molecules, like atoms, rarely have strong absorption 
lines in the optical wave-band, so that most of what we know about interstel- 
lar molecules derives from spectroscopy at ultraviolet and radio frequencies. 
Therefore we now discuss the ultraviolet spectra. 

A molecule can store energy in three ways: its electrons can be promoted
to an excited state; the distance between the nuclei can oscillate as if the elec- 
tron cloud that unites them were a spring and they were vibrating masses; 
the nuclei can orbit around each other as if the molecule were a rigid rotating 
body. The complete set of energies permitted by these three quantized pos- 
sibilities is complex. Fortunately only a small fraction of all possible energy 
levels plays a significant role astrophysically. 

Not surprisingly, H₂ is the most abundant molecule. All the transitions
of H₂ that give rise to measurable ultraviolet absorption lines start from
either the ground state, or a state reached by adding a small amount of
spin to the ground state. The molecule’s spin is measured by the quantum
number J = 0, 1, ..., which can only change by ±1 when a photon is absorbed
or emitted. Moreover, all observed transitions involve the promotion of the
electrons to their first excited state.⁴ In addition, on absorbing a photon the
molecule may be left vibrating. The amplitude of vibration is characterized
by the quantum number v = 0, 1, ..., which can change by any amount on
absorption or emission of a photon.

Figure 8.4 Absorption by H₂ molecules on the line of sight to ζ
Ophiuchi. [After Spitzer & Jenkins (1975)]

Promoting the electrons to their first excited state requires 11.2 eV of energy.
Increasing v by one requires about 0.15 eV, while changing the angular-momentum
quantum number from J − 1 to J requires about 0.015J eV.
Hence the relevant photons are ones with energies ~ 11.5 eV and wavelengths
~ 100 nm.⁵ Changing v by 1 will shift the wavelength by ~ 1.3 nm, while
changing J will induce a further wavelength shift of order 0.13J nm. Hence
the absorption spectrum of H₂ consists of clusters, or bands, of a handful
of lines each, each band being separated by ~ 1.3 nm from the next.
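
The arithmetic behind these estimates can be checked in a few lines. The sketch below is purely illustrative: it uses the conversion E/eV = 1240 nm/λ quoted in the footnote together with the energies given above, and the function and constant names are hypothetical.

```python
# Photon energy <-> wavelength conversion quoted in the text: E/eV = 1240 nm / lambda.
HC_EV_NM = 1240.0


def wavelength_nm(energy_ev):
    """Wavelength (nm) of a photon of the given energy (eV)."""
    return HC_EV_NM / energy_ev


def wavelength_shift_nm(energy_ev, delta_e_ev):
    """Magnitude of the wavelength change when the photon energy rises by delta_e_ev."""
    return abs(wavelength_nm(energy_ev + delta_e_ev) - wavelength_nm(energy_ev))


E_PHOTON_EV = 11.5      # electronic excitation plus a little vibration and rotation
DE_VIBRATION_EV = 0.15  # cost of increasing v by one
DE_ROTATION_EV = 0.015  # cost of J - 1 -> J, per unit J

lam = wavelength_nm(E_PHOTON_EV)
band_spacing = wavelength_shift_nm(E_PHOTON_EV, DE_VIBRATION_EV)
line_spacing = wavelength_shift_nm(E_PHOTON_EV, DE_ROTATION_EV)

print(f"wavelength   ~ {lam:.0f} nm")
print(f"band spacing ~ {band_spacing:.2f} nm")
print(f"line spacing ~ {line_spacing:.2f} J nm")
```

The results (roughly 108 nm, 1.4 nm and 0.14J nm) agree with the rounded values in the text, which are evaluated at λ ≈ 100 nm.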

Figure 8.4 shows an example of absorption by H₂ molecules. The two
most prominent features are clearly saturated and the column density of
molecules giving rise to them can be determined from their wings, in the
same way that the column density of hydrogen atoms is found from the wings
of the Lyα line. One of these saturated features is caused by molecules in
the ground level, and the other by molecules in the first spin-excited level,
J = 1. Knowing the relative populations N(J) of these two levels enables
one to determine the temperature of the absorbing gas from Boltzmann’s
formula


$$\frac{N(J)}{N(0)} = (2J + 1)\exp\left[-J(J+1)\,\frac{85\,\mathrm{K}}{T}\right]. \tag{8.3}$$


(The factor of 2J + 1 arises from the degeneracy of the J-th level.) The
temperatures determined in this way range from 45 K to 130 K with a typical
value of 80 K. From equation (8.3) it is clear that at such temperatures few
molecules will be in levels with J > 1.


⁴ Rotation transitions can be made radiatively only if the molecule has a non-zero
electric dipole moment. A homonuclear molecule such as H₂ has a non-zero electric dipole
moment only in excited electronic states.

⁵ The energies and wavelengths of photons are related by E/eV = 1240 nm/λ.


A complication that should be mentioned is that in lower-density clouds the ratio N(1)/N(0) may deviate
significantly from the thermal value of equation (8.3) because molecules tend 
to be left in states of high J when they return to their lowest electronic and 
vibrational level after being excited by a photon. Such non-thermal photo- 
excitation is overwhelmed by thermal collisional excitation only when the 
particle density is higher than ~ 1074 m~3. 
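
As a rough illustration of how equation (8.3) is used, the sketch below evaluates the thermal level populations and inverts the J = 1 case for the temperature. It assumes that equation (8.3) reads N(J)/N(0) = (2J + 1) exp[−J(J + 1) 85 K/T] and that the populations are thermal; the function names are hypothetical.

```python
import math

THETA_ROT_K = 85.0  # temperature scale appearing in equation (8.3)


def population_ratio(j, temperature_k):
    """Thermal N(J)/N(0) at the given temperature, following equation (8.3)."""
    return (2 * j + 1) * math.exp(-j * (j + 1) * THETA_ROT_K / temperature_k)


def temperature_from_ratio(n1_over_n0):
    """Invert the J = 1 case, N(1)/N(0) = 3 exp(-170 K / T), for T."""
    if not 0.0 < n1_over_n0 < 3.0:
        raise ValueError("a thermal N(1)/N(0) must lie between 0 and 3")
    return 2.0 * THETA_ROT_K / math.log(3.0 / n1_over_n0)


for t in (45.0, 80.0, 130.0):
    print(f"T = {t:5.0f} K: N(1)/N(0) = {population_ratio(1, t):.3f}, "
          f"N(2)/N(0) = {population_ratio(2, t):.4f}")
```

At 80 K the J = 2 level holds less than one per cent of the J = 0 population, which is the sense in which few molecules lie in levels with J > 1.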

Figure 8.5 plots values obtained from the ultraviolet spectra of 23 stars 
of the quantities 


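$$N(\mathrm{H_{tot}}) = N(\mathrm{H\,I}) + 2N(\mathrm{H_2}) \quad\text{and}\quad f_{\mathrm{H_2}} = \frac{2N(\mathrm{H_2})}{N(\mathrm{H_{tot}})}, \tag{8.4}$$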


where N(H2) = N(0)+N(1). Notice that the horizontal axis is linear, while 
the vertical axis is logarithmic. For N(Htot) < 2 x 104m? the fraction of 
molecular hydrogen is very small and extremely sensitive to N(Htot). For 
N(H¢ot) larger than ~ 107° m~*, a large fraction of the matter detected is in 
molecular form. 
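
The two quantities plotted in Figure 8.5 are simple to compute from measured column densities. The sketch below assumes that the molecular fraction in equation (8.4) is the fraction of hydrogen nuclei bound in molecules, 2N(H₂)/N(Htot); the column densities used are hypothetical round numbers and not values from the figure.

```python
def total_hydrogen_column(n_hi, n_h2):
    """N(Htot) = N(HI) + 2 N(H2): column density of hydrogen nuclei."""
    return n_hi + 2.0 * n_h2


def molecular_fraction(n_hi, n_h2):
    """Fraction of hydrogen nuclei in molecular form (assumed reading of eq. 8.4)."""
    n_tot = total_hydrogen_column(n_hi, n_h2)
    if n_tot <= 0.0:
        raise ValueError("total column density must be positive")
    return 2.0 * n_h2 / n_tot


# Hypothetical line of sight, column densities in m^-2.
print(molecular_fraction(1.0e25, 2.5e24))
```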

A relationship of the type shown in Figure 8.5 arises naturally if the matter
observed on each line of sight is concentrated into clouds. H₂ molecules
form most rapidly in dense gas, where hydrogen atoms frequently collide
with dust grains, which catalyze the conversion of hydrogen from atomic into
molecular form. This fact alone leads one to expect that H₂ will be most
abundant in the dense cores of clouds. Moreover, gas near a cloud’s surface
is exposed to ultraviolet radiation from early-type stars that lie outside the
cloud. This radiation increases the fraction of hydrogen in atomic form by
dissociating H₂ molecules. The deeper one goes into the cloud, the smaller
the ultraviolet flux becomes, and in a large cloud the flux may eventually dry
up because all the ultraviolet photons incident on the cloud’s surface have
been absorbed. In such shaded regions the abundance of H₂ shoots up, since
H₂ molecules form rapidly and have long lifetimes.
8.1.2 Extreme UV and X-ray observations


In all galaxies a significant fraction of interstellar space is occupied by gas
that is so hot ($T > 10^6$ K) that hydrogen and helium are fully ionized.
Such hot gas is commonly called a plasma and emits predominantly in the
extreme ultraviolet (EUV) and X-ray bands. The energies of EUV photons
lie in the range 13.6–100 eV, while X-ray photons are generally classified as
soft, medium or hard depending on whether their energies lie in the ranges
0.1–1 keV, 1–10 keV or > 10 keV.

13.6 eV is the binding energy of a hydrogen atom. Interstellar hydrogen
atoms offer a large cross-section for the absorption of photons with energies
somewhat in excess of this value, an absorbed photon’s energy being used
to eject the atom’s electron. The energies required to eject the first and
second electrons from a helium atom are 24.6 eV and 54.4 eV, respectively.
Consequently, He atoms and He⁺ ions also offer large cross-sections to the
absorption of EUV photons. Since significant column densities of HI, He and
He⁺ accumulate over short distances along any line of sight from the Sun,
the local ISM is highly opaque to EUV photons and it is extremely difficult
to probe the larger Milky Way or external galaxies with observations in the
EUV band.

The cross section for the absorption of a photon of energy F by an ion 
with ionization energy E; falls off as (FE — E;)~?. Hence the opacity of the 
local ISM due to hydrogen and helium is falling steeply as one enters the soft 
X-ray band. In fact, the opacity of the local ISM soon becomes dominated 
by ions such as O⁵⁺, which has ionization energy 138 eV and is relatively
abundant. Although the local ISM is by no means transparent at ~ 0.6 keV, 
we do receive significant numbers of soft X-rays from extragalactic objects 
that lie at high galactic latitudes b, since the line of sight to such objects 
does not run far through the disk of the Milky Way. Figure 8.6 indicates 
the distances that photons of various energies can travel through an ISM of 
particle density $10^6$ m$^{-3}$.
No reasonably abundant species in the local ISM offers a large cross-section
for absorption of photons with energy $E \gtrsim 2$ keV, with the consequence
that at such high energies we can study sources throughout the
Milky Way and in external galaxies almost irrespective of their latitudes b.
Moreover, at these energies we may usually treat the interstellar media of
external galaxies as optically thin,⁶ which greatly simplifies the extraction
of astrophysically interesting parameters from observed X-ray spectra.

The X-rays emitted by a plasma are generated by two predominant 
processes: (i) the bremsstrahlung⁷ process in which a free electron scatters
off an ion, and (ii) the bound-bound transitions of ions, in which an excited 
state of an ion decays to a less energetic state. 

The rate per unit volume of collisions between the free electrons and the
ions of a fully ionized plasma is proportional to the square of the plasma’s
density $\rho$ because the electron density is proportional to $\rho$ and the rate at
which each electron suffers collisions is proportional to the ion density, which
is itself proportional to $\rho$. Consequently, the bremsstrahlung emissivity of a
hot plasma is proportional to $\rho^2$. Quantitatively, for a plasma at temperature
T the emissivity is


$$j_\nu = 5.44 \times 10^{-52}\,\overline{Z^2}\,n_e n_i\,T^{-1/2}\,g\,e^{-h\nu/kT}\ \mathrm{W\,m^{-3}\,Hz^{-1}}, \tag{8.5}$$


where $\overline{Z^2}$ is the mean-square atomic charge on the ions. The Gaunt factor,
g, is a dimensionless quantity of order unity, which depends on the range
of distances between electrons and ions at which the scattering process occurs.
Since the energy of radiation produced depends on the closeness of
the encounter between electron and ion, g depends on the photon energy
E = hν: Figure 8.7 shows g(E) for bremsstrahlung produced by plasmas at
various temperatures. It is apparent that g depends fairly weakly on ν, and
so equation (8.5) implies that $j_\nu$ is almost constant at low frequencies and
exponentially cut off above ν ~ kT/h.
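
The shape of the bremsstrahlung spectrum can be seen by evaluating equation (8.5) numerically. The sketch below is illustrative only: it assumes the coefficient is 5.44 × 10⁻⁵² in SI units with number densities in m⁻³, sets the Gaunt factor to a constant g = 1, and uses hypothetical densities and temperature.

```python
import math

H_PLANCK = 6.62607015e-34   # J s
K_BOLTZMANN = 1.380649e-23  # J / K


def bremsstrahlung_emissivity(nu_hz, temperature_k, n_e, n_i, z2_mean=1.0, gaunt=1.0):
    """j_nu in W m^-3 Hz^-1, following equation (8.5) with an assumed SI coefficient."""
    return (5.44e-52 * z2_mean * n_e * n_i * temperature_k ** -0.5
            * gaunt * math.exp(-H_PLANCK * nu_hz / (K_BOLTZMANN * temperature_k)))


T_PLASMA = 1.0e7                           # K, hypothetical hot-gas temperature
NU_CUT = K_BOLTZMANN * T_PLASMA / H_PLANCK  # cut-off frequency kT/h

for x in (0.01, 0.1, 1.0, 10.0):
    j = bremsstrahlung_emissivity(x * NU_CUT, T_PLASMA, n_e=1.0e3, n_i=1.0e3)
    print(f"nu = {x:5.2f} kT/h: j_nu = {j:.3e} W m^-3 Hz^-1")
```

The emissivity barely changes between 0.01 kT/h and 0.1 kT/h and then falls steeply above kT/h; doubling both densities raises it by a factor of four.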

In an optically-thin medium, ions are excited by collisions with ener- 
getic electrons and de-excited either by collision with a second electron, or 
radiatively through the emission of a photon. At the densities characteristic 
of the hot component of the ISM, collisional de-excitation is much less im- 
portant than radiative decay, with the result that essentially every collisional 
excitation gives rise to the emission of a photon. From this it follows that 
the luminosity per unit volume in line radiation is also proportional to $\rho^2$.

In a hot astrophysical plasma, bremsstrahlung is not the only X-ray
emission process. At high temperatures low-mass atoms are fully ionized,
but heavier, highly-charged nuclei are able to hold on to some of their most
tightly bound electrons. These remaining bound electrons are excited to
higher energy levels by collisions with energetic free electrons in the hot
gas. The subsequent radiative decays of such excited atoms produce X-ray
emission at specific energies. Thus, the X-ray spectra generally contain
discrete emission lines from various highly-ionized heavy elements on top of
the bremsstrahlung continuum.


⁶ See §8.1.4 for a discussion of optical depth in emission lines.
⁷ Bremsstrahlung is German for ‘braking’ (bremsen) ‘radiation’ (Strahlung).


Figure 8.8 X-ray spectrum of the elliptical galaxy NGC 4472. The horizontal extent of
each cross shows the energy binning of the data, while the vertical extent indicates the
RMS error in the signal. The histogram shows a model spectrum for a two-temperature
plasma. [From data taken by the ASCA satellite and kindly provided by D. Buote]

In the past, X-ray detectors have generally had very limited energy 
resolution. Emission lines in spectra were therefore smeared out to a point 
where they were difficult to detect for any but the strongest features. In 
recent years, this situation has improved dramatically. Figure 8.8 shows 
an X-ray spectrum of the elliptical galaxy NGC 4472 obtained using the
Japanese ASCA satellite. The CCDs that form the detectors in this satellite
have better energy resolution than previous instruments, and it is apparent
that this improved resolution makes it possible to detect a number of discrete
emission lines. For example, the strong line at ~ 2.5 keV arises from K-series⁸
line emission from highly-ionized sulfur. Even with these improved
data, many lines are still blended: the broad peak at ~ 1 keV in Figure 8.8 is 
attributed to a mixture of L-series emission from Fe in a variety of ionization 
states and K-series emission from Ne. Nevertheless, there is clearly a great 
deal of information to be gleaned from such high quality spectra. 

In principle, it is possible to determine the temperature, density and 
metallicity of a plasma from its X-ray spectrum. A further parameter that 
alters the observed spectrum at low energies is the amount of absorption by 
cold gas along the line of sight, either in the local ISM or in any material 
surrounding the emitting object. All these quantities can be estimated by 
comparing the observed spectrum to ones simulated using the full range of 
plausible parameter values. Simulating the X-ray spectrum of a plasma re- 
quires a large and complex computer program, which draws on a mass of 
atomic data. Unfortunately, many of the required data are not accurately 
known, and it is not practicable to include all possible atomic processes in 
a single computer program; one merely tries to include the most important 
processes. Uncertainties in the available atomic data and incompleteness of 
the codes in which they are used combine to produce significant uncertainties 
in the values of physical variables inferred from X-ray spectra. As the quality
of X-ray data has improved, the shortcomings of widely-used spectral inter- 
pretation codes have become apparent, so parallel efforts have been made 
to improve the models [see, for example, Liedahl, Osterheld & Goldstein 
(1995)]. 

As well as forcing us to improve the spectral fitting codes, higher quality 
X-ray data have also made it necessary to consider more sophisticated as- 
trophysical models. While the data were poor, it was possible to explain the 
observations using simple-minded physical interpretations, but such models 
are frequently ruled out when the data quality improves. In the case of 
NGC 4472, for example, observed spectra cannot be reproduced by mod- 
eling the emission as arising from a single temperature plasma. As shown 
in Figure 8.8, a plasma with two distinct temperature phases could explain 
the observations of this galaxy, but even this model is probably an oversim- 
plification; the plasma might contain gas at a whole range of temperatures, 
or there might be non-negligible contributions to the spectrum from other 
sources, such as X-ray binaries in the galaxy. Testing these yet more complex
models must await still higher-quality X-ray data from the next generation
of instruments.


⁸ X-ray spectroscopists have their own notation for defining spectral lines: transitions
in which an electron moves down into an n = 1 state (see Box 8.2) produce “K-series
lines,” those from electrons ending in an n = 2 state produce “L-series lines,” and so on.
These series are analogous to the Lyman, Balmer, etc., series in hydrogen.


8.1.3 Optical emission lines 


The optical spectra of many galaxies contain conspicuous emission lines. 
These lines are generally formed in excited interstellar gas. The gas has 
usually been excited by ionizing radiation from either hot stars or an active 
galactic nucleus, but it has sometimes been excited by the passage of a 
powerful shock wave. We shall concentrate on the more important case of 
photo-excited regions. These regions are characterized by the presence of 
atoms in higher states of ionization than would occur in thermodynamic 
equilibrium at the kinetic temperature of the gas (T ~ 8000 K). 

The spectra of many hot stars are such that radiation from these stars 
ionizes essentially all hydrogen atoms within some limiting distance from 
the stars, while material beyond this distance is largely neutral. The region 
of ionized hydrogen is called an HII region and the volume within which 
it is contained is called a Strömgren sphere. A great deal of valuable
information about both the ISM and hot stars has been gleaned from the 
spectra of such photo-excited HII regions. 


Hydrogen lines Lines of the Balmer series such as Hα (656 nm), Hβ
(486 nm) and Hγ (434 nm) (see Figure 8.9) are prominent in the optical spectra
of HII regions. In thermodynamic equilibrium at the kinetic temperature
of a typical HII region (~ 8000K), there would be negligible flux in these 
lines since they arise from states that lie more than 12eV above hydrogen’s 
ground state. It follows that the observed lines are produced by H atoms 
cascading to their ground state following capture of an electron by a proton. 
The relative intensities of the different lines formed by this recombination 
process can be accurately calculated, and depend only weakly on T and (for 
interstellar densities) negligibly on the density. The relevant theory is simple 
because observed nebulae are invariably optically thin in these lines. 
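
The claim that thermal excitation is negligible can be made quantitative with a Boltzmann factor. The sketch below is an order-of-magnitude estimate only: it ignores the statistical weights of the levels, and the function name is hypothetical.

```python
import math

K_BOLTZMANN_EV = 8.617333e-5  # Boltzmann constant in eV / K


def boltzmann_factor(energy_ev, temperature_k):
    """exp(-E / kT): thermal suppression of a level lying E above the ground state."""
    return math.exp(-energy_ev / (K_BOLTZMANN_EV * temperature_k))


# Levels that feed the Balmer lines lie more than 12 eV above the ground state.
suppression = boltzmann_factor(12.0, 8000.0)
print(f"exp(-12 eV / kT) at 8000 K = {suppression:.1e}")
```

At 8000 K the factor is a few times 10⁻⁸, so in thermodynamic equilibrium essentially no atoms would occupy the upper levels of the Balmer transitions.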

The observed relative intensities of the Balmer lines do not agree with
the predictions of theory. In particular I(Hα)/I(Hβ), which is called the
Balmer decrement, is larger than the predicted value I(Hα)/I(Hβ) ≃ 3.
This discrepancy between simple theory and observation is attributed to absorption
of line radiation by dust: since dust absorbs blue light more strongly
than red (see §3.7.1), the fraction of Hβ photons that are absorbed by dust
is larger than the fraction of the redder Hα photons that are absorbed. This
conjecture can be tested by comparing the observed value of I(Hβ)/I(Hγ)
to that which it predicts; the agreement is generally found to be good.

The number of ionizing photons absorbed by an HII region per unit time 
can be deduced from the observed intensity of any of the Balmer lines. The 
argument, which was first given by Zanstra (1927), is as follows. In a steady 
state, each photo-ionization of a hydrogen atom may be associated with 
a hydrogen atom reaching its ground state by emission of a Lyman-series
photon. Now a Lyman-series photon emitted within an HII region is very
unlikely to escape from the HII region because the ground-state hydrogen
atoms, which are so abundant in and around the HII region, offer a large
cross-section for its absorption. Suppose the photon is not a Lyα photon
but one of the higher members of the series. Then the excited hydrogen
atom produced by its absorption has a finite chance of emitting two or more
photons before returning to its ground state. For example, when a Lyγ
photon is absorbed, there is a non-trivial probability for the emission of
(a) an Hβ photon followed by a Lyα photon, or (b) a Paschen α photon
(n = 4 → n = 3) followed by either (i) a Lyβ photon or (ii) an Hα photon
plus a Lyα photon. Thus each time a Lyman-series photon is absorbed, there
is non-negligible probability for the emission of a less energetic Lyman-series
photon plus one or more photons of the Balmer, Paschen ..., series. It follows
that after a sufficient number of absorptions, the energy of any Lyman-series
photon will have been divided between a Lyα photon and some less energetic
photons. Moreover, the creation of a Lyα photon by any process other than
the destruction of a Lyα photon is associated with the creation of a Balmer-series
photon. Thus the rate of creation of all Balmer-series photons should
equal the rate of ionization of hydrogen atoms. Finally, we recall that the 
relative strengths of the Balmer lines depend only weakly on temperature 
and density. Hence the rate of creation of all Balmer-series photons, and 
thus the ionization rate, can be estimated from the strength of any single 
Balmer line. 

The ionization rate within an HII region depends upon the luminosity 
of the exciting source in the EUV band. When knowledge of this rate is 
combined with the assumption that the source radiates as a black body and 
with an estimate of the flux at much longer wavelengths, for example in 
the optical or infrared bands, the temperature of the ionizing source can be 
determined. 


Metal lines When HII regions were first studied spectroscopically, astronomers
were puzzled by the two most prominent lines in the spectra.
These green ‘nebular’ lines, N₁ at 500.7 nm and N₂ at 495.9 nm, defied
interpretation until it was demonstrated by Bowen (1927) that they are
‘forbidden’ lines of O²⁺. (See Box 8.1 for the meaning of ‘forbidden’.)

In practice the spectra of interstellar nebulae are dominated by forbidden 
and semi-forbidden lines of the ions of heavy elements such as N, O and 
S. At first sight this may seem a very strange phenomenon: not only are 
these ions over 1000 times rarer than those of H and He, but the transitions 
that are dominating the observed radiation are precisely the least probable 
transitions of these ions! Actually it is not hard to understand the dominance 
of forbidden transitions. 

The only excited states of ions that will be significantly populated in an
HII region are those whose energies lie within a few times kT ~ 1 eV of the
relevant ground-state energy. All such low-lying states have the same electronic
‘configuration’ as the ground state,⁹ with the result that all transitions
between them and the ground state are forbidden.

The forbidden nature of these transitions effectively prevents their giving
rise to interstellar absorption; as we saw in §8.1.1, significant interstellar
absorption from ground states only occurs in the UV. But in a low-density
medium a forbidden transition can cause significant emission simply because
once an ion is knocked into an excited state, it will hang there until it decays
radiatively. Thus the forbiddenness of the transition delays rather than
prevents the emission of a photon. Notice, however, that this is true only if
the half-life of the excited state is short compared to the collision time in the
plasma; at high enough density the excited state is liable to be de-excited by
a second collision before it has had time to decay radiatively. Consequently,
if the emissivity per atom j of a plasma is plotted against density $\rho$, j will
be proportional to $\rho$ for $\rho$ less than some critical density $\rho_{\rm crit}$, at which the
collision time becomes comparable to the radiative decay time.¹⁰ At higher
densities j will be constant because the number of ions in the excited state
will have leveled off at the value appropriate to thermodynamic equilibrium,
and each excited ion has some probability per unit time of emitting a photon.
Thus the emissivity of a dense plasma, such as that in a fluorescent light,
is liable to be dominated by allowed transitions, which have high critical
densities and peak emissivities.
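
The behavior of the emissivity per ion on either side of the critical density can be illustrated with a two-level model. The sketch below is not taken from the text: it assumes a single excited level populated by collisions at rate n·q12 and depopulated by collisions at rate n·q21 and by radiative decay at rate A21, so that the critical density is A21/q21. All rate coefficients are hypothetical round numbers.

```python
def excited_fraction(density, a21, q12, q21):
    """Steady-state fraction of ions in the upper level of a two-level system."""
    up = density * q12           # collisional excitation rate per ion (s^-1)
    down = a21 + density * q21   # radiative plus collisional de-excitation (s^-1)
    return up / (up + down)


def emission_rate_per_ion(density, a21, q12, q21):
    """Photons emitted per ion per second."""
    return a21 * excited_fraction(density, a21, q12, q21)


# Hypothetical values for a forbidden line.
A21 = 1.0e-2    # s^-1
Q12 = 5.0e-15   # m^3 s^-1
Q21 = 1.0e-14   # m^3 s^-1
N_CRIT = A21 / Q21  # critical density, m^-3

for n in (1.0e-4 * N_CRIT, 1.0e-2 * N_CRIT, N_CRIT, 1.0e2 * N_CRIT, 1.0e4 * N_CRIT):
    print(f"n / n_crit = {n / N_CRIT:8.0e}: rate per ion = "
          f"{emission_rate_per_ion(n, A21, Q12, Q21):.3e} s^-1")
```

Well below the critical density the rate per ion doubles when the density doubles; well above it the rate saturates at A21·q12/(q12 + q21), the collision-dominated value.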

The temperature and density of an HII region can be determined from
optical lines of O, N and S. The relevant ions have energy-level diagrams of
two types. Figure 8.10 is a schematic energy-level diagram for N⁺, O²⁺, and
S²⁺, which have equivalent electronic configurations. For our purposes it
suffices to distinguish three levels: that associated with the ³P ground term
and those associated with the ¹D and ¹S terms, which lie 2.47 eV and 5.32 eV


⁹ See Box 8.2 for the meaning of ‘configuration’.
¹⁰ Astronomers sometimes refer to $\rho_{\rm crit}$ as the “density required for excitation.” It is
more accurately called the “density required for full excitation.”
Box 8.1: Allowed and Forbidden Spectral Lines


Standard time-dependent perturbation theory yields an expression for
the Einstein B coefficients [see equation (8.6)] that is proportional to the
squared absolute value of an infinite sum. The first term in this sum is
proportional to the quantum-mechanical matrix element $\langle 1|d|2\rangle$, where
$|1\rangle$ and $|2\rangle$ are the quantum states of the ion before and after emission,
and d is the operator for the ion’s electric dipole moment. The two
next terms in the series are proportional to $\langle 1|q|2\rangle$ and $\langle 1|m|2\rangle$, where q
and m are the ion’s electric quadrupole and magnetic dipole moments.
On dimensional grounds the relative magnitudes of these quantities are of
order $|\langle 1|d|2\rangle| : |\langle 1|q|2\rangle| : |\langle 1|m|2\rangle| \sim \lambda/a_0 : 1 : 1$, where $\lambda$ is the wavelength
of the photon concerned and $a_0$ is the Bohr radius. Since $\lambda/a_0 \sim 5000$ in
the optical band, the first (dipole) contribution to B dominates unless it
vanishes by virtue of some exact or near symmetry of the ion. Forbidden
transitions are those for which $\langle 1|d|2\rangle$ vanishes. Their forbidden nature
is indicated by surrounding the symbols of the ions in which they occur
by square brackets thus: [O²⁺]. If $\langle 1|d|2\rangle$ is much smaller than expected
on dimensional grounds, the transition is called semi-forbidden and a
square bracket is put to the right of the ion involved: N⁺].

A transition is allowed (i.e., not forbidden) if the differences in the
quantum numbers of the states $|1\rangle$ and $|2\rangle$ obey the electric-dipole selection
rules. The most important quantum numbers characterizing these
states are the atom’s total angular momentum F, its total electronic
angular momentum J, its total electronic spin S and its total orbital
angular momentum L. The selection rules for these quantities are


$$\begin{aligned}
\delta F &= 0, \pm 1 \quad\text{but}\quad F = 0 \not\to F = 0,\\
\delta J &= 0, \pm 1 \quad\text{but}\quad J = 0 \not\to J = 0,\\
\delta L &= 0, \pm 1 \quad\text{but}\quad L = 0 \not\to L = 0,\\
\delta S &= 0.
\end{aligned} \tag{1}$$


F and J are always good quantum numbers but L and S are good quantum
numbers only to the extent that the Russell–Saunders or L–S
coupling scheme is applicable. The quantum states of atoms are generally
classified under the assumption that L–S coupling applies, although
this coupling scheme is never more than an approximation, and one that
deteriorates as one proceeds down the periodic table. In reality a state
$|1\rangle$ that is classified as having, say, S = 1, may be a linear combination
of a predominant state that has S = 1 and a state that has S = 0.
This mixed state can decay by emission of electric dipole radiation to a
state classified as having S = 0 without violation of the electric-dipole
selection rule. Such intercombination transitions are semi-forbidden.
Box 8.2: Spectroscopic Notation


Standard spectroscopic notation presumes that the L–S coupling scheme
is valid. The electronic configuration is a specification of the principal
n and orbital angular momentum l quantum numbers of the individual
electrons of the outermost shell. Within a configuration a spectroscopic
term specifies definite values for the total orbital L and spin S angular
momenta of the outer electrons. Within each term a fine-structure
level specifies a definite value for the total electronic angular momentum
J. Within a fine-structure level may be distinguished different hyperfine
levels that differ in total angular momentum F.


A typical configuration is denoted 2s2p³, meaning one electron has n = 2,
l = 0, and three electrons have n = 2, l = 1.


Terms are denoted by $^{(2S+1)}L_J$; for example $^4P_{1/2}$ means S = 3/2, L = 1,
J = 1/2.


Figure 8.10 Schematic energy-level diagram of O²⁺ and N⁺. In
the case of O²⁺ transition A has wavelength 436.3 nm while the
wavelengths of transitions B₁ and B₂ are 500.7 nm and 495.9 nm, respectively.
In the case of N⁺ the wavelengths are A: 575.5 nm, B₁: 658.3 nm, B₂:
654.8 nm.


Figure 8.11 Schematic energy-level diagram of O⁺ and S⁺. In the case
of O⁺ transition A has wavelength 372.9 nm while transition B has
wavelength 372.6 nm. In the case of S⁺ the wavelengths are A: 673.1 nm,
B: 671.6 nm.
above the ground term. The nebular lines that puzzled astronomers before
1927 are produced by transitions in O²⁺ between the top and middle levels
(at 436.3 nm) and between the middle and bottom levels (at ~ 500 nm).
Clearly the intensities of these lines will be proportional to the occupation
probabilities of their respective upper levels. In thermodynamic equilibrium
these would be in the ratio top:middle = $1 : e^{2.84\,\mathrm{eV}/kT}$, and thus be sensitive
to the plasma’s temperature T. As we have seen, the densities of HII regions
are so low that even states with long radiative decay times are out of
thermodynamic equilibrium, and we must calculate the required occupation
probabilities by considering the rates at which collisions and radiative decays
populate each state rather than simply invoking the Boltzmann factor.
Detailed theory shows that the relative intensity of the lines is nevertheless
approximately proportional to $e^{2.84\,\mathrm{eV}/kT}$.
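
The steep temperature dependence that makes this line ratio a useful thermometer is easy to see numerically. The sketch below evaluates only the exponential factor exp(2.84 eV/kT) quoted above, not the full rate-equation result; the temperatures are illustrative and the function names are hypothetical.

```python
import math

K_BOLTZMANN_EV = 8.617333e-5  # Boltzmann constant in eV / K
DELTA_E_EV = 2.84             # energy gap between the top and middle levels


def level_ratio(temperature_k):
    """Middle-to-top factor exp(2.84 eV / kT) quoted in the text."""
    return math.exp(DELTA_E_EV / (K_BOLTZMANN_EV * temperature_k))


def temperature_from_ratio(ratio):
    """Invert the exponential factor for T (idealized, ignoring all prefactors)."""
    return DELTA_E_EV / (K_BOLTZMANN_EV * math.log(ratio))


for t in (8000.0, 10000.0, 12000.0):
    print(f"T = {t:7.0f} K: exp(2.84 eV / kT) = {level_ratio(t):6.1f}")
```

Between 8000 K and 12000 K the factor changes by roughly a factor of four, so a modest error in the measured ratio translates into a small error in T.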


Figure 8.11 shows the second type of energy-level diagram that plays
an important role in studies of HII regions. The energy-level diagrams of N,
O⁺ and S⁺ are of this type. The density of an HII region can be estimated
from the relative intensities of the lines that connect the two middle levels,
$^2D_{3/2}$ and $^2D_{5/2}$, to the ground level. Again we argue that the intensities
of these lines are proportional to the occupation probabilities of their upper
levels. Since the energies of these differ by much less than kT, in thermodynamic
equilibrium the occupation probabilities will be in the ratio 4:6 of
the degeneracies of the J = 3/2 and J = 5/2 levels. At lower densities the
occupation probabilities will be determined by the relevant rate equations.
One finds that in the limit of low plasma density the ratio tends to 1:0.32.


8.1.4 Radio observations 


Radio telescopes provide some of the most powerful diagnostics of interstel- 
lar gas. To interpret radio-frequency data correctly, it is necessary to delve 
rather more deeply than we have so far into the way in which a beam of 
radiation interacts with material along its path. Our analysis of this prob- 
lem is entirely general and not restricted in its validity to radio frequencies; 
we present it here merely because we wish to apply it to radio-frequency 
observations. 


The specific intensity at frequency $\nu$, $I_\nu$, of a beam of radiation is the
energy flux in the beam per unit solid angle and per unit frequency.¹¹ In
distance ds along the beam, decays of atoms from excited states cause $I_\nu$ to
increase, while excitations of atoms to excited states cause $I_\nu$ to decrease.


¹¹ The units of $I_\nu$ are usually W Hz⁻¹ m⁻² sterad⁻¹.
Mathematically, we write¹²


$$\frac{dI_\nu}{ds} = \frac{h\nu}{4\pi}\left[n_2^{(\nu)}A_{21} - \left(n_1^{(\nu)}B_{12} - n_2^{(\nu)}B_{21}\right)\frac{4\pi I_\nu}{c}\right]. \tag{8.6}$$

Here $n_i^{(\nu)}$ is the number density of atoms capable of emitting (i = 2) or
absorbing (i = 1) a photon of frequency $\nu$, $A_{21}$ is the Einstein coefficient
that gives the probability for spontaneous emission of a photon, and $B_{21}$ is
the corresponding coefficient for stimulated emission. $B_{12}$ is the Einstein
coefficient for absorption of a photon of frequency $\nu$. Defining


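$$\kappa_\nu \equiv \frac{h\nu}{c}\left(n_1^{(\nu)}B_{12} - n_2^{(\nu)}B_{21}\right),\qquad
S_\nu \equiv \frac{c}{4\pi}\,\frac{n_2^{(\nu)}A_{21}}{n_1^{(\nu)}B_{12} - n_2^{(\nu)}B_{21}},\qquad
\tau_\nu \equiv \int \kappa_\nu\,ds, \tag{8.7}$$

equation (8.6) becomes¹³

$$\frac{dI_\nu}{d\tau_\nu} + I_\nu = S_\nu. \tag{8.8}$$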

$\tau_\nu$ is called the optical depth and $S_\nu$ the source function at frequency
$\nu$. Equation (8.8) is a first-order linear equation such as may be integrated
with an integrating factor $e^{\tau_\nu}$. In the especially simple case in which $S_\nu$ is
constant along the line of sight, the solution may be written


$$I_\nu(\tau_\nu) = I_\nu(0)e^{-\tau_\nu} + S_\nu\left(1 - e^{-\tau_\nu}\right). \tag{8.9}$$


Equation (8.9) states that the intensity at the observer is equal to the inten- 
sity at some distance decreased by an exponential factor plus a fraction of 
the source function. 
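
The constant-source solution can be checked against a direct numerical integration of the transfer equation. The sketch below assumes that equation (8.8) reads dI/dτ + I = S and that equation (8.9) reads I(τ) = I(0)e^(−τ) + S(1 − e^(−τ)); the integrator (Heun's method) and the numerical values are illustrative choices, not part of the text.

```python
import math


def emergent_intensity(i0, source, tau):
    """Analytic solution for constant source function: I(0) e^-tau + S (1 - e^-tau)."""
    return i0 * math.exp(-tau) + source * (1.0 - math.exp(-tau))


def integrate_transfer(i0, source, tau_total, steps=10000):
    """Integrate dI/dtau = S - I from 0 to tau_total with Heun's method."""
    intensity = i0
    dtau = tau_total / steps
    for _ in range(steps):
        k1 = source - intensity
        k2 = source - (intensity + dtau * k1)
        intensity += 0.5 * dtau * (k1 + k2)
    return intensity


# Arbitrary units: background intensity 2, source function 5, optical depth 3.
print(emergent_intensity(2.0, 5.0, 3.0), integrate_transfer(2.0, 5.0, 3.0))
```

For small optical depth the result stays close to the incident intensity, and for large optical depth it approaches the source function regardless of the background.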

¹² Einstein’s equation was $\dot N_2 = -A_{21}N_2 + (N_1B_{12} - N_2B_{21})\rho$, where $N_i$ is the total
number of atoms in the i-th energy level and $\rho$ is the radiation’s energy density per unit
frequency at the frequency of the transition. We obtain equation (8.6) by (a) focusing on
the atoms in unit volume; (b) recognizing that only $n_i^{(\nu)}d\nu$ atoms per unit volume could
emit/absorb a photon with frequency in $(\nu + d\nu, \nu)$; (c) figuring that each such transition
will contribute $\pm h\nu$ to the energy density $\rho\,d\nu$; (d) recalling that for the isotropic radiation
field considered by Einstein $\rho = 4\pi I_\nu/c$; (e) setting ds = c dt.

¹³ In the theory of stellar atmospheres, $\tau$ is generally defined such that it increases away
from rather than towards the observer, as here. Defining $\tau$ in that way changes signs in
equation (8.9).


We now assume that the material through which the beam passes is
in thermodynamic equilibrium at temperature T. Then the value of $I_\nu(\tau_\nu)$
given by (8.9) must tend to the Planck intensity¹⁴ $B_\nu$ in the limit of long
path length s, that is the limit $\tau_\nu \to \infty$. Clearly, this implies that $S_\nu = B_\nu$
in any material that is in thermodynamic equilibrium, and equation (8.8)
becomes

$$\frac{dI_\nu}{d\tau_\nu} + I_\nu = B_\nu(T). \tag{8.10}$$


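Finally, when we replace $S_\nu$ in equation (8.7) by $B_\nu$ and exploit the fact that
in thermodynamic equilibrium

$$\frac{n_\nu^{(2)}}{n_\nu^{(1)}} = \frac{g_2}{g_1}\exp\left(-\frac{h\nu}{kT}\right), \qquad (8.11)$$

where $g_i$ is the number of quantum states associated with energy level $i$, we
obtain Einstein's relations

$$A_{21} = \frac{8\pi h\nu^3}{c^3}B_{21}, \qquad g_1B_{12} = g_2B_{21}. \qquad (8.12)$$

At radio frequencies, we are generally working in the Rayleigh-Jeans
regime, $h\nu \ll kT$, in which we have the approximation

$$B_\nu(T) \simeq \frac{2\nu^2kT}{c^2}. \qquad (8.13)$$

Equation (8.9) becomes

$$I_\nu(\tau_\nu) = I_\nu(0)e^{-\tau_\nu} + \frac{2\nu^2kT}{c^2}\left(1 - e^{-\tau_\nu}\right). \qquad (8.14)$$

Equation (8.14) has led radio astronomers to report surface brightness
measurements in terms of the brightness temperature, $T_B$, defined by

$$T_B \equiv \frac{c^2}{2\nu^2k}I_\nu. \qquad (8.15)$$

With this definition equation (8.14) becomes

$$T_B(\tau_\nu) = T_B(0)e^{-\tau_\nu} + \left(1 - e^{-\tau_\nu}\right)T. \qquad (8.16)$$

In the optically-thin limit of small $\tau_\nu$, we have for the case of negligible
$T_B(0)$

$$T_B(\tau_\nu) \simeq \tau_\nu T. \qquad (8.17)$$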


14 $B_\nu = (2h\nu^3/c^2)(e^{h\nu/kT} - 1)^{-1}$ is the specific intensity radiated by a black body. When
multiplied by $\cos\theta$ and integrated over a hemisphere and all frequencies we obtain the
Stefan-Boltzmann law for the energy radiated per unit area per unit time from a black
body: $f = \int d\Omega\,d\nu\,\cos\theta\,B_\nu = \sigma T^4$.

Under our assumption of constant $S_\nu$, $\tau_\nu$ is, by its definition (8.7), proportional
to the column density of material along the line of sight. Thus when
the medium is optically thin and $T_B(0)$ is negligible, $T_B$ is likewise proportional
to the column density along the line of sight. When the medium is
optically thick, $T_B$ measures the medium's temperature rather than its column
density. We shall see that the ISM is often optically thin in the 21-cm
line of atomic hydrogen and optically thick in the widely-observed 2.6-mm
line of carbon monoxide.
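
The two limits just described can be checked numerically. The following is a minimal sketch, assuming a uniform medium of kinetic temperature $T$ so that equation (8.16) applies; the function name `brightness_temperature` and the sample values of $T$ and $\tau_\nu$ are illustrative choices rather than anything specified in the text.

```python
import math

def brightness_temperature(T, tau, TB0=0.0):
    """Equation (8.16): T_B = T_B(0) exp(-tau) + (1 - exp(-tau)) T.

    T   : kinetic temperature of the medium in K (assumed uniform)
    tau : optical depth along the line of sight
    TB0 : brightness temperature of any background source in K
    """
    attenuation = math.exp(-tau)
    return TB0 * attenuation + (1.0 - attenuation) * T

# Optically thin case: T_B is close to tau * T, so it traces column density.
thin = brightness_temperature(T=100.0, tau=0.01)
# Optically thick case: T_B saturates at the gas temperature.
thick = brightness_temperature(T=100.0, tau=20.0)
print(thin, thick)
```

In the thin case the result differs from $\tau_\nu T$ only at second order in $\tau_\nu$, while in the thick case all information about the column density is lost.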

Observers frequently report results in terms of the measured antenna
temperature $T_A$. This is the temperature required of a black-body that
fills the telescope beam in order to generate the same signal as the source
being measured. If the source fills the beam (including any side-lobes), then
$T_A = T_B$. But if the source fills only a fraction $\alpha$ of the beam, $T_A = \alpha T_B$
and $T_A$ is said to be depressed below $T_B$ by beam dilution. Observers
tend to report $T_A$ because it is the directly measurable quantity and the
degree of beam dilution is frequently difficult to determine. Since the angular
resolution of many radio telescopes is comparatively poor, in many studies
the object does not fill the beam, with the result that $T_A$ is significantly
smaller than $T_B$.


The 21-cm line of atomic hydrogen  The ground state of the gross
structure of atomic hydrogen is split into two hyperfine levels by the interaction
between the spins of the electron and proton; the atom's energy is
$6\times10^{-6}$ eV lower when the two spins are antiparallel than when they are
parallel. Thus the true ground state has total angular momentum $F = 0$,
above which lie three degenerate states with angular momentum $F = 1$.
Photons with frequency $\nu = 1.4204$ GHz or wavelength 21.105 cm mediate
transitions between the two levels. Since the temperature of the ISM cannot
fall below that of the cosmic background radiation, 2.7 K, which corresponds
to $kT = 2.3\times10^{-4}$ eV, the approximation $h\nu \simeq 5.9\times10^{-6}\,{\rm eV} \ll kT$ is
invariably valid in connection with the 21-cm line.
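
The numbers quoted in this paragraph are easy to verify. The short sketch below uses the frequency and temperature given in the text together with CODATA values of the physical constants; the variable names are illustrative.

```python
H_PLANCK = 6.62607015e-34   # Planck constant, J s
K_BOLTZ = 1.380649e-23      # Boltzmann constant, J / K
EV = 1.602176634e-19        # joules per electron volt

NU_21CM = 1.4204e9          # Hz, frequency of the 21-cm line (from the text)
T_CMB = 2.7                 # K, cosmic background temperature (from the text)

photon_energy_ev = H_PLANCK * NU_21CM / EV
thermal_energy_ev = K_BOLTZ * T_CMB / EV
ratio = photon_energy_ev / thermal_energy_ev

print(f"h nu = {photon_energy_ev:.2e} eV")
print(f"k T  = {thermal_energy_ev:.2e} eV")
print(f"h nu / kT = {ratio:.3f}")
```

Even at the lowest temperature the ISM can have, the photon energy is only a few per cent of $kT$, which is why the Rayleigh-Jeans approximation is safe for this line.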

Since neither the $F = 0$ nor any of the $F = 1$ states of atomic hydrogen
has an electric dipole moment, the absorption or emission of 21-cm photons
is 'forbidden'. Thus the lifetime of the excited level is long ($1.1\times10^7$ yr) and
the values of the associated Einstein constants $A_{21}$ and $B_{12}$ are extremely
small. At densities of interest (n ~ 10° m7), the rate of collisional excitation 
and de-excitation is very fast (~ 400yr~') compared to the spontaneous 
decay rate, so equation (8.11) holds accurately with T equal to the kinetic 
temperature of the gas. 

Let us now calculate the optical depth in the 21-cm line. From equations
(8.7), (8.11) and (8.12) we obtain in the Rayleigh-Jeans regime

$$\tau_\nu = \left(\frac{B_{12}h^2\nu^2}{ck}\right)\frac{N_\nu^{(1)}}{T}, \qquad (8.18)$$

where $N_\nu^{(1)} \equiv \int ds\,n_\nu^{(1)}$ is the column density of hydrogen atoms capable of

Box 8.3: Spin Temperature


The populations $n(F = 1)$ and $n(F = 0)$ of the $F = 0, 1$ levels of
hydrogen are characterized by the spin temperature, $T_{\rm spin}$, which is
defined by

$$\frac{n(F = 1)}{n(F = 0)} = 3\exp\left(-\frac{h\nu_0}{kT_{\rm spin}}\right), \qquad (1)$$

where $\nu_0$ is the frequency of the 21-cm transition. At the low densities
that occur astrophysically $T_{\rm spin}$ usually differs from the kinetic
temperature. Naively, one would expect $T_{\rm spin} < T_{\rm kinetic}$ at densities,
n S$ 5x 10%m~°, so low that the collisional excitation time exceeds 
$10^7$ yr. However, Watson & Deguchi (1984) show that at very low densities,
$T_{\rm spin} \gtrsim T_{\rm kinetic}$ as a result of the tendency of resonant scattering
of Ly$\alpha$ photons to equalize the occupation probabilities of the hyperfine
states. In the presence of a powerful 21-cm continuum radiation field,
such as that emitted by an AGN, one may even have $T_{\rm spin} \gg T_{\rm kinetic}$.


absorbing at frequency $\nu$. An individual hydrogen atom has negligible probability
of absorbing photons whose frequencies lie outside an extremely narrow
frequency range ($\delta\nu \lesssim 5\times10^{-15}$ Hz) in the atom's rest frame. Consequently,
$N_\nu^{(1)}$ is determined by the distribution of atoms over radial velocity:

$$N_\nu^{(1)}\,d\nu = N_1 f[(1 - \nu/\nu_0)c]\,dv, \qquad (8.19)$$

where $N_1 \equiv \int d\nu\,N_\nu^{(1)}$, $dv = c\,d\nu/\nu$ and $f(v)\,dv$ is the fraction of all atoms
on the line of sight with radial velocity in the range $(v + dv, v)$. Since
$kT \gg h\nu_0$, we have from equation (8.11) that $N_2/N_1 \simeq g_2/g_1 = 3$, and thus
that $N_1 \simeq \frac14 N_{\rm H}$. Inserting this approximation and the appropriate numerical
values into equations (8.18) and (8.19), we have finally

$$\tau_\nu = C f[(1 - \nu/\nu_0)c]\,\frac{N_{\rm H}}{T}, \quad\hbox{where}\quad C \equiv \frac{B_{12}h^2\nu}{4k} = 5.489\times10^{-20}\,{\rm m^3\,s^{-1}\,K}. \qquad (8.20)$$


Three factors determine f(v): (i) the random thermal motions associ- 
ated with the gas temperature T; (ii) the random velocities that different 
macroscopic volumes of gas possess because the ISM is turbulent; (iii) the 
velocities that different macroscopic volumes of gas have because there are 
large-scale ordered velocity gradients in the ISM, for example on account of 
differential Galactic rotation. Thermal motions contribute a characteristic
width $\sim 1\,{\rm km\,s^{-1}}$ to $f(v)$ (see Problem 8.1), while studies of 21-cm emission
from face-on galaxies suggest that turbulence contributes a width of order
$7\,{\rm km\,s^{-1}}$ (van der Kruit & Shostak 1984, Hanson, Dickey & Helou 1989).
The width contributed to $f(v)$ by Galactic rotation varies enormously with
the longitude $l$ of the line of sight. In the center and anticenter directions
it is negligible, and it can exceed $100\,{\rm km\,s^{-1}}$ at intermediate directions (see
Problem 9.2). Figure 8.12 shows $\tau_\nu(l)$ for a simple model of the galactic disk.
In the center and anticenter directions the optical depth exceeds unity with
the result that the brightness temperature lies close to the gas temperature
$T$, while at $l \sim 120^\circ$ the optical depth falls to $\tau_\nu \simeq 0.5$.

Figure 8.12 Peak optical depth in the 21-cm line as a function of
longitude for a naive model of the galactic disk. The number density of
hydrogen atoms has been assumed to equal $2\times10^5\,{\rm m^{-3}}$ at $R_0$ and to
vary with $R$ as $\exp(-R/2R_0)$. The circular speed is given by $v_c(R) =
(R/R_0)^{0.1} \times 210\,{\rm km\,s^{-1}}$ and the emitting atoms have a velocity dispersion
$\sigma = 0.04\,v_c(R_0) \simeq 8.4\,{\rm km\,s^{-1}}$.

Since the radial-velocity distribution $f(v)$ is normalized to ensure that
$\int dv\,f = 1$, the value of the total neutral hydrogen column density $N_{\rm H}$ can
be obtained by integrating equation (8.20) over velocity. With velocities
measured in ${\rm km\,s^{-1}}$, this yields

$$N_{\rm H} = 1.82\times10^{22}\int_{-\infty}^{\infty} dv\,T\tau(v)\ \hbox{atoms m}^{-2}. \qquad (8.21)$$

If the material is optically thin, we may replace $T\tau(v)$ by the brightness
temperature $T_B$ [see equation (8.17)]:

$$N_{\rm H}(l,b) = 1.82\times10^{22}\int dv\,T_B(l,b,v)\ \hbox{atoms m}^{-2}. \qquad (8.22)$$

When applicable, this equation is to be preferred to equation (8.21) because
$T_B$ is directly measured while $T$ and $\tau(v)$ are not easily determined.
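
As an illustration of how the optically thin relation is applied to a sampled spectrum, here is a minimal sketch. It assumes the standard 21-cm coefficient of $1.82\times10^{22}$ atoms m$^{-2}$ per K km s$^{-1}$ (equivalently $1.82\times10^{18}$ cm$^{-2}$), a simple trapezoidal integration, and a made-up Gaussian line profile; the function name `column_density` and the test profile are hypothetical.

```python
import math

# Assumed coefficient: atoms m^-2 per (K km/s), the SI form of the
# standard 21-cm value 1.82e18 cm^-2 (K km/s)^-1.
COEFF = 1.82e22

def column_density(velocities_kms, tb_kelvin):
    """Trapezoidal estimate of N_H from samples of T_B(v), optically thin case."""
    total = 0.0
    for i in range(1, len(velocities_kms)):
        dv = velocities_kms[i] - velocities_kms[i - 1]
        total += 0.5 * (tb_kelvin[i] + tb_kelvin[i - 1]) * dv
    return COEFF * total

# Hypothetical Gaussian line: 10 K peak, 8.4 km/s velocity dispersion.
sigma, peak = 8.4, 10.0
v = [-100.0 + 0.5 * i for i in range(401)]
tb = [peak * math.exp(-0.5 * (x / sigma) ** 2) for x in v]

n_h = column_density(v, tb)
print(f"N_H = {n_h:.3e} atoms m^-2")
```

For a Gaussian profile the integral is $\sqrt{2\pi}\,\sigma\,T_{\rm peak}$, which gives a convenient check on the numerical result.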

When we observe an external galaxy, we can determine its total atomic
hydrogen content $\mathcal{N}_{\rm H}$ by integrating the HI column density over the surface
area of the system. If we write an element of area as $dS = D^2\,d\Omega$, where $D$
is the distance to the galaxy and $d\Omega$ is an element of solid angle on the sky,
and if we write the angle- and velocity-dependent brightness temperature as
$T_B(l,b,v)$, then

$$\mathcal{N}_{\rm H} = 1.82\times10^{22}\,D^2\int_{-\infty}^{\infty} dv\int_\Omega d\Omega\,T_B(l,b,v)\ \hbox{atoms}, \qquad (8.23)$$

where $\Omega$ is the solid angle subtended by the whole galaxy. The corresponding
mass of hydrogen is

$$\frac{M_{\rm H}}{M_\odot} = 2.36\times10^5\left(\frac{D}{\rm Mpc}\right)^2\frac{\int_{-\infty}^{\infty} dv\,S(v)}{\rm Jy\,km\,s^{-1}}, \qquad (8.24)$$

where $S(v)$ is the flux density received from the galaxy at velocity $v$.
Notice that both $M_{\rm H}$ and the total luminosity $L$ of a galaxy are proportional
to $D^2$, so that the ratio $(M_{\rm H}/L)$ is independent of the often uncertain
distance to an external galaxy.
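
The $D^2$ scaling of the hydrogen mass can be made concrete with a short sketch. It assumes the standard calibration $M_{\rm H}/M_\odot = 2.36\times10^5\,(D/{\rm Mpc})^2 \int S\,dv/({\rm Jy\,km\,s^{-1}})$; the function name `hi_mass_solar` and the sample distance and flux integral are hypothetical values chosen only for illustration.

```python
def hi_mass_solar(distance_mpc, flux_integral_jy_kms):
    """HI mass in solar masses from the velocity-integrated 21-cm flux density.

    distance_mpc         : distance D to the galaxy in Mpc
    flux_integral_jy_kms : integral of S(v) dv in Jy km/s
    """
    return 2.36e5 * distance_mpc ** 2 * flux_integral_jy_kms

# Hypothetical galaxy: 50 Jy km/s of integrated flux, placed at 10 and 20 Mpc.
m_near = hi_mass_solar(10.0, 50.0)
m_far = hi_mass_solar(20.0, 50.0)
print(f"M_H(10 Mpc) = {m_near:.3e} M_sun")
print(f"mass ratio for doubled distance = {m_far / m_near:.1f}")
```

Doubling the assumed distance quadruples the inferred mass, exactly as it would quadruple the inferred luminosity, which is why the ratio of the two does not depend on $D$.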


Rotation transitions of heteronuclear molecules  The spectra of several
fairly abundant molecules such as CO, CS and HCN have readily observed
mm-band lines. These lines are formed by molecules changing their
angular momentum by $\hbar$ while retaining the same electronic and vibrational
quantum numbers. On account of the constancy of these quantum numbers,
the energies of these transitions are much smaller than those of the
compound transitions of H₂ studied in §3.7, and the lines fall in the mm
wavebands rather than in the ultraviolet. These mm-band lines constitute
powerful probes of the denser and colder components of the ISM. Much the 
most important lines are those of CO at wavelengths 2.6mm and 1.3mm. 
We shall concentrate on these lines, but most of our analysis applies with 
minor modification to lines of other molecules. 

Unlike homonuclear molecules such as H₂ and O₂, a heteronuclear
molecule such as CO has a net dipole moment, and consequently should
radiate when it spins. Quantum-mechanically, this fact manifests itself in
the Einstein constants, $A_{21}$ etc., for the mm-lines of CO being much larger
than the Einstein constants associated with the 21-cm line.

The differences between the relevant Einstein constants for CO and for 
HI have two important consequences. First the lifetimes of rotationally ex- 
cited levels of CO are relatively short, and unless the gas density and collision 
frequency are high, equation (8.11) may be violated in a way that it is not in 
the case of HI. Second, a much smaller column density of CO molecules than 
of HI atoms is required to establish a given optical depth in the relevant lines 
[see equation (8.7)]. In fact, the intrinsic strength of the rotation transitions 
of CO effectively compensates for the low abundance of CO molecules relative
to hydrogen, so that the strongest CO lines saturate at similar column
densities of all atoms as does the 21-cm line ($\sim 2\times10^{24}$ atoms ${\rm m^{-2}/km\,s^{-1}}$).
Table 8.1 gives the column densities of CO and H₂ molecules, distributed
over a velocity range of $1\,{\rm km\,s^{-1}}$, that will yield unit optical depth at 20 K,
a typical temperature for molecular material.¹⁵ These values apply only if
the gas is at a sufficiently high spatial density for equation (8.11) to hold; at
lower spatial densities, unit optical depth will be achieved at higher column
densities than those given in the table. Notice that the 1.3-mm line arising from


15 The column density required for a given line to saturate scales as $T^2$ (Goldreich &
Kwan 1973).

Table 8.1 CO rotation lines

Species    Relative abundance   Transition     N_{τ=1}(¹²CO)    N_{τ=1}(H₂)

¹²C¹⁶O     1                    J=2 → J=1      5.5 × 10^19      3.9 × 10^23
                                1 → 0          2.2 × 10^20      1.7 × 10^24
¹³C¹⁶O     1/65                 2 → 1          3.6 × 10^21      2.8 × 10^25
                                1 → 0          1.5 × 10^22      1.1 × 10^26
¹²C¹⁸O     1/500                2 → 1          2.8 × 10^22      2.2 × 10^26
                                1 → 0          1.1 × 10^23      8.5 × 10^26

NOTES: For each species, N_{τ=1}(¹²CO) is the column density in ${\rm m^{-2}/km\,s^{-1}}$
of ¹²C¹⁶O molecules at $T = 20$ K that yields optical depth unity in the
given line; N_{τ=1}(H₂) is the corresponding column density of molecular
hydrogen for an assumed ratio $N(^{13}{\rm CO})/N({\rm H_2}) = 2\times10^{-6}$ (Dickman 1978).


the $J = 2 \to J = 1$ transition saturates at a column density that is a factor
4 smaller than the column density at which the 2.6-mm $J = 1 \to J = 0$ line
saturates.

Besides the common isotope ¹²C¹⁶O (or ¹²CO in astronomical shorthand)
there are the rarer forms ¹³C¹⁶O (¹³CO) and ¹²C¹⁸O (C¹⁸O), whose
abundances are locally approximately 65 and 500 times smaller than that
of the common isotope (Langer 1997). Each of these species forms lines of
slightly different frequency, and since each line saturates at a different total
column density of all material (Table 8.1), valuable information about the
structure of clouds can be deduced by comparing the lines associated with
each species. Phillips et al. (1979) made such comparisons for lines of sight
through several dark clouds in the Milky Way. They find that the antenna
temperature in the 1.3-mm ¹²CO line is invariably very similar to that in the
2.6-mm ¹²CO line. From this observation, it follows that the cloud must be
optically thick in both these lines because, if it were thin in the less opaque
line (the 2.6-mm line), the latter would yield a lower antenna temperature.
Figure 8.13 shows the spatial profiles of the dark cloud in NGC 2024 when
observed in the 1.3-mm lines of ¹²CO, ¹³CO, and C¹⁸O. The peak intensities
of these radiations are in the ratio 1:3:2, from which it follows that the cloud
must be optically thick in the 1.3-mm line of ¹³CO because, if it were thin
in this line, it would be thin also in the corresponding line of C¹⁸O, which
would then be fainter than the ¹³CO line by a factor of $500/65 \simeq 8$, compared
to the observed value of 2.5. Comparison of the 1.3-mm and 2.6-mm
¹³CO antenna temperatures of several clouds confirms that ¹³CO is usually
optically thick at line center. If either of these lines were optically thin, the
1.3-mm temperature would be nearly a factor of four higher than the 2.6-mm
temperature, whereas they are, in fact, comparable. Nevertheless, the
¹³CO temperatures are invariably at least a factor two lower than the ¹²CO
temperatures. This behavior suggests that the unit-optical-depth surface of
¹³CO has either a smaller radius or a lower brightness temperature than that
of ¹²CO.

Figure 8.13 The dark cloud NGC 2024 traced in the 1.3-mm line of
three species of CO. The top profile suggests that the region that is optically
thick in the ¹²CO line is spatially resolved and the gas has kinetic temperature
near 20 K. Only a smaller portion of the cloud is optically thick
in the ¹³CO line, and the C¹⁸O line may be optically thin everywhere.
Notice that the peak antenna temperatures differ by much less than the
abundances of the various species. [After Phillips et al. (1979)]

From these results it follows that no molecular line is ideally suited to 
a determination of N(H2): a line that is weak enough to be optically thin 
through a cloud’s core will be too weak to measure accurately towards the 
cloud’s periphery, where much of the mass resides. Moreover, the abundances 
of the different isotopes studied are uncertain. One cause of uncertainty is 
that the abundance ratios $n({\rm H_2})/n(^{12}{\rm C})$ and $n(^{12}{\rm C})/n(^{13}{\rm C})$ are affected by
the passage of interstellar material through stars (astration) with the consequence
that $n(^{12}{\rm C})/n(^{13}{\rm C})$ is believed to increase from a value as low as
20 at the Galactic center to perhaps 90 at large Galactic radii (Langer 1997).
Another source of uncertainty is the possibility that one isotope may have a 
greater propensity than another to be locked up in CO molecules. Finally, 
variations in temperature and density within a cloud both affect the emis- 
sivity of CO molecules in a way that contrasts with the near temperature- 
and density-independence of the 21-cm emissivity of interstellar H atoms. To 
solve these problems properly, one would have to model the radiative trans- 
fer of each line studied through a detailed model of each observed cloud, 
adjusting the temperatures and densities within the clouds until the pre- 
dicted maps of Tg agreed with those observed. Not only would this be a 
formidable undertaking, but it presupposes that no brightness temperature 
varies on angular scales too small to be resolved. In practice even the nearest
sources seem to contain unresolved structure (see §9.6.1). Consequently,
detailed modeling of the type described is not feasible. Nonetheless, we shall
see in §9.6.1 that when clouds in the Milky Way are observed at relatively
low angular resolution, the antenna temperatures of weak lines are roughly
proportional to the antenna temperatures of strong lines. This suggests that
the average column density of CO within a large beam is roughly proportional
to the integral over velocities of the antenna temperature in a strong line
such as the 2.6-mm line of CO.

Let us define

$$I_{\rm CO} \equiv \int dv\,T_A(\hbox{2.6-mm line of }^{12}{\rm CO}). \qquad (8.25)$$

The units of $I_{\rm CO}$ are ${\rm K\,km\,s^{-1}}$. The following naive theoretical argument
suggests that the mass of a molecular cloud will be proportional to its value
of $I_{\rm CO}$. When large molecular clouds are observed at high resolution, they
are found to be rich in internal structure; knots and filaments are apparent
and there is every reason to believe that within these there is even smaller-scale,
unresolved structure (see §9.6.1). A crude model of this complexity is to
imagine that the cloud is made up of a large number of cloudlets. Irrespective
of whether it is optically thick in a given CO line, each cloudlet would in
isolation contribute a definite amount to $I_{\rm CO}$. Now consider the value of
$I_{\rm CO}$ that we will measure for a big cloud. The contributions to $I_{\rm CO}$ from
individual cloudlets will add unless they frequently shadow each other in the
sense that behind the optically thick portion of one cloudlet there is another
with a velocity which differs from the velocity of the first cloudlet by less than
twice the line-width $\sigma$ of a cloudlet. Clearly, if shadowing is unimportant,
the value of $I_{\rm CO}$ measured for the entire cloud will be proportional to the
number of cloudlets and therefore to the mass $M$ of the cloud. In fact, $I_{\rm CO}$
will be proportional to $M$ even if shadowing is a significant effect, providing
that the fraction of all cloudlets that are shadowed at a given time does not
vary with $M$.

Suppose the cloud has radius $R$ and each cloudlet has mass $m$. Then
there are $M/m$ cloudlets in the cloud, and along a line of sight through the
center of the cloud there are $\sim M/(mR^2)$ cloudlets per unit area. Suppose
each cloudlet is optically thick in the relevant line over area $A$, and let $\Delta$
be the velocity dispersion associated with the random motion of cloudlets
through the cloud. Then shadowing will be important if and only if the
'shadowing factor'

$$\xi \equiv \left(\frac{MA}{mR^2}\right)\left(\frac{\sigma}{\Delta}\right) \qquad (8.26)$$

is greater than unity: the first factor gives the mean number of cloudlets
whose centers project within the opaque area of a randomly chosen cloudlet,
and the second factor gives the probability that the velocity ranges of the
two cloudlets significantly overlap.

Now suppose that the overall dynamics of the cloud resemble those of
a star cluster, with the mutual gravitational attraction of the cloudlets balanced
by the random motions of the cloudlets. Then by the virial theorem
(e.g., BT §4.3) $\Delta$ will satisfy $\Delta \simeq (GM/R)^{1/2}$. Using this to eliminate $\Delta$
from (8.26) and rearranging slightly, we obtain

$$\xi \simeq \left(\frac{A\sigma}{mG^{1/2}}\right)\rho^{1/2}, \qquad (8.27)$$

where $\rho \equiv M/R^3$ is a characteristic mean density of the cloud. In equation
(8.27) for the shadowing factor, the bracketed term depends only on the
characteristics of individual cloudlets, which it is reasonable to assume do
not vary from cloud to cloud. Hence we expect crowding to become more
or less important with increasing $M$ according to whether $\rho$ increases or
decreases with $M$. Observations show that $\rho$ decreases with $M$ (see §9.6.1).
Moreover, crowding is clearly not significant if $M$ is so small that the cloud
is made up of only one, two or a handful of cloudlets. So we conclude that
crowding is never important, and that $I_{\rm CO}$ is always proportional to $M$. All
that remains is to calibrate this proportionality.

Studies of molecular clouds in the Milky Way that are discussed in §9.6.1
yield the calibration (Strong et al. 1988)

$$X \equiv \frac{N({\rm H_2})/{\rm m^{-2}}}{I_{\rm CO}/{\rm K\,km\,s^{-1}}} \simeq 2.3\times10^{24}. \qquad (8.28)$$

It must be emphasized that this is a rough empirical relation rather than
one such as equation (8.22) that is based on good physics. Moreover, it is
based on studies of the Milky Way (see §9.6.1) and it cannot be applied with
confidence to galaxies of very different luminosities or morphological type. In
particular, one would expect $X$ to decrease with increasing metallicity, and
observation confirms this expectation. Wilson (1995) finds that

$$X \simeq 3\times10^{24}\times10^{(6\pm0.9) - (0.7\pm0.1)[12 + \log_{10}({\rm O/H})]}. \qquad (8.29)$$

This fit to the data corresponds to a five-fold increase in $X$ for a tenfold
decrease in the metallicity.
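
A minimal sketch of how such a calibration is used follows. It takes the Galactic value $X = 2.3\times10^{24}$ from equation (8.28), and it assumes that the Wilson (1995) fit is logarithmic in $X$, that is $\log_{10}(X/3\times10^{24}) = 6 - 0.7\,[12 + \log_{10}({\rm O/H})]$ with the central values of the coefficients and the quoted uncertainties ignored. The function names and the sample abundances are hypothetical.

```python
X_GALACTIC = 2.3e24  # H2 molecules m^-2 per (K km/s), equation (8.28)

def n_h2_from_ico(i_co_k_kms, x_factor=X_GALACTIC):
    """H2 column density (m^-2) implied by an integrated CO intensity."""
    return x_factor * i_co_k_kms

def x_factor_from_metallicity(oxygen_abundance):
    """Assumed logarithmic reading of the Wilson (1995) fit.

    oxygen_abundance is 12 + log10(O/H); uncertainties on the
    coefficients are ignored in this sketch.
    """
    return 3e24 * 10.0 ** (6.0 - 0.7 * oxygen_abundance)

# Column density for a hypothetical line of sight with I_CO = 10 K km/s.
print(f"N(H2) = {n_h2_from_ico(10.0):.2e} m^-2")

# Change in X when the oxygen abundance differs by a factor of ten.
ratio = x_factor_from_metallicity(7.9) / x_factor_from_metallicity(8.9)
print(f"X(7.9) / X(8.9) = {ratio:.2f}")
```

With a slope of 0.7 in the logarithm, a factor of ten in metallicity corresponds to a factor of $10^{0.7} \simeq 5$ in $X$, which is the factor-of-five change quoted in the text.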


Synchrotron radiation  A charged particle that moves in a magnetic
field spirals around field lines and radiates because the Lorentz force is
accelerating it towards the field line along which it is spiralling. If the particle's
velocity is sub-relativistic, the radiation it emits is nearly monochromatic;
it is called cyclotron radiation and its frequency is the gyro-frequency
$\nu_g = qB/2\pi m$, where $q$ and $m$ are the particle's charge and mass, respectively.
If the particle is at relativistic velocity $v$, it radiates broad-band
radiation which peaks at the critical frequency $\nu_c = \frac32\gamma^2\nu_g\sin\theta$, where
$\gamma = (1 - v^2/c^2)^{-1/2}$ is the particle's Lorentz factor and $\theta = \cos^{-1}({\bf v}\cdot{\bf B}/vB)$
is the pitch angle of the helix it describes while traveling through the magnetic
field. The power radiated is

$$-m_0c^2\frac{d\gamma}{dt} = 2\left(\frac{q^4}{6\pi\epsilon_0^2m_0^2c^4}\right)c\,\rho_B\,\gamma^2(v/c)^2\sin^2\theta, \qquad (8.30)$$

where $\rho_B = B^2/2\mu_0$ is the energy density of the magnetic field. Since this
power is proportional to $(q/m)^2$, electrons are more than three million times
more effective at generating synchrotron radiation than protons, and nearly
13 million times as effective as most other nuclei. Consequently, observed
synchrotron radiation is generally assumed to have been generated by electrons.
In the case of electrons, the term in the large round bracket in equation
(8.30) evaluates to the Thomson cross-section, $\sigma_T = 6.7\times10^{-29}\,{\rm m^2}$.
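
The orders of magnitude involved can be illustrated with a short sketch. It assumes the standard expressions $\nu_g = qB/2\pi m$ and $\nu_c = \frac32\gamma^2\nu_g\sin\theta$, takes the mass-ratio factor as $(m_p/m_e)^2$, and uses a field of 0.6 nT and a Lorentz factor of $10^4$ purely as sample inputs; the function names are illustrative.

```python
import math

E_CHARGE = 1.602176634e-19     # elementary charge, C
M_ELECTRON = 9.1093837015e-31  # electron mass, kg
M_PROTON = 1.67262192369e-27   # proton mass, kg

def gyro_frequency(q, m, b_tesla):
    """Non-relativistic gyro-frequency nu_g = q B / (2 pi m), in Hz."""
    return q * b_tesla / (2.0 * math.pi * m)

def critical_frequency(gamma, q, m, b_tesla, pitch_angle):
    """Assumed standard form nu_c = (3/2) gamma^2 nu_g sin(theta), in Hz."""
    return 1.5 * gamma ** 2 * gyro_frequency(q, m, b_tesla) * math.sin(pitch_angle)

B_FIELD = 0.6e-9  # T, sample interstellar field strength

nu_g = gyro_frequency(E_CHARGE, M_ELECTRON, B_FIELD)
nu_c = critical_frequency(1.0e4, E_CHARGE, M_ELECTRON, B_FIELD, math.pi / 2)
power_ratio = (M_PROTON / M_ELECTRON) ** 2

print(f"electron gyro-frequency  = {nu_g:.1f} Hz")
print(f"critical frequency       = {nu_c:.2e} Hz for gamma = 1e4")
print(f"electron/proton factor   = {power_ratio:.2e}")
```

An electron gyrates at only a few tens of Hz in such a field, so GHz radio emission requires Lorentz factors of order $10^4$.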

The solar neighborhood proves to contain relativistic particles ('cosmic
rays') gyrating around a pervasive magnetic field of strength $B \simeq 0.6$ nT
(Rand & Kulkarni 1989). Electrons make a comparatively small contribution
to the energy density of the cosmic rays that reach the Earth, but in interstellar
space, far from the disturbing influences of the solar wind and the Earth's
magnetic field, electrons are probably a major constituent of cosmic rays.
At the Earth the overwhelming majority of cosmic-ray particles are protons.
Helium nuclei are 7 times less common than protons and most other nuclei
more than a thousand times less common.¹⁶

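The number of particles $N(\gamma)\,d\gamma$ with Lorentz factors in the range
$(\gamma + d\gamma, \gamma)$ is approximately given by

$$N(\gamma) \propto \cases{\gamma^{-2.5} & for $\gamma \lesssim 10^6$,\cr \gamma^{-3.1} & for $10^6 \lesssim \gamma \lesssim 10^{11}$.\cr} \qquad (8.31)$$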


The total energy density due to these particles is $\rho_{\rm CR} \simeq 3.9\times10^{-13}\,{\rm J\,m^{-3}}$
(Holzer 1989), which is comparable to the energy density $\rho_B \simeq 1.4\times10^{-13}\,{\rm J\,m^{-3}}$
of the interstellar magnetic field. This energy is concentrated
in the lowest-energy particles, which are at most mildly relativistic. Hence 
cosmic rays should be thought of as comprising suprathermal particles; 
only some cosmic-ray particles are truly relativistic, but all have kinetic en- 
ergies much greater than the kinetic energies of the thermal particles past 
which they stream. 

A population of gyrating particles with Lorentz factors that are distributed
as a power law $N(\gamma) \propto \gamma^{-x}$ generates synchrotron radiation with
an emissivity $j_\nu \propto \nu^{-(x-1)/2}$ that is a power law in frequency. At high
frequencies the spectra of many synchrotron sources, for example young
supernova remnants and the lobes of radio galaxies, are observed to fall off as
$f_\nu \propto \nu^{-0.75}$. If these sources are optically thin to synchrotron radiation, it
follows that the underlying electron spectrum has $x \simeq 2.5$, consistent with
equation (8.31). At low frequencies the spectra of synchrotron sources tend
to $f_\nu \propto \nu^{5/2}$ as a result of self-absorption:¹⁷ a synchrotron photon from one
electron is liable to be absorbed by another. It turns out that the fraction
of the photons that are emitted into the line of sight that do not reach us
because they are absorbed en route increases with diminishing frequency $\nu$.
Consequently, an optically thick source, in which the absorption of photons
is important, becomes faint at low frequencies.

16 The exceptions are C and O nuclei, which are less common by factors of ~ 250 and
~ 300, respectively.

17 Note that the self-absorbed synchrotron spectrum does not follow the Rayleigh-Jeans
form $\propto \nu^2$ because the source is far from thermodynamic equilibrium.


Radio-frequency bremsstrahlung and recombination lines  A valuable
diagnostic of comparatively dense ionized gas such as that of HII regions
is provided by observations of radio-frequency bremsstrahlung. By equation
(8.5) the emissivity of hot plasma is approximately independent of frequency
for $\nu \ll kT/h$. Consequently, the observed radio-frequency spectrum will
be flat if the plasma is optically thin. By contrast, we have seen that the
black-body spectrum $B_\nu(T)$ rises as $\nu^2$ in the Rayleigh-Jeans regime (see
equation (8.13)). Since the specific intensity that emerges from any body
that is in thermal equilibrium tends to the Planck function at the body's
temperature as the optical depth through the body becomes large, it is clear
that at sufficiently low frequency, every thermal plasma must become optically
thick to bremsstrahlung. The value of the frequency $\nu_1$ at which the
spectrum passes from the form $f_\nu \simeq$ constant characteristic of bremsstrahlung,
to the Rayleigh-Jeans form $f_\nu \propto \nu^2$ that holds below $\nu_1$, allows one to
determine the emission measure of the plasma $\int ds\,Z^2n_en_i$. The plasma's
temperature is the observed brightness temperature at $\nu < \nu_1$.

In a plasma with temperature $T \sim 10^4$ K, significant numbers of free
electrons will be captured by protons into states that have values of the
principal quantum number $n \gtrsim 50$. The highly excited hydrogen atoms
thus formed are liable subsequently to decay by cascading down through
states of ever smaller $n$, each decay being accompanied by the emission of
an appropriate photon. Quantitatively, on moving from the level $n + \delta n$ to
the level $n$, an atom emits a photon of frequency

$$\nu = 3.29\times10^{15}\left(\frac{1}{n^2} - \frac{1}{(n + \delta n)^2}\right){\rm Hz} \simeq 6.58\left(\frac{100}{n}\right)^3\delta n\ {\rm GHz}. \qquad (8.32)$$


Analysis of the radio recombination lines formed in this way yields a 
number of astrophysically interesting quantities. 
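
To see how good the large-$n$ approximation in equation (8.32) is, the exact and approximate forms can be compared directly. This is a minimal sketch using the two numerical coefficients quoted in the text; the function names are illustrative.

```python
RYDBERG_HZ = 3.29e15  # coefficient of equation (8.32), Hz

def line_frequency_exact(n, dn=1):
    """Frequency (Hz) of the transition from level n + dn to level n."""
    return RYDBERG_HZ * (1.0 / n ** 2 - 1.0 / (n + dn) ** 2)

def line_frequency_approx(n, dn=1):
    """Large-n approximation 6.58 (100/n)^3 dn GHz, returned in Hz."""
    return 6.58e9 * (100.0 / n) ** 3 * dn

exact = line_frequency_exact(100)
approx = line_frequency_approx(100)
print(f"exact  = {exact / 1e9:.3f} GHz")
print(f"approx = {approx / 1e9:.3f} GHz")
print(f"fractional error = {approx / exact - 1.0:.3%}")
```

For $n = 100$ and $\delta n = 1$ the approximation overestimates the frequency by about 1.5 per cent, which is far larger than typical line widths, so the exact expression is the one to use when identifying lines.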

An estimate of the plasma's temperature $T$ can be obtained from the
ratio

$$q \equiv \frac{b}{c}\,\frac{I_l}{I_c}, \qquad (8.33)$$

where $I_l$ is the peak intensity of the line, $I_c$ is the intensity of the bremsstrahlung
continuum at the line's central frequency and $b/c$ is the ratio of
the line's velocity-width to the speed of light. The precise temperature dependence
of $q$ is not easy to calculate because it is sensitive to departures
from thermodynamic equilibrium in the plasma, but crudely $q \propto T^{-1}$.

A quantity that is easier to estimate from radio recombination lines is
the helium abundance of the plasma. When a He⁺ ion captures an electron
into a level $n \gtrsim 50$, the highly excited He atom that results is scarcely
distinguishable from a similarly excited H atom; the cloud formed by the
inner electron of the He atom is of negligible size compared to an orbital
with $n \gtrsim 50$ and effectively screens one of the nucleus's two units of charge.
The frequencies at which it emits photons are different, however, by virtue
of the difference in the reduced masses $\mu_{\rm H} = m_e/(1 + m_e/m_p)$ and $\mu_{\rm He} =
m_e/(1 + \frac14 m_e/m_p)$ of the two atoms. Hence in the optically-thin limit, the
ratio of the intensities of H and He recombination lines for a common value
of $n$ is accurately proportional to the ratio of the number of H⁺ and He⁺ ions
in the telescope beam. Moreover, it turns out that in HII regions excited
by stars of spectral type earlier than O7, the volume within which nearly all
hydrogen is in the form of H⁺ coincides with the volume within which all
helium is in the form He⁺. Consequently, for such nebulae the abundance
by number of helium is directly given by the observed ratio of recombination
line strengths. Independent estimates are provided by each measurable pair
of corresponding H and He lines.
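
The size of the frequency offset between corresponding H and He lines follows from the reduced masses alone, since the line frequencies scale in proportion to the reduced mass. The sketch below evaluates it, taking the helium nucleus to have four proton masses as in the expression for $\mu_{\rm He}$ above; the function and variable names are illustrative.

```python
M_E_OVER_M_P = 1.0 / 1836.15267  # electron-to-proton mass ratio
C_KMS = 299792.458               # speed of light, km/s

def reduced_mass_over_me(nuclear_mass_in_proton_masses):
    """Reduced mass of the electron-nucleus system in units of m_e."""
    return 1.0 / (1.0 + M_E_OVER_M_P / nuclear_mass_in_proton_masses)

mu_h = reduced_mass_over_me(1.0)
mu_he = reduced_mass_over_me(4.0)  # He nucleus taken as 4 proton masses

# Line frequencies are proportional to the reduced mass, so the He line
# lies at a slightly higher frequency than the corresponding H line.
fractional_shift = mu_he / mu_h - 1.0
velocity_offset_kms = C_KMS * fractional_shift

print(f"fractional frequency shift = {fractional_shift:.4e}")
print(f"equivalent velocity offset = {velocity_offset_kms:.1f} km/s")
```

The offset corresponds to a little over 120 km s⁻¹, comfortably larger than the line widths of most HII regions, so the two lines of a pair can usually be separated.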


Dispersion and Rotation Measures  When plane polarized radiation
of wavelength $\lambda$ propagates through a plasma in which there is a component
$B_\parallel$ of magnetic field parallel to the direction of propagation, the radiation's
plane of polarization slowly rotates. This phenomenon is called Faraday
rotation. The angle of rotation $\psi$ is wavelength-dependent, being given by

$$\psi = R_M\lambda^2, \qquad (8.34)$$

where

$$R_M \equiv \frac{e^3}{8\pi^2\epsilon_0m_e^2c^3}\int ds\,n_eB_\parallel = 2.64\times10^{-13}\int ds\,n_eB_\parallel\ {\rm m^{-2}}. \qquad (8.35)$$

The coefficient $R_M$ is called the rotation measure of the path.

Radiation from pulsars and all sorts of synchrotron sources is significantly
polarized. When measurements of the direction of the electric vector ${\bf E}$
are available at several wavelengths, it is found to be wavelength-dependent.
When the position angle $\phi$ of ${\bf E}$ is plotted against $\lambda^2$ an approximate straight
line is obtained, and by equation (8.34) the slope of this line is $R_M$.
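
The straight-line fit described here can be sketched as an ordinary least-squares regression of position angle on $\lambda^2$. The example below is an illustration only: the function name `fit_rotation_measure` and the synthetic data are hypothetical, and the sketch ignores the ambiguity of multiples of $\pi$ in measured position angles, which real data require one to resolve.

```python
def fit_rotation_measure(wavelengths_m, angles_rad):
    """Least-squares slope and intercept of position angle against lambda^2.

    Returns (R_M in rad m^-2, intrinsic position angle in rad).
    """
    x = [lam ** 2 for lam in wavelengths_m]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(angles_rad) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, angles_rad))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Synthetic measurements: intrinsic angle 0.3 rad, R_M = 30 rad m^-2.
wavelengths = [0.18, 0.20, 0.22, 0.25]
angles = [0.3 + 30.0 * lam ** 2 for lam in wavelengths]

rm, phi0 = fit_rotation_measure(wavelengths, angles)
print(f"R_M = {rm:.2f} rad m^-2, intrinsic angle = {phi0:.3f} rad")
```

The intercept of the fit is the position angle the radiation had on leaving the source, which is itself of interest because it records the orientation of the field in the emitting region.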

If the distance to the source is known, $R_M$ yields an estimate of the
mean value of the product $n_eB_\parallel$ between the observer and the source. If
the mean value of $n_e$ is known from some other measurement, for example
from the timing of pulsars (see below), then estimates of $n_e$ and $B_\parallel$ can
be obtained. For example, Rand & Kulkarni (1989) obtained the estimate
$B \simeq 0.6$ nT for the local magnetic field strength cited above by measuring
the rotation measures of 200 pulsars.

The refractive index of plasma differs from unity; at frequency $\nu$ it is

$$n(\nu) = \sqrt{1 - \nu_p^2/\nu^2}, \qquad (8.36)$$

where $\nu_p$ is the plasma frequency

$$\nu_p \equiv \sqrt{\frac{n_ee^2}{4\pi^2\epsilon_0m_e}} = 8.97\left(\frac{n_e}{\rm m^{-3}}\right)^{1/2}{\rm Hz}. \qquad (8.37)$$

On account of the frequency dependence of $n$, the group velocity $c_g = d\omega/dk$
of radiation is frequency-dependent. One finds

$$c_g = \frac{c}{n}\left(1 + \frac{d\ln n}{d\ln\nu}\right)^{-1} = c\sqrt{1 - \nu_p^2/\nu^2}. \qquad (8.38)$$

The time for a pulse of central frequency $\nu$ to arrive from a source at distance
$D$ is

$$t(\nu) = \int_0^D \frac{ds}{c_g(\nu)} = \frac1c\int_0^D ds\left[1 + \frac{\nu_p^2}{2\nu^2} + \cdots\right] \simeq \frac{D}{c} + \frac{e^2}{8\pi^2\epsilon_0m_ec}\,\frac{D_M}{\nu^2}, \qquad (8.39)$$

where $D_M \equiv \int ds\,n_e$ is the dispersion measure of the source and $\nu \gg \nu_p$
has been assumed. $D_M$ can be determined for the line of sight to a pulsar
by fitting to equation (8.39) the times of the pulses observed at different
central frequencies. Numerically, if $D_M$ is expressed in units of ${\rm m^{-3}\,pc}$, the
difference $\Delta t$ between the arrival times of a pulse at two frequencies is given
in ms and the frequencies are given in MHz, we have

$$D_M = 0.241\,\Delta t\,(\nu_2^{-2} - \nu_1^{-2})^{-1}. \qquad (8.40)$$
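
A minimal numerical sketch of these relations follows. It uses the coefficients 8.97 and 0.241 quoted above, with the dispersion measure in m⁻³ pc, delays in ms and frequencies in MHz; the function names, the convention that the delay is that of the lower-frequency pulse relative to the higher-frequency one, and the sample numbers are assumptions made for illustration.

```python
import math

def plasma_frequency_hz(n_e_per_m3):
    """Plasma frequency in Hz for electron density n_e in m^-3."""
    return 8.97 * math.sqrt(n_e_per_m3)

def dispersion_measure(delay_ms, nu_low_mhz, nu_high_mhz):
    """Dispersion measure in m^-3 pc.

    delay_ms : arrival time at nu_low minus arrival time at nu_high, in ms
    """
    return 0.241 * delay_ms / (nu_low_mhz ** -2 - nu_high_mhz ** -2)

# Hypothetical pulsar: pulse at 400 MHz arrives 10 ms after that at 800 MHz.
dm = dispersion_measure(10.0, 400.0, 800.0)
print(f"D_M = {dm:.4e} m^-3 pc")

# Plasma frequency for a sample electron density of 3e4 m^-3.
print(f"nu_p = {plasma_frequency_hz(3.0e4):.1f} Hz")
```

The plasma frequency of the ISM is in the kHz range, so the condition $\nu \gg \nu_p$ assumed in the expansion holds by many orders of magnitude at radio-astronomical frequencies.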


8.1.5 γ-ray emission


As they speed through interstellar space, cosmic rays are liable to smash
into other nuclei at relativistic speed. Such traffic accidents have two observable
consequences: (i) the emission of γ-rays from the colliding particles;
(ii) the spallation, that is fragmentation, of one of the nuclei. The first
effect causes regions in which the ISM is densest to glow when observed by a
γ-ray telescope. Since the intensity $Q_\gamma$ with which γ-rays are emitted must
be proportional to the product $q_\gamma n$ of the local cosmic-ray flux $q_\gamma$ and the
density $n$ of the ISM, $n$ can be estimated by measuring $Q_\gamma$ if one knows
$q_\gamma$. It is frequently assumed that $q_\gamma$ varies only slowly from point to point
within the ISM because fast-moving cosmic-ray particles diffuse fairly fast.
This idea is the basis for the calibration of equation (8.28) to be discussed in
§9.6.1. Cosmic-ray induced spallation of nuclei increases the abundance of
light nuclei in cosmic rays relative to the abundance of brittle heavy nuclei.
By estimating what fraction of heavy nuclei have been broken up by collisions,
one can determine how long a typical cosmic ray has been streaming
through space.

Below 50 MeV γ-rays from the ISM are largely bremsstrahlung emission
from encounters between nuclei and electrons with γ-factors in the range 1 to
200. Photons which have gained energy by scattering off even more energetic
electrons (inverse-Compton scattering) also make a contribution to the
observed γ-ray fluxes. The mutual annihilation of positrons and electrons
gives rise to a prominent line at 0.511 MeV. Line radiation from decaying
nuclei contributes in the range 1 MeV < $E_\gamma$ < 50 MeV. Above 50 MeV
decaying pions make an important contribution to the observed fluxes.


8.1.6 Radiation by dust 


In §3.7 we studied the extinction and reddening of starlight by dust. We saw
that absorption is characterized by the extinction curve, which is a plot of
the ratio $A_\nu/A_J$ of the extinction at arbitrary frequency $\nu$ to the extinction
in some fiducial band, for example the J band. Dust does not merely absorb
light; it re-radiates much of the energy it absorbs. Here we discuss radiation
by dust and what can be learned from it about the ISM.

The key to understanding emission by dust is Kirchhoff's law, which
may be loosely phrased "good absorbers are good emitters." Quantitatively,
Kirchhoff's law states that if a body of temperature $T$ absorbs a fraction
$Q_\nu$ of the radiation of frequency $\nu$ that hits it, then the outgoing flux at
its surface is $Q_\nu B_\nu(T)$, where $B_\nu(T)$ is the Planck function. We apply this
law to a box containing a sample of the ISM. For simplicity we assume that
the grains in the box are all at the same temperature $T$, and that they have
a frequency-independent albedo $\gamma$ (see §3.7.1). Then the fraction of the
radiation of frequency $\nu$ entering the box on a given line of sight that is
absorbed in the box is [cf. equation (3.68)] $(1-\gamma)(1-10^{-0.4A_\nu})$, where $A_\nu$
is the extinction along that line of sight. So by Kirchhoff's law the flux out
of the box along the given line of sight is

$$(1-\gamma)\left(1-10^{-0.4A_\nu}\right)B_\nu(T). \tag{8.41}$$
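
To get a feel for equation (8.41), the following Python sketch evaluates the absorbed fraction and the resulting emission for given extinction, albedo and temperature. The function names and the sample values are illustrative assumptions only; SI units are used throughout.

```python
import math

H = 6.62607015e-34    # Planck constant, J s
K_B = 1.380649e-23    # Boltzmann constant, J/K
C = 2.99792458e8      # speed of light, m/s


def planck_nu(nu_hz, temp_k):
    """Planck function B_nu(T) in W m^-2 Hz^-1 sr^-1."""
    x = H * nu_hz / (K_B * temp_k)
    return 2.0 * H * nu_hz ** 3 / C ** 2 / math.expm1(x)


def dust_emission(nu_hz, temp_k, a_nu_mag, albedo):
    """Equation (8.41): (1 - albedo) * (1 - 10^(-0.4 A_nu)) * B_nu(T)."""
    absorbed = (1.0 - albedo) * (1.0 - 10.0 ** (-0.4 * a_nu_mag))
    return absorbed * planck_nu(nu_hz, temp_k)


# Hypothetical example: 20 K grains observed at 3 THz (100 micron).
nu, temp = 3.0e12, 20.0
for a_nu in (0.01, 1.0, 100.0):
    print(a_nu, dust_emission(nu, temp, a_nu, albedo=0.0) / planck_nu(nu, temp))
```

For small extinction the emission grows linearly with $A_\nu$ (the absorbed fraction is about $0.92A_\nu$ for zero albedo), while for large extinction it saturates at $(1-\gamma)B_\nu(T)$.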


It is useful to reformulate this physical argument in the language of
radiative transfer introduced in §8.1.4. Suppose the grains have a number
density $n(x)$, where $x$ measures position along a line of sight from the observer,
and an average absorption cross-section per particle $K_a(\nu)$ at frequency $\nu$.
Then, on traversing a slab of thickness $dx$ a beam of light with wavevector
$\mathbf{k}$ and intensity $I_{\mathbf{k}}$ suffers a change in intensity

$$dI_{\mathbf{k}} = \left[S_{\mathbf{k}} - n(x)K_a(\nu)I_{\mathbf{k}}\right]dx, \tag{8.42}$$

where $\nu = ck/2\pi$ and $S_{\mathbf{k}}\,dx$ is the intensity added to the beam by scattering
and by emission. Now the contribution from scattering is

$$S_s(\mathbf{k}) = -n(x)K_s(\nu)\left[I_{\mathbf{k}} - \int d\mu\, I_{\mathbf{k}'}\,p(\mu)\right], \tag{8.43}$$


where $K_s$ is a coefficient that characterizes the reflectivity of the grains. The
first term in the square brackets of equation (8.43) describes scattering out of
the direction of $\mathbf{k}$ while the integral describes scattering into the direction of
$\mathbf{k}$; the variable $\mu = \mathbf{k}'\cdot\mathbf{k}/k^2$ is the cosine of the angle between the wavevectors
$\mathbf{k}'$ and $\mathbf{k}$ of light before and after scattering. The function $p$ describes whether
the scattering is isotropic ($p$ = constant), or predominantly forward, etc. Let
$S_e(\mathbf{k})$ be the contribution to $S_{\mathbf{k}}$ in equation (8.42) from emission and define
the optical depth $\tau_\nu$ by

$$d\tau_\nu = n(x)K_a(\nu)\,dx. \tag{8.44}$$

Then equation (8.42) may be written

$$\frac{dI_{\mathbf{k}}}{d\tau_\nu} + I_{\mathbf{k}} + \frac{K_s}{K_a}\left[I_{\mathbf{k}} - \int d\mu\, I_{\mathbf{k}'}\,p(\mu)\right] = \frac{S_e(\mathbf{k})}{nK_a}. \tag{8.45}$$


In the limit of large optical depth, we know that $I_{\mathbf{k}}$ tends to the Planck
function $B_\nu(T)$. Moreover, the contents of the square bracket in (8.45) then
vanish since as much radiation is scattered into the beam as is scattered out
of it. Clearly if $I_{\mathbf{k}} = B_\nu(T)$ is to satisfy (8.45), we must have

$$S_e(\mathbf{k}) = n(x)K_a(\nu)B_\nu(T). \tag{8.46}$$


To determine the relation between $\tau_\nu$ and $A_\nu$ we assume that $I_{\mathbf{k}}$ is
non-zero only for $\mathbf{k}$ parallel to the direction from some star and integrate equation
(8.45) in the limit $B_\nu \to 0$; this is the appropriate limit if the dust is too
cold to emit significantly at frequency $\nu$, as will be the case if $\nu$ lies in the
visual band and T ~ 10 K. Then we find

$$I_\nu(\tau_\nu) = I_\nu(0)\,e^{-(1+K_s/K_a)\tau_\nu}. \tag{8.47}$$

Hence

$$A_\nu = -2.5\log_{10}\left[I_\nu(\tau_\nu)/I_\nu(0)\right] = 2.5\log_{10}e^{(1+K_s/K_a)\tau_\nu} = 1.086\,(1+K_s/K_a)\,\tau_\nu. \tag{8.48}$$

This shows that $A_\nu$ is proportional to $\tau_\nu$, which is itself [see equation (8.44)]
proportional to the column density of grains along the line of sight.
Moreover, the ratio $A_\nu/A_{\nu'}$ of the extinctions at two frequencies is just the ratio
of the values taken by $(1+K_s/K_a)$ at those frequencies. Consequently,
if the distribution of grain properties were universal, the extinction curve
(Figure 3.17) would be the same down all lines of sight.

Figure 8.14 The reddening E(B − V) down various lines of sight is
approximately proportional to the column density of hydrogen, $N(\mathrm{H_{tot}})$,
along that line of sight. The straight line is given by equation (8.49).
[From data published in Bohlin et al. (1978)]
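
Equations (8.47) and (8.48) are easily checked numerically. In the sketch below the function names are hypothetical, and the conversion from albedo to $K_s/K_a$ assumes that the albedo is defined as $K_s/(K_s+K_a)$, which is the usual convention but is not stated explicitly above.

```python
import math


def ks_over_ka_from_albedo(albedo):
    """K_s/K_a, assuming albedo = K_s / (K_s + K_a)."""
    return albedo / (1.0 - albedo)


def transmitted_fraction(tau_nu, ks_over_ka):
    """I(tau)/I(0) from equation (8.47)."""
    return math.exp(-(1.0 + ks_over_ka) * tau_nu)


def extinction_mag(tau_nu, ks_over_ka):
    """A_nu in magnitudes from equation (8.48)."""
    return 2.5 * math.log10(math.e) * (1.0 + ks_over_ka) * tau_nu


ratio = ks_over_ka_from_albedo(0.6)        # B-band albedo quoted in §3.7.1
print(ratio)                                # about 1.5
print(extinction_mag(1.0, 0.0))             # about 1.086 mag for pure absorption
print(extinction_mag(1.0, ratio))           # scattering raises the extinction
```

The factor 1.086 is simply $2.5\log_{10}e$, so one magnitude of extinction corresponds to an effective optical depth $(1+K_s/K_a)\tau_\nu$ of about 0.92.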

Figure 8.14 plots for various lines of sight the reddening E(B − V) (see
§3.7.1) against $N(\mathrm{H_{tot}})$, the column density of hydrogen in all its forms.
Although there is appreciable scatter of the points, the linear relationship

$$N(\mathrm{H_{tot}}) = 5.8\times10^{25}\,E(B-V)\ \mathrm{m^{-2}\,mag^{-1}}, \tag{8.49}$$

provides a fair summary of the data. With equation (3.63), this implies that

$$N(\mathrm{H_{tot}}) = 1.9\times10^{25}\,A_V\ \mathrm{m^{-2}\,mag^{-1}}. \tag{8.50}$$


Thus, since we have shown that $A_V$ is proportional to the column density of
dust, to a first approximation, a given mass of gas contains a characteristic 
mass of dust, independent of whether the gas is in molecular or atomic form. 
Closer examination of data for individual interstellar clouds shows that fast- 
moving clouds tend to contain less dust than clouds that move more slowly 
relative to the surrounding material (Spitzer 1982). This difference is thought 
to arise because fast-moving clouds have been recently shocked, and dust 
grains tend to fall to pieces in the hot gas downstream from a shock wave. 
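
Equations (8.49) and (8.50) amount to a simple unit conversion, sketched here in Python. The function names are hypothetical and the scatter visible in Figure 8.14 is ignored, so the results should be read as rough estimates only.

```python
N_PER_EBV = 5.8e25   # atoms m^-2 mag^-1, equation (8.49)
N_PER_AV = 1.9e25    # atoms m^-2 mag^-1, equation (8.50)


def hydrogen_column_from_reddening(e_bv_mag):
    """Total hydrogen column density (m^-2) from E(B-V) in magnitudes."""
    return N_PER_EBV * e_bv_mag


def hydrogen_column_from_extinction(a_v_mag):
    """Total hydrogen column density (m^-2) from A_V in magnitudes."""
    return N_PER_AV * a_v_mag


print(hydrogen_column_from_reddening(0.3))    # hypothetical E(B-V) = 0.3 mag
print(hydrogen_column_from_extinction(1.0))   # one magnitude of visual extinction
print(N_PER_EBV / N_PER_AV)                   # implied A_V / E(B-V), about 3.05
```

The ratio of the two coefficients, about 3.05, is the value of $A_V/E(B-V)$ implied by the rounded numbers and is close to the commonly quoted value of 3.1.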

Equations (8.46) and (8.48) establish an intimate connection between
emission by dust and extinction. In fact, measurements of $A_\nu$, the albedo
$\gamma_\nu$, and the emissivity enable one to predict the temperature of the emitting
dust.

Several lines of argument indicate that there are significant numbers of 
grains comparable in size to the wavelength of blue light. For example, in 
§3.7.1 we mentioned that in the B band the albedo is high ~ 0.6 and that 
there is evidence for forward rather than isotropic scattering. Grains much 
smaller than 500 nm would scatter little light, and do so isotropically. 

Grains that are a fraction of a micron in size radiate fairly efficiently
at temperatures much in excess of tens of Kelvin, with the result that they
are expected to be cool ($T \lesssim 20$ K). Now the Planck function $B_\nu(T)$ peaks
at the Wien frequency $\nu_W = 2.82kT/h$, which corresponds to wavelength
$\lambda_W = 0.25\,(20\,\mathrm{K}/T)$ mm. So by equation (8.46) a grain at 20 K will radiate
most strongly near 200 μm, i.e., in the sub-mm band.
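
The numbers in this paragraph can be reproduced with a short Python sketch. The function names are hypothetical. Note that $\lambda_W$ here is the wavelength $c/\nu_W$ corresponding to the peak of $B_\nu$, which is not the wavelength at which $B_\lambda$ peaks.

```python
H = 6.62607015e-34    # Planck constant, J s
K_B = 1.380649e-23    # Boltzmann constant, J/K
C = 2.99792458e8      # speed of light, m/s


def wien_frequency_hz(temp_k):
    """Frequency at which B_nu(T) peaks: nu_W = 2.82 k T / h."""
    return 2.82 * K_B * temp_k / H


def wien_wavelength_mm(temp_k):
    """Wavelength c / nu_W in millimetres."""
    return C / wien_frequency_hz(temp_k) * 1.0e3


for temp in (10.0, 20.0, 1000.0):
    print(temp, wien_frequency_hz(temp), wien_wavelength_mm(temp))
```

A 20 K grain gives $\lambda_W \simeq 0.26$ mm, while a grain briefly heated to 1000 K peaks at a few microns.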

Infrared astronomy was revolutionized by the launch in January 1983
of the Infrared Astronomical Satellite (IRAS). IRAS carried a 0.57 m diameter
mirror, and thermal radiation from the optics was suppressed by cooling
them to ≲ 10 K with liquid helium. IRAS surveyed almost the entire sky
with an angular resolution of ~ 0.5 arcmin and measured fluxes in bands
centred on 12 μm, 25 μm, 60 μm and 100 μm. The spectral region longward
of 100 μm, which, as we have just seen, is where many grains are expected to
radiate most strongly, was surveyed during 1989-1995 by the Diffuse Infrared 
Background Experiment (DIRBE) aboard the Cosmic Background Explorer 
satellite (COBE). Broad summaries of the findings of IRAS can be found in 
Soifer, Neugebauer & Houck (1987) and Beichman (1987), while Boulanger 
& Pérault (1987) and Boulanger et al. (1996) give details of results from 
IRAS and COBE that relate to dust in the Milky Way. 

IRAS detected diffuse infrared radiation in all four of its wavebands.
This demonstration that diffuse interstellar dust radiates significantly at
wavelengths as short as 12 μm and 25 μm precipitated a radical restructuring
of models of interstellar dust. It is now believed that the ISM contains
extremely small grains, containing < 100 atoms, in addition to the 'classical'
dust grains, 0.1 μm in size and containing ≳ 10000 atoms, that must
be responsible for the high albedo and forward scattering of blue light
discussed above. These small grains dominate the signal in the shorter IRAS
wavebands for the following reason. When a UV photon is absorbed by a
grain, its ~ 10 eV of energy heats the grain. A grain with N atoms has
~ 3N vibrational degrees of freedom and therefore maximum specific heat
~ 3Nk.¹⁸ Consequently, the temperature to which the absorption of a single
UV photon would heat a grain that is initially at T = 0 is

$$T = \frac{10\,\mathrm{eV}}{3Nk} = \frac{38670}{N}\,\mathrm{K}. \tag{8.51}$$


For example, if N = 40 a 10eV photon will heat the grain from absolute zero 
to nearly 1000 K! A grain as warm as this will soon cool to a very much lower 
temperature, but in doing so it will radiate most of its 10eV windfall at a few 
microns. This process of randomly heating dust grains to high temperatures 
for short periods is called stochastic heating. 
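
The estimate of the peak temperature reached after a single photon absorption can be reproduced as follows. This is a sketch under the same assumptions as the text (grain initially at T = 0, classical specific heat 3Nk); the function name is hypothetical.

```python
EV = 1.602176634e-19   # joules per electron volt
K_B = 1.380649e-23     # Boltzmann constant, J/K


def peak_temperature_k(n_atoms, photon_energy_ev=10.0):
    """Temperature reached when a grain of n_atoms atoms, initially at T = 0,
    absorbs one photon, assuming a classical specific heat of 3 N k."""
    return photon_energy_ev * EV / (3.0 * n_atoms * K_B)


for n in (40, 100, 10000):
    print(n, peak_temperature_k(n))
```

The classical 0.1 μm grains, with upwards of 10000 atoms, are heated by only a few kelvin per photon, which is why stochastic heating matters only for the very smallest grains.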

¹⁸ Modes with frequencies $\nu$ so high that $h\nu > kT$ will contribute less than the classical
3k to the specific heat. We assume that T is large enough for this phenomenon to be
negligible.

Strong evidence for the importance of the stochastic heating of very
small grains is provided by observations of reflection nebulae. Sellgren (1984)
showed that the near-infrared spectra of three reflection nebulae, NGC 7023,
NGC 2023 and NGC 2068, are independent of distance from the illuminating
stars, and can be fitted by a model spectrum in which emission features are
superposed on the Planck function for T ~ 1000 K. Moreover, the integrated
2.2 μm and 3.8 μm fluxes from the nebulae significantly exceed the fluxes at
these wavelengths from the illuminating stars. This last observation proves 
that the observed fluxes are emitted rather than reflected by grains, and the 
large temperature characteristic of the spectrum proves that the grains are 
very small. The stochastic nature of grain heating follows from the failure 
of the spectrum of the emitted radiation to redden with increasing distance 
from the illuminating star: any body whose temperature is determined by 
a balance between thermal radiation from its surface and radiation received 
from a star distance r away, will become colder as r is increased. On the 
other hand, the mean temperature of radiation from a stochastically heated 
grain will be independent of r; removing the grain from the illuminating 
star merely decreases the frequency with which it will be hit by a photon 
and therefore the fraction of the time during which it will be effectively 
radiating. 

A grain containing only N ~ 50 atoms is as much a large molecule as a 
grain. Léger & Puget (1984) have suggested that many of these objects may 
be polycyclic aromatic hydrocarbons (PAHs), which are fragments of 
graphite sheets onto which hydrogen and perhaps some other atoms have 
adhered here or there. 

In view of the discoveries just described, the derivation of equation (8.46) 
above is over-simplified in that it assumes that all grains are at the same tem- 
perature. Observations discussed below show that at a given point in the 
ISM grains of different sizes have different temperatures. Fortunately, the 
derivation given can be generalized to include a range of grain sizes and tem- 
peratures simply by treating the ISM as a superposition of idealized media, 
in each of which all grains have the same size and temperature. Let $n^{(i)}$ and
$K_a^{(i)}$ be the density of the $i$th such medium and the extinction coefficient of
its grains. Then equation (8.46) generalizes to

$$S_e(\mathbf{k}) = \sum_i n^{(i)}(x)\,K_a^{(i)}(\nu)\,B_\nu(T^{(i)}), \tag{8.52}$$

where, as always, $\nu = ck/2\pi$.

8.2 The ISM in Disk Galaxies


Observations in the Hα and 21-cm lines of HI and in the mm-lines of CO
have yielded quite detailed maps of the distribution and kinematics of the 
cooler components of the ISM in over a hundred nearby galaxies — see Young 
& Scoville (1991) and Huchtmeier & Richter (1989) for references to the lit- 
erature. Our knowledge of the hot components of the ISM of nearby galaxies 
is much more rudimentary, although useful data are available from radio- 
continuum observations, and, to a lesser degree, X-ray observations. 

Some understanding of how surface densities and velocities of gas are 
extracted from line observations is necessary for the correct interpretation of 
the final data. The raw data are gathered by instruments of several types. 
Traditionally Hα measurements are made with a long-slit spectrograph but
scanning Fabry-Perot interferometers are now increasingly used because they
yield full two-dimensional information. The first radio-line observations were 
made with single dish instruments. In the early 1970s aperture-synthesis tele- 
scopes started taking 21-cm data, while the first aperture-synthesis observa- 
tions in the mm band date from the late 1980s. Consequently, the fraction 
of the available data that has the poor angular resolution characteristic of 
a single-dish instrument is much larger in the case of mm lines than in the 
case of the 21-cm line. 

With the Very Large Array (VLA), a giant aperture-synthesis telescope
in New Mexico, it is possible to observe galaxies with a beam as small as
~ 1 arcsec. However, for most galaxies an acceptable signal-to-noise ratio can
be achieved even by the VLA at such high resolution only after inconveniently
long integration times. Consequently, most aperture-synthesis observations
have been made with beams of diameter ~ 20 arcsec. Many of the best
21-cm data derive from the Westerbork telescope in the Netherlands. Hα
measurements can be made at angular resolutions of ~ 1 arcsec but suffer
from the severe limitation that only a very small fraction of the gas in galaxies
emits Hα radiation, and the Hα-emitting gas is usually sparsely distributed.

A significant weakness of every aperture-synthesis telescope should be 
noted: it is insensitive to large-scale smooth distributions of material, since 
these produce no fringes even on the telescope’s smallest base-lines. More- 
over, the sensitivity of the largest single-dish instruments is significantly 
greater than that obtainable with even the largest aperture-synthesis tele- 
scopes. Consequently, single-dish observations are still useful for determining 
how much cool gas a galaxy contains and how the gas is distributed over ve- 
locity. 

Observations made with either a scanning Fabry—Perot interferometer 
or an aperture-synthesis telescope yield a data cube: the intensity of ra- 
diation received at each frequency from each position in the field of view. 
These data are often displayed as a series of channel maps; each channel 
map shows the intensity received from every point in the field in a certain 
narrow range of frequencies. By the Doppler effect, each such range of
frequencies corresponds to a range of velocities. A typical velocity range might
be 20 km s$^{-1}$ wide, and ~ 30 channels might be employed to ensure that all
gas in the galaxy, no matter what its heliocentric velocity, will register in at
least one channel. Figure 8.15 shows 25 channel maps obtained in the 21-cm
line by the Westerbork aperture-synthesis telescope for the Sc galaxy NGC
5033. At the bottom right-hand corner of each map is given the heliocentric
velocity corresponding to the middle of the channel associated with that
particular map. Consider now how one might derive for this galaxy the HI
surface density shown in Figure 8.16 and the contour map of the lines of
constant velocity shown in Figure 8.17. Essentially identical procedures would
be involved in the interpretation of channel maps obtained in any spectral
line.

Figure 8.15 Twenty-five channel maps from the Westerbork aperture-synthesis
telescope for the Sc galaxy NGC 5033. Each channel map gives the brightness
temperature measured through a 129 kHz-wide filter centered on the frequency
corresponding to the heliocentric velocity quoted in km s$^{-1}$ at bottom right.
The sequence of maps should be read from top right to bottom left. Notice the
asymmetry of many of the profiles; this is evidence that the disk
of NGC 5033 is warped. [After Bosma (1978)]

Figure 8.16 Map of the HI column density in the Sc galaxy NGC 5033
superposed on a photograph of the system. Notice that the gas appears to
cover a much larger area than the optical galaxy. [After Bosma (1978)]
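
The correspondence between channel width in frequency and in velocity can be made concrete with a short Python sketch. It assumes the non-relativistic Doppler formula in the radio convention and a rest frequency of 1420.40575 MHz for the 21-cm line; the function names are hypothetical.

```python
C_KMS = 299792.458         # speed of light, km/s
NU_HI_MHZ = 1420.40575     # rest frequency of the 21-cm line, MHz


def channel_width_kms(channel_width_khz, rest_mhz=NU_HI_MHZ):
    """Velocity width of a channel of the given frequency width."""
    return C_KMS * (channel_width_khz * 1.0e-3) / rest_mhz


def radial_velocity_kms(observed_mhz, rest_mhz=NU_HI_MHZ):
    """Radial velocity (positive for recession) in the radio convention."""
    return C_KMS * (rest_mhz - observed_mhz) / rest_mhz


print(channel_width_kms(129.0))        # filter width quoted for Figure 8.15
print(radial_velocity_kms(1416.0))     # hypothetical observed frequency
```

The 129 kHz filters of Figure 8.15 therefore correspond to channels roughly 27 km s$^{-1}$ wide, comparable to the typical width quoted above.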

The first step is to isolate the line radiation from the continuum radia- 
tion. In this step, one examines each spectrum to decide which channels are 
line-free and then subtracts the average intensity of these continuum channels 
from each of the channels that contain line radiation. The next step involves 
assigning a typical velocity to each point in the galaxy by identifying the fre- 
quency at which the line radiation has peak intensity, or some other suitably 
chosen central frequency. Finally, one integrates over frequency the total 
line emission from each point to determine, via equation (8.22) for example, 
the column density at that point on the sky. A glance at Figure 8.18 shows 
that each of these steps is subject to an uncertainty that increases as the 
signal-to-noise ratio of the spectrum declines. When line centers and column 
densities have been assigned in this way to a grid of points on the sky, it is
straightforward to construct HI density and velocity contour maps such as
those shown for NGC 5033 and M81 in Figures 8.17 and 8.19.

Figure 8.17 Contours of constant HI velocity in NGC 5033. Notice the
curvature of the kinematic principal axes. [After Bosma (1978)]

Figure 8.18 A typical velocity profile constructed from the channel
maps shown in Figure 8.15. One possible estimate of the line center
is marked V, and an estimate of the area of the line profile associated
with the emission is shaded. [After Bosma (1978)]
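
The three reduction steps just described (continuum subtraction, assignment of a characteristic velocity, and integration over the line) can be sketched for a single spectrum as follows. This is a minimal illustration, not the procedure of any particular reduction package: the function name and the synthetic spectrum are hypothetical, the channels are assumed equally spaced, and the column density uses the standard optically thin 21-cm conversion of 1.823 × 10²² atoms m⁻² per K km s⁻¹, which we assume plays the role of equation (8.22).

```python
K_TO_COLUMN = 1.823e22   # atoms m^-2 per (K km/s), optically thin HI (assumed)


def analyse_spectrum(velocities_kms, t_b_kelvin, line_free):
    """Reduce one spectrum of a data cube to (W, v_peak, v_mean, N_HI).

    velocities_kms : channel-centre velocities, equally spaced (km/s)
    t_b_kelvin     : brightness temperature in each channel (K)
    line_free      : True for channels judged to contain only continuum
    """
    # Step 1: subtract the mean of the line-free (continuum) channels.
    continuum = [t for t, free in zip(t_b_kelvin, line_free) if free]
    baseline = sum(continuum) / len(continuum)
    line = [(v, t - baseline)
            for v, t, free in zip(velocities_kms, t_b_kelvin, line_free)
            if not free]

    # Step 2: characteristic velocity, here both the peak channel and the
    # intensity-weighted mean velocity.
    v_peak = max(line, key=lambda vt: vt[1])[0]
    total = sum(t for _, t in line)
    v_mean = sum(v * t for v, t in line) / total

    # Step 3: integrate over velocity and convert to a column density.
    dv = abs(velocities_kms[1] - velocities_kms[0])
    w = total * dv                       # K km/s
    return w, v_peak, v_mean, K_TO_COLUMN * w


# Synthetic spectrum: 0.5 K continuum plus a line in channels 4 to 7.
velocities = [900.0 + 20.0 * i for i in range(12)]
t_b = [0.5] * 4 + [1.5, 3.5, 3.5, 1.5] + [0.5] * 4
flags = [True] * 4 + [False] * 4 + [True] * 4
print(analyse_spectrum(velocities, t_b, flags))
```

Applying such a function to every pixel of the cube yields the column-density and velocity maps; in practice the choice of line-free channels is the step most sensitive to noise.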

Notice that a high-quality line profile contains much more information
than the values of the total column density and central velocity just
discussed. If the profile is approximately Gaussian, it is natural to measure
its dispersion. If it is double-peaked, two characteristic velocities should be
extracted, and so on. In general the number of parameters that should be
extracted from a line profile increases with the signal-to-noise ratio of the
data, and the complexity of the simplest galaxy model that fully exploits the
extracted numbers will likewise increase with the quality of the data. (See
§11.1.2 for a discussion of this type of problem.)

Figure 8.19 Contours of constant HI velocity in M81 overlaid on a gray-scale
representation of the surface density of HI. The scale at top gives column
densities in units of $10^{24}$ atoms m$^{-2}$. Notice that in this high-resolution map,
irregular motions associated with spiral arms are evident (cf. Figure 8.17).
A central hole in the HI distribution is evident and the kinematic axes are
curved. [After Rots (1975)]

Data cubes typically lack the spatial resolution of photographs, so that
interpreting line data can be likened to trying to make out the structure
of a galaxy from a plate which is badly out of focus. The natural measure
of the resolution of observations of some galaxy is the ratio $R_d/B$ of the
scale-length $R_d$ of the disk [see equation (4.20)] to the telescope's half-power
beam width $B$. Radial density and velocity profiles will not be significantly
affected by beam-smearing provided $R_d/B \gtrsim 2$ (Bosma 1978). The sample
of 16 nearby disk galaxies studied optically and in the 21-cm line by Wevers,
van der Kruit & Allen (1986) will give an idea of the restrictiveness of this
condition; Wevers et al.'s values of $R_d$ ranged from 25 arcsec to 140 arcsec,
while their beams had 27 arcsec < $B$ < 60 arcsec. Thus a study such as that
of Wevers et al. will not always yield accurate radial profiles. Moreover, any
structure smaller than $R_d$, for example spiral arms, will be heavily smoothed
by the beam in all but the nearest galaxies.
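
The resolution criterion is trivial to apply, as the following Python sketch shows. The function names are hypothetical, and the pairings of scale-length and beam width below simply combine the extremes quoted above; they are not the actual pairings for individual galaxies in the Wevers et al. sample.

```python
def resolution_ratio(scale_length_arcsec, beam_arcsec):
    """Ratio R_d / B of disk scale-length to half-power beam width."""
    return scale_length_arcsec / beam_arcsec


def profile_is_reliable(scale_length_arcsec, beam_arcsec, threshold=2.0):
    """True if beam-smearing should not significantly affect radial profiles."""
    return resolution_ratio(scale_length_arcsec, beam_arcsec) >= threshold


# Hypothetical combinations of the extreme values quoted in the text.
for r_d, beam in [(25.0, 60.0), (25.0, 27.0), (140.0, 60.0), (140.0, 27.0)]:
    print(r_d, beam, resolution_ratio(r_d, beam), profile_is_reliable(r_d, beam))
```

Only the galaxies with the largest angular scale-lengths pass the test comfortably; the smallest would fail it even with the narrowest beam.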

Low-resolution or noisy line data are best interpreted by a model-fitting 
procedure in which one attempts to optimize the fit between the observations 
and pseudo-data derived from a suitably parameterized model galaxy. The
'tilted-ring' model to be described in §8.2.4 below is frequently employed for
this purpose. 


8.2.1 Global measures 


HI and H2 in disk galaxies. Since the total masses $M_{\rm HI}$ and $M_{\rm H_2}$ of
HI and H2 in a galaxy can be derived from data of low spatial resolution,
these masses have been determined for several hundred galaxies. The optical
luminosities and values of $M_{\rm HI}$ and $M_{\rm H_2}$ of these galaxies all cover enormous
ranges. Consequently, the absolute values of the gas masses are of less interest
than their relative magnitudes and the ratio of $M_{\rm gas} = M_{\rm HI} + M_{\rm H_2}$ to a
dynamical estimate $M_{\rm dyn}$ of the galaxy's total mass.

Figures 8.20 and 8.21 show the distributions of these ratios as functions 
of Hubble type. Although the spreads in the ratios at a given Hubble type 
are large, very clear trends of the ratios with Hubble type are apparent. 

Figure 8.20 shows that the ISM of a later-type spiral tends to account
for more of the galaxy's total mass than does the ISM of an early-type spiral.
In fact, the median value of $M_{\rm gas}/M_{\rm dyn}$ increases by a factor of 5 from Sa
to Sc. Figure 8.21 indicates that the increase in fractional gas content as one
goes to later types is mainly a reflection of an increase in $M_{\rm HI}$: from Sa to Sc
the median value $M_{\rm HI}/M_{\rm H_2}$ increases by just over a factor 5. Hence while
$M_{\rm HI}/M_{\rm dyn}$ increases quite strongly down the Hubble sequence, $M_{\rm H_2}/M_{\rm dyn}$
is approximately independent of Hubble type. This extremely interesting
conclusion should be accompanied by a word of warning: $M_{\rm H_2}$ has been
obtained under the assumption that the CO-to-$M_{\rm H_2}$ ratio is the same for all
galaxies. Later-type galaxies tend to have lower luminosities and metallicities
than earlier-type galaxies, and this might lead to the median CO-to-$M_{\rm H_2}$
ratio decreasing down the Hubble sequence. Another source of anxiety is
that observations of external galaxies yield systematically larger values of the
ratio $T_A(^{12}{\rm CO})/T_A(^{13}{\rm CO})$ than do observations of molecular clouds in the
Galaxy, upon which the standard CO-to-$M_{\rm H_2}$ ratio, equation (8.28), rests.
From §8.1.4 it will be recalled that larger values of $T_A(^{12}{\rm CO})/T_A(^{13}{\rm CO})$
imply either that the radiation is emitted by more optically thin material,
or that beam dilution is relatively important for the $^{13}$CO measurements. It
is natural that when a large telescope beam is turned on an external galaxy,
a larger fraction of the CO radiation received derives from the optically
thin, low-density envelopes of clouds than is the case when the same beam
is turned towards a molecular cloud complex in the Milky Way. One must
worry, however, that any difference between the way in which molecular
material is sampled in the Milky Way and in external galaxies should be
reflected in a difference in the way $M_{\rm H_2}$ is derived from $T_A({\rm CO})$.

Figure 8.20 Early-type spirals are less gas-rich than late-type spirals.
Here we show for galaxies of various Hubble types the ratio of gas mass
to dynamical mass. The latter is estimated by multiplying the luminosity
by the typical mass-to-light ratio of the type from Rubin et al. (1985).
[After Young & Scoville (1991) with additional data from the FCRAO
Extragalactic CO Survey (Young et al. 1995) using data kindly provided
by J. Young] [Axis label: log($M_{\rm gas}/M_{\rm dyn}$)]

Figure 8.21 The ISM of an early-type spiral tends to be predominantly
molecular, while that of a late-type spiral tends to be atomic.
Here we show for galaxies of various Hubble types the ratio of masses
of interstellar gas in molecular and atomic form. [After Young & Scoville
(1991) using data kindly provided by J. Young] [Axis label:
log[M(H2)/M(HI)]; panel label: Sd/Sm]

Galaxies in the Virgo cluster have less HI, for any given luminosity,
than non-cluster galaxies of the same Hubble type (Giovanelli & Haynes
1983, van Gorkom & Kotanyi 1985, Warmels 1986). On the other hand,
their H2 contents are approximately normal for their Hubble type (Kenney
& Young 1989, Stark et al. 1986). We shall give a possible explanation of
these phenomena in §8.4 below.

Figure 8.23 For a sample of 31 spiral galaxies the strength of line
emission by $^{12}$CO is proportional to radio-continuum luminosity. $I_{\rm CO}$ is
the integral over all velocities of the flux in the 2.6-mm $^{12}$CO line. $S_{\rm tot}$
is the flux at 1.5 GHz. [After Adler, Allen & Lo (1991) using data kindly
supplied by D. Adler] [Axis label: log($S_{\rm tot}$/mJy)]


Radio-continuum and IR luminosities. Figure 8.22 shows a tight
correlation between the radio-continuum and infrared luminosities of galaxies.
The luminosities shown here are at 6.3 GHz and 100 μm, but a similar
correlation could be shown for, say, the 1.4 GHz and 12 μm luminosities. The
correlation shown in Figure 8.22 is the more striking because of the wide 
range of galaxies to which it applies: at the low end NGC 2366 is an SBb 
galaxy while NGC 855 is a dwarf elliptical; at the high end NGC 4194 is 
a Magellanic irregular and Mk 231 is a starburst galaxy with a strong UV 
continuum. The fact that these very different galaxies lie along the same 
correlation suggests that their radio continuum and IR luminosities are gen- 
erated by a single process. That process is thought to be the formation of 
massive stars. UV photons from these stars are converted into IR radiation 
by dust grains, while the supernovae that mark the death throes of the most 
massive stars generate the cosmic rays that are responsible for the observed 
radio continuum radiation. 

Figure 8.23 shows another interesting correlation between radio-continuum
luminosity and a measure of the cold component of the ISM by plotting
the 1.5 GHz radio-continuum flux, $S_{\rm tot}$, against the flux $I_{\rm CO}$ of 2.6 mm
radiation by $^{12}$CO molecules. These two variables prove to be approximately
proportional to one another: $I_{\rm CO}/S_{\rm tot} = 1.3 \pm 0.6$ km s$^{-1}$.

The correlations of Figures 8.22 and 8.23 suggest a close physical connec- 
tion between cool gas, star formation and supernovae. The unusually tight 
correlation of Figure 8.22 probably arises because both the plotted variables 
directly reflect the rate of formation of massive stars by measuring energy 
released by these stars: UV photons to heat the dust in the case of the IR 
luminosity and bulk kinetic energy to accelerate cosmic rays in the case of 
the radio luminosity. Is the correlation of Figure 8.23 of the same nature? 
That is, does increased CO luminosity reflect increased heating of molecular 
gas by young stars, perhaps through heating by cosmic rays? In this case, 
some galaxies might possess clouds that are rich in CO but inconspicuous 
in the spectral lines of CO because they are deficient in luminous stars to
heat the CO (Allen 1992). Alternatively, the correlation of Figure 8.23 may 
merely indicate that galaxies with lots of cool gas form lots of stars and thus 
generate plenty of cosmic rays. 

Although radio-continuum luminosity is tightly correlated with diag- 
nostics of cold gas, it does not come from the thin disk to which cold gas 
is confined. Figure 8.24 illustrates this phenomenon by showing the radio- 
continuum surface brightness of the edge-on disk galaxy NGC 891: the op- 
tical image of the same galaxy that is shown in Figure 1.4 makes it clear 
that the emission of Figure 8.24 is not confined to the disk. The actual 
distribution of radio-continuum luminosity is naturally explained if cosmic 
rays diffuse more than a kiloparsec from their places of birth before they 
cease to radiate at a few GHz; this diffusion is most apparent in making 
the disk of synchrotron emission a few kiloparsecs thick, but it also causes 
the scale-length of the synchrotron disk to be larger than that of the parent 
distribution of luminous stars.

Figure 8.25 The heavy line shows the B-band surface brightness of the
Sc galaxy NGC 6946 expressed as a number of solar luminosities per pc².
The dashed and light curves show, respectively, the surface densities of
H2 and HI in units of $10^{24}$ atoms per m². [From data published in
Tacconi & Young (1986)] [Axis label: R/kpc]

Figure 8.26 Surface densities N(H2) (triangles) and N(HI) (squares) in
the Sb galaxies NGC 2841 and NGC 7331. [After Young & Scoville (1983)
using data kindly supplied by J. Young] [Panels (a) and (b); panel (b) is
NGC 7331. Vertical axis: log10(N)]

8.2.2 Radial density profiles


Figure 8.25 shows the surface densities N(H2) and N(HI) as a function of 
radius in the Sc galaxy NGC 6946. Also shown is the galaxy’s B-band optical 
surface-brightness profile. These data illustrate several general features of 
the distribution of gas in late-type galaxies. First, H2 is very much more
strongly concentrated towards the center of the galaxy than is HI. Second,
the distribution of HI shows a slight depression at the center. Third, the HI
distribution is more extended than the distribution of star-light.

Figure 8.26 shows H2 and HI profiles for two Sb galaxies, NGC 7331
and NGC 2841. Again H2 is very much more centrally concentrated than
HI. However, in these two galaxies there are central depressions in both the
H2 and the HI distributions rather than in the latter only. Young & Scoville
(1991) report that not quite half of all Sb galaxies have such central
depressions in their H2 distributions. The Andromeda galaxy, M31, provides the
classic example of this phenomenon. Early-type disk galaxies generally have
central depressions in their radial HI profiles. The few radial H2 profiles
that are available for early-type disk galaxies resemble the H2 profiles of Sc
galaxies such as that shown in Figure 8.25. In particular, the H2 profiles of
early-type galaxies peak at the center and at values of N(H2) similar to the
peak values ($10^{26}$ m$^{-2}$) attained at the centers of Sc galaxies.

The great extent of the HI distributions of some galaxies has been of
crucial importance for studies of dark matter, since HI provides the perfect
probe of a galaxy’s circular speed and thus of its mass as a function of radius.
With this application in mind, Broeils & van Woerden (1994) searched for objects
with unusually extended HI distributions. Their data were obtained with
the Westerbork telescope, but they used it in a quick-look mode pioneered
by Warmels (1986), rather than as an aperture-synthesis instrument. In this
mode the Westerbork array can produce a radial surface-density profile in
two hours rather than the twelve hours required for an aperture-synthesis
map. The HI surface-density profiles obtained by Broeils & van Woerden
(1994) show a wide variety of shapes, and are frequently quite different on
the two sides of the galaxy. All galaxies in the sample have more than
1 M⊙ pc⁻² of HI at R₂₅, and NGC 6674, for example, reaches this level only
at R/R₂₅ ≈ 1.8.¹⁹ In fact, the mean relation between the radius R_HI at
which 1 M⊙ pc⁻² of HI is reached and R₂₅ is²⁰

$$\log_{10} R_{\rm HI} = (0.9 \pm 0.1)\,\log_{10} R_{25} + (0.26 \pm 0.07). \tag{8.53}$$
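
As a rough illustration of equation (8.53), the following sketch evaluates the mean relation for a single galaxy. The text does not state the length unit in which the relation was fitted; kiloparsecs are assumed here, and the function name and the value R₂₅ = 15 kpc are hypothetical choices made only for the example.

```python
import math

# Central values of the coefficients in the mean relation (8.53):
# log10(R_HI) = SLOPE * log10(R_25) + INTERCEPT.
SLOPE = 0.9
INTERCEPT = 0.26


def r_hi_from_r25(r25_kpc):
    """Mean HI radius predicted by (8.53); ASSUMES radii are in kpc."""
    return 10.0 ** (SLOPE * math.log10(r25_kpc) + INTERCEPT)


r25 = 15.0  # hypothetical optical radius in kpc
r_hi = r_hi_from_r25(r25)
print(f"R_HI = {r_hi:.1f} kpc, R_HI/R_25 = {r_hi / r25:.2f}")
```

Because the slope is below unity, the ratio R_HI/R₂₅ implied by the mean relation falls slowly as R₂₅ increases, under the unit assumption made above.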


Few galaxies in the Broeils & van Woerden sample show extended emission
at surface densities below 10²⁴ m⁻². In fact, there is now considerable
evidence that HI disks cut off very abruptly at a characteristic surface density
N_HI = 10²³ m⁻² (van Gorkom 1993). Maloney (1993) shows that the likely
cause of this cutoff is a phenomenon predicted by Sunyaev (1969): HI exists
only where it is shielded from the cosmic background flux of ionizing photons
by a layer of nearly fully ionized material. For plausible parameters Maloney
finds that this shielding layer has column density ≈ 3 × 10²³ m⁻². Once the
column density of a galactic disk approaches this value, the HI layer at its
center becomes very thin, and altogether vanishes at lower column densities.
If this model is right, HI disks are embedded in ionized gas that must glow
in the Hα line. Moreover, these Hα disks will extend to considerably greater
radii than the HI disks. We shall see in §9.3.3 that the disk of the Milky
Way is indeed enveloped in Hα emission.
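
The surface densities in this section are quoted both in solar masses per square parsec and in atoms per square metre. The sketch below converts between the two; the constants are standard rounded values rather than numbers taken from the text, and the conversion counts hydrogen atoms only (no helium correction).

```python
# Rounded physical constants (standard values, not taken from the text).
M_SUN_KG = 1.989e30        # solar mass
M_H_KG = 1.6735e-27        # mass of one hydrogen atom
PC_M = 3.0857e16           # one parsec in metres


def msun_pc2_to_atoms_m2(sigma_msun_pc2):
    """Convert an HI surface density from Msun/pc^2 to H atoms per m^2."""
    atoms_per_msun = M_SUN_KG / M_H_KG
    return sigma_msun_pc2 * atoms_per_msun / PC_M**2


def atoms_m2_to_msun_pc2(column_m2):
    """Inverse conversion: H atoms per m^2 to Msun/pc^2."""
    return column_m2 / msun_pc2_to_atoms_m2(1.0)


one_msun = msun_pc2_to_atoms_m2(1.0)
print(f"1 Msun/pc^2 of HI = {one_msun:.2e} atoms/m^2")
print(f"1e23 atoms/m^2    = {atoms_m2_to_msun_pc2(1e23):.3f} Msun/pc^2")
```

With these constants 1 M⊙ pc⁻² of HI corresponds to roughly 1.25 × 10²⁴ atoms m⁻², so the two thresholds quoted above are nearly the same level, and the cutoff column density lies about an order of magnitude below it.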

We have seen (i) that late-type spirals are on the average more gas-rich
than early-type spirals because they are richer in HI rather than in H₂,
and (ii) that HI tends to be stored at large radii, often outside a galaxy’s 
optical image. These facts suggest that a galaxy’s Hubble type may be 
at least in part determined by its HI content and that the latter may be 
susceptible to environmental influences. This conjecture is supported by the 
morphology-density relation (§4.1.2) which states that late-type galaxies are 


19 See §2.3.6 for the definition of R₂₅.
20 The values of R₂₅ for which equation (8.53) was established were corrected for
inclination and internal extinction.
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Figure 8.27 The correlation of CO and HI emission with spiral arms in the tidally 
disturbed galaxy M51. The left panel shows contours of CO emission overlaid on a
gray-scale representation of the red-continuum light. The right panel shows the
same contours overlaid on a gray-scale representation of the 21-cm emission.
[After Rand & Kulkarni (1990) courtesy of R. Rand]


comparatively rare in crowded regions. Moreover, we have seen that spiral 
galaxies in the Virgo cluster tend to be deficient in HI, while Szomoru, van 
Gorkom & Gregg (1996) find that galaxies in the Bootes void are unusually 
gas-rich. In §8.4 we shall discuss several processes that would tend to deprive 
cluster galaxies of outlying HI. 


8.2.3 Azimuthal distributions 


Bars and oval distortions Gas (mostly H₂) near the centers of galaxies is
frequently found to form a bar, even in systems whose optical luminosity does
not appear to be barred. When an optical bar is present, the H₂ distribution
is usually strongly peaked along the bar.

It has been speculated that the outer disks of galaxies might be mildly 
elliptical rather than axisymmetric (e.g., Binney 1995). The difficulty that 
tests of this conjecture have to overcome is resolving the ambiguity between 
an intrinsically elliptical disk and a circular disk seen at a slightly different 
inclination angle. We shall see below that this is best done by modeling the 
disk’s velocity field. 


Spiral structure Spiral structure may be traced in both H₂ and HI
distributions. Figure 8.27 illustrates this phenomenon in the case of the ‘whirlpool’
galaxy, M51. M51 has an unusually beautiful two-armed spiral structure be- 
cause it is being tidally disturbed by a companion — several dynamical studies 
of this system can be found in Wielen (1990). It is likely that M51 displays 
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the basic physics of spiral structure in exaggerated but nonetheless general 
form. The left panel of Figure 8.27 shows contours of CO emission superim- 
posed on the distribution of red light, which should be a reliable tracer of 
the underlying mass distribution in the galaxy’s disk. Careful inspection of 
this panel reveals that the CO emission peaks just on the concave side of the 
ridge-line of the spiral arms as delineated by red light. Dynamical studies 
indicate that gas flows through the arms from concave side to convex side. 
Hence the CO emission peaks just upstream of the arm’s crest in the mass 
distribution. 

The right panel of Figure 8.27 shows contours of CO emission superim- 
posed on the HI distribution. The HI can be seen to peak slightly towards 
the arm’s convex side relative to the peak in the CO emission. Thus the HI 
density appears to peak slightly downstream of the peak in CO. Hα emission
similarly peaks downstream of the peak in CO. The near coincidence of the
Hα and HI emission suggests that both arise through heating and dissociation
of molecular gas by hot short-lived stars that form where the molecular
density and CO emission are greatest.

It must be emphasized that Figure 8.27 considerably exaggerates the 
arm/inter-arm contrast in the CO emission; the data were gathered with an 
interferometer which was insensitive to smoothly distributed emission. It 
turns out that ~ 75% of the CO emission is smoothly distributed and only 
~ 25% is concentrated into the arms. 

The HI surface density distribution of M81 depicted in Figure 8.19 shows
one of the most distinct and regular spiral patterns seen in 21-cm line
radiation (Rots 1975, Rots & Shane 1975). Out to r = 10 kpc, the HI distribution
is dominated by a clear two-arm spiral pattern, which is well approximated
by a logarithmic spiral²¹ with a pitch angle of about 15°. The ratio of the
peak HI density in the arms to the average HI density near them is about 2.
The HI arms show an excellent correlation with optical arms; the
HI ridge line tends to lie on the inner (concave) edge of the bluest part of the
optical arm (which presumably is the region containing the highest density
of young blue stars). A narrow dust lane, seen optically, lies along the inner
edge of the eastern HI arm. The phase lag from the HI peak to the blue light
peak is about 10° to 15°. Using an estimated pattern speed (§4.4.7) for the
spiral structure (chosen to place the corotation radius at ≈ 10.5 kpc),
one deduces that the time required for an element of gas to pass from the
maximum HI surface density to the highest density of young stars is 10⁷ yr.
The ratio of hydrogen density to stellar mass density increases progressively
along the arms; the HI arms can be traced far beyond the end of the optical
arms. Outside R ≈ 10 kpc, two outer-arm features develop. The northern
arm is crisply defined and can be followed easily over several kiloparsecs. The
southern arm is fragmented, and it joins into the two large HI concentrations
to the east of the galaxy.


21 If (R, φ) are polar coordinates, then along a logarithmic spiral, ln(R) = tan(ψ) φ +
constant, where ψ is the pitch angle.
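
The logarithmic-spiral relation of footnote 21 is easily explored numerically. The sketch below uses the pitch angle of about 15° quoted for M81; the starting radius of 3 kpc and the function name are hypothetical and serve only to make the example concrete.

```python
import math


def log_spiral_radius(phi, r0, pitch_deg):
    """Radius at azimuth phi (radians) on a logarithmic spiral.

    Implements ln(R) = tan(pitch) * phi + ln(r0), so r0 is the radius at phi = 0.
    """
    return r0 * math.exp(math.tan(math.radians(pitch_deg)) * phi)


PITCH_DEG = 15.0   # pitch angle quoted in the text for M81
R0_KPC = 3.0       # hypothetical starting radius of the arm

# Radial growth factor of an arm over one full turn in azimuth.
growth_per_turn = log_spiral_radius(2 * math.pi, R0_KPC, PITCH_DEG) / R0_KPC
print(f"radius grows by a factor {growth_per_turn:.2f} per turn")

# Azimuth through which the arm winds between R0_KPC and 10 kpc.
phi_to_10 = math.log(10.0 / R0_KPC) / math.tan(math.radians(PITCH_DEG))
print(f"azimuth from {R0_KPC} to 10 kpc: {math.degrees(phi_to_10):.0f} deg")
```

For a 15° pitch angle the radius grows by a factor of about 5.4 per turn, so an arm of this kind covers most of the HI disk in less than one full winding.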
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Figure 8.28 A map of M101 in the 21-cm line made with the 
Westerbork Synthesis Radio Telescope. [After Kamphuis, Sancisi 
& van der Hulst (1991) using data kindly provided by T. van der 
Hulst] 


In addition to the spiral structure just discussed, Figure 8.19 shows that
on a small scale, the HI distribution of M81 appears to break up into a large
number of small blobs. The patchiness of the HI distribution of Figure 8.19
is commonly observed in all types of late-type galaxy. Indeed, one finds that
even in low-mass systems like the Magellanic Clouds (McGee & Milton 1966,
Hindman 1967) and other low-luminosity systems (Tully et al. 1978), the
neutral hydrogen is concentrated into complexes that each contain around
5 × 10⁶ M⊙ of HI and are a few hundred parsecs in diameter. Probably the
fact that low-mass systems can contain only a few of these complexes (< 50
for an Im galaxy), whereas giant galaxies like M81 contain many hundred
such complexes, contributes to the irregular appearance characteristic of
low-mass, late-type systems.


Lop-sidedness Gas that lies outside a galaxy’s optical image frequently
displays features that would not arise if the gas disk were axisymmetric 
and planar. The HI distribution of M101 shown in Figure 8.28 illustrates 
the phenomenon of lop-sidedness: the HI appears to extend much further 
towards the north-east than in the opposite direction. In this particular 
example, the contours of constant HI density are approximately circular at 
all radii, but at large radii they are not concentric with the galactic nucleus. 
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Figure 8.29 Each panel shows the integrated 21-cm line profile 
of a disk galaxy. The fairly symmetrical profiles in the lower two 
panels are for galaxies that are not lop-sided, while the asymmetric 
profiles shown in the upper two panels are for significantly lop-sided 
galaxies. [After Richter & Sancisi (1994) courtesy of R. Sancisi] 


This observation suggests that gas at large radii might be orbiting around a 
point offset from the nucleus. Actually, maps of the velocity field of the gas 
(see below) indicate that even at large radii, gas is moving on circular orbits 
around the nucleus. Thus the lop-sidedness of the HI distribution must arise 
because gas is non-uniformly distributed around each circular orbit, rather 
than because the orbits are strangely centered. Since the period of rotation 
of a gas cloud is roughly proportional to its galactocentric distance, in a 
few rotation periods ~ 1 Gyr, differential rotation should smear an initially 
lop-sided distribution of gas into an axisymmetric distribution. Hence it is
surprising that lop-sided distributions of HI are common. Galaxies such as
the LMC of type Sdm show this phenomenon to a particularly marked extent 
(de Vaucouleurs & Freeman 1970) and in §9.2.6 we shall see that the Milky 
Way is itself probably lop-sided. 
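
The smearing argument above can be made concrete with a short calculation. The sketch below assumes a flat rotation curve, so that the rotation period is proportional to radius; the circular speed of 200 km s⁻¹ and the radii are hypothetical values chosen for illustration and are not taken from the text.

```python
import math

KM_PER_KPC = 3.0857e16      # kilometres in one kiloparsec
SEC_PER_GYR = 3.156e16      # seconds in one gigayear


def rotation_period_gyr(r_kpc, v_c_kms):
    """Orbital period 2*pi*R/v_c in Gyr for a circular orbit."""
    return 2 * math.pi * r_kpc * KM_PER_KPC / v_c_kms / SEC_PER_GYR


def smearing_time_gyr(r_in_kpc, r_out_kpc, v_c_kms):
    """Time for gas at two radii to drift apart by half a turn (pi radians).

    ASSUMES a flat rotation curve, so Omega(R) = v_c / R at both radii.
    """
    omega_in = 2 * math.pi / rotation_period_gyr(r_in_kpc, v_c_kms)
    omega_out = 2 * math.pi / rotation_period_gyr(r_out_kpc, v_c_kms)
    return math.pi / (omega_in - omega_out)


V_C = 200.0  # km/s, hypothetical flat circular speed
for r in (10.0, 20.0, 30.0):
    print(f"R = {r:4.1f} kpc: period = {rotation_period_gyr(r, V_C):.2f} Gyr")
print(f"half-turn shear between 20 and 30 kpc: "
      f"{smearing_time_gyr(20.0, 30.0, V_C):.2f} Gyr")
```

Under these assumptions gas at 20 kpc and gas at 30 kpc move half a turn out of phase in a little under 1 Gyr, which is the order of magnitude quoted above.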

Richter & Sancisi (1994) pointed out that lop-sidedness can be detected 
from velocity profiles such as those shown in Figure 8.29: typically, lop- 
sidedness causes there to be more gas on one side of the kinematic minor axis 
than on the other, with the result that one horn of the integrated velocity 
profile is larger than the other. Richter & Sancisi (1994) exploited this fact 
to determine the frequency of occurrence of significant lop-sidedness in 1371 
galaxies for which good-quality integrated velocity profiles were available. 
They found that at least 50% of the galaxies are lop-sided. There is no 
generally accepted explanation of why lop-sidedness is so common given the 
comparatively short lifetime of an initially lop-sided distribution mentioned 
above. 
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NGC 5907 


Figure 8.30 The warped neutral hydrogen disk of the nearly edge- 
on galaxy NGC 5907. Emission by gas that is moving near the sys- 
temic velocity of the galaxy has been suppressed for clarity. NGC 
5907 has no nearby neighbors that could have recently disturbed it 
tidally. [After Sancisi (1976) courtesy of R. Sancisi]


One point that may be significant in this context is that the surface
density of all gas, neutral and ionized, could be nearly axisymmetric even
in lop-sided galaxies. Indeed, at the large radii and low column densities
≈ 10²⁴ m⁻² of HI at which lop-sidedness is commonly observed, HI may
constitute only a small fraction of the total column density, and large fluc- 
tuations in its density may reflect comparatively small fractional variations 
in either the total column density or the ionizing flux incident on the disk — 
see Problem 8.9. 


In Figure 8.30 two channel-maps of the edge-on spiral galaxy NGC
5907 are superimposed on a photograph of the system. The velocities of
the channel-maps shown are centered on 190 km s⁻¹ larger and 190 km s⁻¹
smaller than the systemic velocity of NGC 5907.²² Channel-maps at velocities
that differ by more than 190 km s⁻¹ from the systemic velocity show
little emission. Hence the rotation velocity of gas in the disk of NGC 5907 must
lie near 190 km s⁻¹. But if the disk were planar and its material rotating
on circles, the material displayed in Figure 8.30 would be symmetrically
disposed along the galaxy’s apparent major axis, rather than to the left of the
major axis in the south and to the right of it in the north. The obvious
explanation of this anomalous disposition of material in Figure 8.30 is that
the disk is not planar, but warped.

Only a small fraction of galaxies are viewed as nearly edge-on as NGC 
5907 and any warp in a less edge-on galaxy would not be immediately appar- 
ent in the angular distribution of HI. In the next subsection we shall learn 
how to diagnose warps from the observed velocity fields of galaxies, and see 
that they are as common as lop-sided galaxies. 


8.2.4 Velocity fields of disks 


We have so far concentrated on the total intensity of radio-line radiation 
coming from each point in a disk, regardless of frequency. We now turn our 
attention to the central frequency of the received radiation. As usual, we 
interpret the frequency of the radiation detected via the Doppler formula 
as a velocity. In many cases a central velocity and a dispersion about this 
velocity provide an adequate description of the observed line profile. Oc- 
casionally we shall find that the line profile requires a more sophisticated 
velocity distribution (cf. §11.1.2). 

Galactic disks are frequently modeled by a collection of concentric,
circular rings (see Figure 8.36). Let r be a vector from the galactic nucleus to
an arbitrary point in the disk, and let n̂(r) be the unit vector normal to the
ring of radius r. Then if Ω(r) is the angular velocity of this ring, the velocity
of material at position r is v = Ω(r) n̂ × r and the line-of-sight velocity is

$$v_{\rm los} = \hat{\bf R}\cdot{\bf v} = \Omega(r)\,{\bf r}\cdot(\hat{\bf R}\times\hat{\bf n}), \tag{8.54}$$

where R̂ is the unit vector from the observer towards the galaxy. The general
modeling procedure involves varying for each ring Ω and the polar angles
(θ, φ) of n̂ so as to optimize the fit between the values of v_los predicted by
equation (8.54) and those extracted from the line profiles for a grid of points
on the sky.

It is instructive to consider the predictions of equation (8.54) in the case
of a flat disk. The normals n̂ of all rings then coincide, and we may simplify
equation (8.54) to

$$v_{\rm los} = \Omega(r)\,\sin i\;{\bf r}\cdot\hat{\bf k}, \tag{8.55}$$


22 See page 507 for the definition of systemic velocity. 
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Figure 8.32 The spider diagram 
generated by the circular-speed curve 
of Figure 8.31 when the system is 
viewed at inclination i = 30° with
the apparent major axis horizontal. 
The area contoured is a square 10 
distance units on a side. 


where i is the galaxy’s inclination and

$$\hat{\bf k} = \hat{\bf R}\times\hat{\bf n}/\sin i \tag{8.56}$$

is the unit vector perpendicular to both R̂ and n̂; k̂ runs parallel to the
disk’s line of nodes, which is also the galaxy’s apparent major axis.
Figure 8.31 shows a plot of the rotation velocity v_c(r) = rΩ(r) of a
typical galaxy. The central part of the galaxy, in which Ω(r) ≈ constant and
v_c ∝ r, is called the region of solid-body rotation because here the galaxy
rotates almost as if it were rigid. Figure 8.32 shows contours of constant v_los
in the sky for the rotation curve of Figure 8.31; such a velocity-contour plot
for a disk is called a spider diagram. Notice that in Figure 8.32, the
velocity contours run straight and parallel to the apparent minor axis of the
galaxy in the region of solid-body rotation. The decline in the rotation curve
of Figure 8.31 beyond r ≈ 3 gives rise in the spider diagram to a closed
contour that is elongated along the apparent major axis of the galaxy. If the
rotation curve of Figure 8.31 had not peaked but had gone on rising as some
rotation curves do, or if it had risen to some constant value, there would
be no closed contours in the spider diagram. The kinematic minor axis
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Figure 8.33 The upper panels show the rotation curves of three Sa galaxies of very 
different luminosities from the sample of Rubin et al. (1985) plotted both on the same 
linear scale (left) and rescaled by their optical radii, R25 (right). The lower panels show 
similar plots for three Sc galaxies from the sample of Burstein et al. (1982). 


is the locus of points having the same velocity as the nucleus of the system 
(the systemic velocity), whereas the kinematic major axis is the curve 
which runs through the nucleus and is everywhere perpendicular to the local
constant-velocity contours. In Figure 8.32, the kinematic major and minor 
axes coincide with the apparent optical axes. In the case of a warped or 
elliptical disk, the two pairs of axes are distinct. 
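
The properties of the spider diagram of a flat disk can be checked with a few lines of code. The sketch below evaluates the vector form (8.54) in coordinates attached to the disk and confirms the behavior expected from (8.55). The rotation curve `v_circ` is a hypothetical analytic form that rises linearly, peaks and then declines; it is not the curve of Figure 8.31, and all function names are illustrative.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Dot product of two 3-vectors."""
    return sum(x * y for x, y in zip(a, b))


def v_circ(r):
    """HYPOTHETICAL rotation curve: rises linearly, peaks, then declines."""
    return r / (1.0 + r * r) ** 0.75


def v_los(x_sky, y_sky, incl_deg):
    """Line-of-sight velocity at sky position (x_sky, y_sky) for a flat disk.

    The sky x-axis is the line of nodes (apparent major axis). The vector
    form (8.54) is evaluated in disk coordinates, where n_hat = (0, 0, 1).
    """
    i = math.radians(incl_deg)
    n_hat = (0.0, 0.0, 1.0)
    r_hat_obs = (0.0, math.sin(i), math.cos(i))   # observer towards galaxy
    # Deproject: distances along the minor axis are foreshortened by cos(i).
    r_vec = (x_sky, y_sky / math.cos(i), 0.0)
    r = math.hypot(r_vec[0], r_vec[1])
    if r == 0.0:
        return 0.0
    omega = v_circ(r) / r
    return omega * dot(r_vec, cross(r_hat_obs, n_hat))


INCL = 30.0  # inclination used for Figure 8.32
for x in (0.5, 1.0, 2.0, 4.0):
    print(f"x = {x:3.1f}: v_los(major axis) = {v_los(x, 0.0, INCL):.3f}, "
          f"v_los(x, y=1) = {v_los(x, 1.0, INCL):.3f}")
```

In this sketch v_los vanishes along the apparent minor axis, equals v_c sin i along the apparent major axis, and is nearly independent of y close to the center, where the assumed curve is in solid-body rotation.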


Circular-speed curves The circular-speed curve v_c(r) of a galaxy is of
fundamental importance because from it one can deduce the mass M(r)
interior to radius r (see BT §2.6). Burstein et al. (1982) and Rubin et al.
(1985) have extracted circular-speed curves from long-slit optical spectra of
about 60 spiral galaxies of all types. Their circular-speed curves rarely extend
to R₂₅,²³ but usually extend beyond 0.5 R₂₅. Figure 8.33 shows the curves of
their most and least luminous Sa and Sc galaxies, and of randomly chosen Sa
and Sc galaxies of intermediate luminosity. Each curve is plotted twice, once
as a function of radius measured in kiloparsecs (left panels) and once as a
function of radius scaled by R₂₅ (right panels). The only clear generalization
that is suggested by Figure 8.33 is that the speed achieved by a rotation curve
at large R/R₂₅ is positively correlated with luminosity; equation (7.26) puts


23 See §2.3.6 for the definition of R₂₅.
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Figure 8.34 In NGC 753 (left panel) the shape of the optically observed circular-speed 
curve (crosses) cannot be reproduced by assigning a mass-to-light ratio Υ to each of the
disk and the bulge and then calculating the resulting circular speed (full curve). In the case
of NGC 801 (right panel), by contrast, the two curves are in excellent agreement. The
dashed and dot-dashed curves show the calculated contributions to v_c from the disk and
the bulge, respectively. [After Kent (1986)]


this important correlation onto a quantitative basis. Amongst the Sc rotation
curves there is a suggestion that luminous Sc galaxies attain their maximum
rotation speeds at smaller values of R/R₂₅ than do faint Sc galaxies, with
the result that at large values of R/R₂₅ the rotation curve of a luminous Sc
galaxy is more likely to be flat or falling than is the rotation curve of a faint Sc
galaxy. This type of luminosity dependence in the form of v_c(R/R₂₅) is not
evident in the Sa rotation curves of Figure 8.33.²⁴ Indeed, the upper right-hand
panel in Figure 8.33 shows that the general shapes of the rotation curves of
the faintest and most luminous Sa galaxies in the Rubin et al. sample are
rather similar.

The correlation between a galaxy’s luminosity and the amplitude of its
circular-speed curve that is so apparent from Figure 8.33 is quantified by
the Tully–Fisher relation that was discussed in §7.3.4.

Kent (1986) has presented r-band CCD photometry for most of the
galaxies in the Rubin et al. (1985) sample and has shown that the
overwhelming majority of the observed circular-speed curves are in reasonable
agreement with the circular-speed curves predicted by assigning plausible
mass-to-light ratios Υ(d) and Υ(b) to the galaxies’ disks and bulges. In
many cases the last point of the observed curve is higher than theory
predicts, and in two cases theory and observation are in significant disagreement.
Figure 8.34 shows one of these cases, that of NGC 753, and a case in which
the agreement is good. Kent finds that Υ_r(d) lies in the range 1.2Υ_r(⊙) to
7.8Υ_r(⊙) and in all but two cases Υ_r(b) lies in the range 1.2Υ_r(⊙) to 9Υ_r(⊙).
The last point on the circular-speed curves usually lies inside 0.7 R₂₅ ≈ 2.3 R_d,
where R_d is the scale length defined by equation (4.20).

21-cm observations enable one to determine circular-speed curves out- 
side the optically conspicuous parts of galaxies. At radii sufficiently large 


24 But see Rubin et al. (1985) for a contrary point of view regarding Sa rotation curves. 
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to contain all but a negligible fraction of the galaxy’s luminosity, any model 
which assigns fixed mass-to-light ratios to disk and bulge will predict that the
circular speed declines as v_c ∝ r^(−1/2). Actually, the observed circular-speed
curves very rarely show a significant decline at the largest observed radii.
Moreover, the mass required to keep the circular-speed curve flat or rising at
large radii is in many cases comparable to or larger than the masses of the
disk and the bulge that one infers from the structure of the circular-speed
curve inside R₂₅.
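
The contrast between a Keplerian decline and a flat circular-speed curve can be illustrated as follows. The sketch assumes a spherical mass distribution, for which v_c² = GM(r)/r; this is only a rough approximation for a disk galaxy, and the speed of 150 km s⁻¹ and the radius of 10 kpc are hypothetical values not taken from the text.

```python
G = 4.301e-6  # gravitational constant in kpc (km/s)^2 / Msun


def enclosed_mass(r_kpc, v_c_kms):
    """Mass (Msun) inside r implied by circular speed v_c.

    ASSUMES a spherical mass distribution, for which v_c^2 = G M(r) / r.
    """
    return r_kpc * v_c_kms**2 / G


def keplerian_speed(r_kpc, m_total_msun):
    """Circular speed (km/s) outside a body of fixed total mass."""
    return (G * m_total_msun / r_kpc) ** 0.5


V_FLAT = 150.0   # km/s, hypothetical flat circular speed
R_EDGE = 10.0    # kpc, hypothetical radius enclosing nearly all the light

m_lum = enclosed_mass(R_EDGE, V_FLAT)
for r in (10.0, 20.0, 30.0):
    v_kep = keplerian_speed(r, m_lum)
    ratio = enclosed_mass(r, V_FLAT) / m_lum
    print(f"r = {r:4.1f} kpc: Keplerian v_c = {v_kep:5.1f} km/s, "
          f"flat curve needs M(r)/M(R_EDGE) = {ratio:.1f}")
```

In this approximation a curve that stays flat out to three times the radius containing the light requires three times the mass, whereas the Keplerian prediction would have fallen by a factor √3 in speed.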

The most carefully documented case of this phenomenon is that of NGC
3198 studied by van Albada et al. (1985). NGC 3198 is an Sc galaxy at a
distance of ≈ 9.4 Mpc. Its velocity field can be accurately fitted by a
tilted-ring model in which the apparent major axes of the rings are aligned to within
1° and the inclinations of the rings vary systematically with radius by about
8°. The circular-speed curves independently derived from the northern and
southern halves of the galaxy agree to within 2 km s⁻¹. Thus there can be
little doubt that the observed HI is on circular orbits about the galactic
center. The lower panel in Figure 8.35 shows the observed circular-speed
curve (points) as well as that predicted by adopting a mass-to-light ratio
Υ_r = 3.8Υ_r(⊙) for the stellar component and taking into account the HI
mass revealed by the 21-cm observations. The upper panel in Figure 8.35
gives the r-band brightness profile from Kent (1987). From Kent’s CCD
photometry the disk scale-length is R_d = 55 arcsec = 2.5 kpc and the last point
on the measured circular-speed curve is at 11.2 arcmin = 30 kpc, so the
circular-speed curve extends to 12R_d. Van Albada et al. show that there
must be at least three times as much mass within 12R_d as within R₂₅.

Kent (1987) and Begeman (1987) showed that circular-speed curves that
extend to more than ≈ 5R_d generally require significant masses of material to
reside beyond R_d. Several authors have endeavored to map the distribution
of this ‘dark matter.’ The procedure adopted is to assume that the galaxy is a
superposition of a bulge, a disk and a dark halo. The luminosity distributions
of bulge and disk are inferred from photometry, so their mass distributions
are determined by their assumed mass-to-light ratios Υ. The dark halo
emits no light and thus has a completely unknown spatial distribution. Most
authors assume that it is spherical with density

$$\rho(r) = \frac{\rho_0}{1 + (r/a)^2}, \tag{8.57}$$

where ρ₀ and a are parameters to be determined. Unfortunately, this
functional form is physically unmotivated. N-body simulations of gravitational
clustering suggest that dark halos have density profiles that can be
approximated by the NFW law (Navarro, Frenk & White 1997)

$$\rho(r) = \frac{M_0}{r\,(a + r)^2}, \tag{8.58}$$

where M₀ and a are free parameters. Consequently, several studies have
employed this law. Unfortunately, whichever profile one adopts, one finds
that most circular-speed curves can be adequately fitted by models that have
wildly differing values of the free parameters, depending on what assumptions
are made regarding the values of Υ for disk and bulge.
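
To give a feeling for the two halo laws, the sketch below computes the circular speed that each would generate on its own. It assumes the forms ρ = ρ₀/[1 + (r/a)²] and ρ = M₀/[r(a + r)²], for which the enclosed masses follow by direct integration; the parameter values are hypothetical and chosen only so that the speeds are of galactic order.

```python
import math

G = 4.301e-6  # gravitational constant in kpc (km/s)^2 / Msun


def mass_pseudo_isothermal(r, rho0, a):
    """Mass inside r for rho = rho0 / (1 + (r/a)^2)."""
    return 4 * math.pi * rho0 * a**3 * (r / a - math.atan(r / a))


def mass_nfw(r, m0, a):
    """Mass inside r for rho = m0 / (r (a + r)^2)."""
    return 4 * math.pi * m0 * (math.log(1 + r / a) - r / (a + r))


def v_circ(mass_inside, r):
    """Circular speed (km/s) for a spherical mass distribution."""
    return math.sqrt(G * mass_inside / r)


# HYPOTHETICAL parameters, chosen only so the speeds are of galactic order.
RHO0, A_ISO = 2.0e7, 5.0      # Msun/kpc^3, kpc
M0, A_NFW = 3.0e10, 15.0      # Msun, kpc

for r in (5.0, 10.0, 20.0, 40.0):
    v_iso = v_circ(mass_pseudo_isothermal(r, RHO0, A_ISO), r)
    v_nfw = v_circ(mass_nfw(r, M0, A_NFW), r)
    print(f"r = {r:4.1f} kpc: v_iso = {v_iso:5.1f} km/s, v_nfw = {v_nfw:5.1f} km/s")
```

The first law tends to a constant speed √(4πGρ₀a²) at large radii, while the second rises to a broad maximum and then declines slowly. Over the limited range of radii sampled by observations the two can look alike, which is one reason why fits of this kind constrain the halo parameters so weakly.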

In general there is no reason to assume that dark matter is distinct from 
luminous matter. All that we know for certain is that Υ increases sharply
as the surface brightness of galaxies falls below a fraction of a percent of 
the brightness of the night sky. Whether this increase reflects a change in 
the mass-to-light ratio of stellar populations or the emergence of a tenu- 
ous distribution of utterly dark material cannot be ascertained by studying 
circular-speed curves and photometric profiles alone. 


Kinematic warps The observed spider diagrams of most galaxies show
significant deviations from the classical form shown in Figure 8.32. Consider
how a galaxy’s spider diagram will be affected by a warp such as that
evident in Figure 8.30. We suppose that this warp can be well represented by
the tilted-ring model (right-hand panel of Figure 8.36). Each ring projects
into an ellipse on the sky, and within this ellipse the velocity contours will
be as in Figure 8.32. In particular, within each ellipse, the kinematic axes
will be mutually perpendicular. But on account of the warp, the directions
of the kinematic axes will vary from ellipse to ellipse. Hence, the vector k̂
defined by equation (8.56) becomes a function of r through n̂(r), and the
kinematic major axis will twist so as to follow the changes in the direction
of k̂. Figure 8.37 illustrates this phenomenon in the case of M83.
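
The twisting of the kinematic major axis can be seen directly from equation (8.56). The sketch below takes the line of sight as the z-axis, so that the sky is the x-y plane, and computes the direction of the line of nodes for a sequence of rings. The warp law used (normals whose inclination and azimuth drift linearly with radius) is hypothetical and is not a model of M83.

```python
import math


def ring_normal(theta_deg, phi_deg):
    """Unit normal of a ring with polar angles (theta, phi).

    Coordinates: the line of sight R_hat is the z-axis and the sky is the
    x-y plane, so theta is the ring's inclination.
    """
    t, p = math.radians(theta_deg), math.radians(phi_deg)
    return (math.sin(t) * math.cos(p), math.sin(t) * math.sin(p), math.cos(t))


def line_of_nodes(n_hat):
    """k_hat = R_hat x n_hat / sin(i) for R_hat = (0, 0, 1), as in (8.56)."""
    kx, ky = -n_hat[1], n_hat[0]          # (0,0,1) x n_hat = (-n_y, n_x, 0)
    sin_i = math.hypot(kx, ky)
    return (kx / sin_i, ky / sin_i, 0.0)


def kinematic_pa_deg(n_hat):
    """Angle of k_hat in the sky plane, measured from the sky x-axis."""
    k = line_of_nodes(n_hat)
    return math.degrees(math.atan2(k[1], k[0]))


# HYPOTHETICAL warp: inclination and azimuth of the normal drift with radius.
rings = [(r, 30.0 + 2.0 * r, 10.0 * r) for r in range(0, 6)]
for r, theta, phi in rings:
    pa = kinematic_pa_deg(ring_normal(theta, phi))
    print(f"ring r = {r}: i = {theta:4.1f} deg, kinematic major axis at {pa:6.1f} deg")
```

In these coordinates the line of nodes of each ring lies 90° from the azimuth φ of its normal, so any radial drift in φ appears as a twist of the kinematic major axis, while a drift in θ alone changes only the inclination of the ring.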
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Figure 8.36 A tilted ring model of M83 (right) and the spider diagram predicted by this 
model (left). [After Rogstad, Lockhart & Wright (1974)] 
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Figure 8.37 The observed spider 
diagram of M83. [After Rogstad, 
Lockhart & Wright (1974)] 


Figure 8.38 When a strongly 
warped disk is viewed nearly edge- 
on, some lines of sight cut the 
warped disk twice, and the line- 
profile is liable to be multi-peaked. 


If the amplitude of a warp is sufficiently large and the galaxy is suffi- 
ciently highly inclined, the line of sight may in places pass through the disk 
twice — see Figure 8.38. Along such lines of sight the line profile will normally 
have more than one peak, and it will be necessary to associate more than 
one velocity with a given line of sight. Both our close neighbors, M31 and 
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Figure 8.39 The HI constant-velocity contours of NGC 5383 su- 
perimposed on an optical image of the galaxy. The spider diagram 
is approximately twofold symmetric, but the kinematic major and 
minor axes are by no means perpendicular. This is the signature of 
an elliptical disk. [After Sancisi, Allen & Sullivan (1979) courtesy 
of R. Sancisi] 


M33, exhibit this phenomenon (Rogstad, Wright & Lockhart 1976, Brinks 
1984). 

Evidence of warps of the type just described led Bosma (1978) to con- 
clude that 8 of a sample of 25 galaxies are warped. Briggs (1990) studied a 
sample of 12 galaxies with high-quality 21-cm data and concluded that warps 
tend to develop between R25 and the Holmberg radius (§2.3.6). Our under- 
standing of the dynamics of warps, which is still primitive, was reviewed by 
Binney (1992). 


Oval distortions In observed spider diagrams the kinematic axes are 
sometimes clearly not mutually perpendicular — see Figure 8.39. This phe- 
nomenon is the signature of gas moving on elliptical rather than circular 
orbits. Franx, van Gorkom & de Zeeuw (1994) analyze the appearance of 
elliptical disks. The details of this analysis are unfortunately complex and 
model-dependent; complex because cumbersome rotation matrices are in- 
volved, and model-dependent because the relationship between a disk’s shape 
and velocity field depends on the nature of the driving non-axisymmetric 
gravitational potential. By way of illustration, Figure 8.40 displays two
spider diagrams of an elliptical disk that is seen nearly face on (i = 1°).²⁵ Gas
in this disk moves on oval orbits, one of which is shown. Notice that in both 
panels the systemic-velocity contour is straight and cuts the oval where it 


25 Figure 9.35 shows the spider diagram of a highly inclined elliptical disk. 
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Figure 8.40 The effect of an oval distortion on the spider diagram. In both panels the 
normal to the plane of the disk is inclined at 1° to the line of sight. In the left panel the 
major axis of the elliptical disk makes an angle of 20° with the projection of the line of 
sight into the disk plane, while this angle is 70° in the right panel. The dotted line shows 
the line of nodes while the ovals show the projections of a single orbit. The dashed lines 
are the projected principal axes of the orbit. 


runs parallel to the line of nodes (shown dotted). Contours of equal pro- 
jected surface density within the disk will not necessarily coincide with the 
projections of individual orbits, but they will have the same principal-axis 
directions as the orbits (dashed lines in Figure 8.40). 

The key points about the velocity fields of elliptical disks that are il- 
lustrated by Figure 8.40 are: from most viewing angles (i) the kinematic 
axes are not perpendicular, although the photometric axes are for face-on 
orientation, (ii) the kinematic and photometric minor axes do not coincide, 
and (iii) the kinematic major axis lies close to the line of nodes, especially 
at high inclinations. It follows from point (ii) that there will usually be a 
velocity gradient along the photometric minor axis. Measurements of veloc- 
ity gradients along photometric minor-axes have been used to constrain the 
ellipticities of disks; if the outer disks of many spiral galaxies are elliptical, 
their average ellipticity cannot exceed ε ≈ 0.05 (Franx & de Zeeuw 1992).


8.2.5 S0 galaxies


21-cm observations of S0 galaxies confirm that these systems do indeed
occupy the intermediate position between ellipticals and spirals. Wardle &
Knapp (1986) studied statistically the results of several studies of HI in S0
galaxies with sensitive single-dish radio telescopes. Their sample consisted of
305 galaxies, of which 85 had detectable quantities of HI. This detection rate
is about twice that encountered in studies of HI in ellipticals (see below).
The detection rate of galaxies whose classification is ambiguous between S0
and Sa, designated S0/a, is roughly twice that for unambiguous S0 galaxies.
Thus the detection rate increases steadily as one proceeds down the Hubble
sequence.

The lower two curves in Figure 8.41 give the probability that a given
galaxy of type S0 or S0/a will be found to have $M_{\rm HI}/L_B$ smaller than
any given value. From this it can be seen that half of S0 galaxies have
$M_{\rm HI}/L_B < 10^{-3}\,M_\odot\,L_\odot^{-1}$, while a quarter have
$M_{\rm HI}/L_B > 0.03\,M_\odot\,L_\odot^{-1}$. The corresponding numbers for
S0/a galaxies are that half have $M_{\rm HI}/L_B < 0.02\,M_\odot\,L_\odot^{-1}$
and a quarter have $M_{\rm HI}/L_B > 0.08\,M_\odot\,L_\odot^{-1}$.

van Driel & van Woerden (1991) and their collaborators have mapped 
the distribution of HI in 24 early-type disk galaxies. They found that in 
these galaxies HI is frequently concentrated into rings rather than forming 
a filled disk, as it usually does in unbarred spiral galaxies. The HI-velocity 
fields are consistent with the gas clouds that make up the rings being on 
circular orbits. The disks of SB0 galaxies are generally virtually free of HI
inside a circle whose radius is of order three times the length of the bar’s 
semi-major axis. In a few cases tails of HI suggest that gas is being accreted 
by a lenticular from a neighboring object. 

The azimuthally-averaged surface density of HI in the gaseous rings is
significantly smaller than in later-type disks: in S0 galaxies peak values are
of order $1\,M_\odot\,{\rm pc}^{-2}$, while in S0/a galaxies values up to
$3\,M_\odot\,{\rm pc}^{-2}$ are achieved. This may be compared with the value
$\sim 10\,M_\odot\,{\rm pc}^{-2}$ which is characteristic of the peak in the HI
profile of an Sc galaxy such as that shown in Figure 8.25. van Driel & de Jong
(1990) have confirmed that the dust responsible for the weak dust lanes
observed in ${\rm S0}_3$ galaxies gives rise to detectable IR emission. Sofue &
Wakamatsu (1993) present evidence that dust in early-type galaxies is also
associated with emission by CO, at roughly the same level as in spirals. We
shall see below that some S0 galaxies contain significant quantities of very
hot, X-ray emitting gas.

Van Driel & van Woerden divide HI rings in lenticulars into inner and
outer rings depending on whether their radii are less than or greater than
$R_{25}$. Inner rings have typical radius $0.7R_{25}$, while outer rings have
typical radius $1.8R_{25}$. About half of both inner and outer rings are
significantly tilted with respect to the plane of the stellar disk; the outer
rings are frequently tilted through large angles (≳ 30°). Figure 8.42 shows an
extreme example of this phenomenon, NGC 2685. Sandage (1961-1995) dubbed this
the spindle galaxy because on photographic plates the object looks like a
ring around a spindle. Optical and 21-cm spectroscopy subsequently showed
that the ‘spindle’ is, in fact, an edge-on S0 galaxy whose rotation axis is
almost perpendicular to the axis about which a gas-rich ring rotates. Galaxies
like this, in which a ring runs nearly over the poles of the galaxy, are
called polar-ring galaxies. Whitmore et al. (1990) have catalogued ~ 70
galaxies that are, or may well be, of this class. They conclude that ~ 5% of
S0 galaxies either have, or have had, a polar ring.

Figure 8.42 The spindle galaxy, NGC 2685. [Courtesy A. Sandage]

The angular-momentum vector of a polar ring is perpendicular to the
angular-momentum vector of the bulk of the stars. A remarkably large
fraction of S0 galaxies display such misalignment of gaseous and stellar
angular-momentum vectors in an even more extreme form: Bertola, Buson &
Zeilinger (1992) find that in 9 of 33 S0 galaxies for which adequate
data are available, stars and gas rotate in opposite senses, so that their
angular-momentum vectors are (approximately) anti-parallel. Thus
counter-rotation of stars and gas is not uncommon in S0 galaxies. Since gas
shed by a galaxy’s stars will inevitably co-rotate with the stars, while
accreted gas will sometimes co-rotate and sometimes counter-rotate, the high
incidence of counter-rotation in S0s implies that the gas in a significant
fraction (≳ 40%) of S0 galaxies has an external origin. In §11.3.2 we shall
see that a few galaxies contain two stellar disks, which counter-rotate with
respect to each other. Such objects presumably form through star formation in
a counter-rotating gaseous disk.
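
One plausible way to arrive at a limit of this size from the counts quoted above is sketched below in Python. It assumes, purely for illustration, that accreted gas is equally likely to co-rotate or counter-rotate, so that the fraction of galaxies with accreted gas is twice the counter-rotating fraction, and it attaches a simple binomial uncertainty. The function name and the 50% assumption are ours; the original argument may differ in detail.

```python
import math

def external_gas_fraction(n_counter, n_total, p_counter=0.5):
    """Estimate the fraction of galaxies whose gas was accreted.

    Assumes (hypothetically) that accreted gas ends up counter-rotating with
    probability p_counter, and that gas shed by stars never counter-rotates.
    Returns the estimate and its 1-sigma binomial uncertainty.
    """
    f_counter = n_counter / n_total
    sigma_counter = math.sqrt(f_counter * (1.0 - f_counter) / n_total)
    return f_counter / p_counter, sigma_counter / p_counter

# Counts quoted in the text: 9 counter-rotators among 33 S0 galaxies.
fraction, sigma = external_gas_fraction(9, 33)
print(f"external-origin fraction ~ {fraction:.2f} +/- {sigma:.2f}")
```

Under these assumptions the central estimate is a little above one half, and subtracting one standard deviation gives a value close to the 40% quoted in the text.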


8.2.6 Metallicities of disk galaxies 


The metallicity of the ISM of a disk galaxy can be estimated from the
strengths of emission lines in the spectra of HII regions. The idea underlying
this approach is that the strengths of lines due to metals, relative to
those due to hydrogen, increase with metallicity. The most commonly measured
lines are Hβ and forbidden lines of O⁺, O²⁺, N⁺, and S⁺ at 372.7 nm,
500.7 nm, 658.3 nm, and 672.4 nm, respectively (see §8.1.3). The measured
intensities of these lines have to be corrected for the effects of reddening
and contamination of the nebular spectrum by starlight, which usually shows Hβ
in absorption.

Aller (1942) noted that in M33 the forbidden lines of O²⁺ tend to be
strong in the HII regions more distant than 20′ from the nucleus, and they
tend to be weak or absent in regions close to the nucleus. Searle (1971)
discussed the strengths of emission lines from HII regions in six Sc galaxies,
including M33, M81, and M101. He showed that different lines of sight
through the same HII region produce similar spectra, enabling one to
characterize an entire HII region with a single spectral type, and that HII
regions situated at similar distances from the nuclei tend to have similar
spectra, but that the spectra of regions located at different galactocentric
radii are generally quite unlike one another. In other words, the spectrum
coming from an HII region depends systematically on the region’s
galactocentric radius and on little else. The sense of this dependence is
that, in the spectra of HII regions which are located far from the center, the
lines emitted by O²⁺ are strong, whereas the spectra of regions at smaller
galactocentric distances have relatively strong lines of O⁺ and N⁺.

These trends are illustrated by Figure 8.43 from the work of Henry et al.
(1992): the top two panels show that in the giant Sbc galaxy NGC 4303 the
strengths of the lines of both O⁺ and N⁺ fade with increasing galactocentric
radius R, but the lines of N⁺ fade faster. The bottom panel of Figure 8.43
plots the radial variation of the quantity

$$R_{23} \equiv \frac{I({\rm O}^{+}) + I({\rm O}^{2+})}{I({\rm H}\beta)}, \qquad (8.59)$$

where $I({\rm O}^{+})$ is the intensity of the line doublet at 373 nm and
$I({\rm O}^{2+})$ is the intensity of the doublet at 500 nm. Since most O atoms
in the nebular gas will be either once or twice ionized and the intensities of
both Hβ and the O lines are proportional to electron density, $R_{23}$ is an
indicator of the O abundance.²⁶ Hence the bottom panel of Figure 8.43 suggests
that the


26 By combining calibrations in the literature of $R_{23}$ versus [O/H],
Zaritsky, Kennicutt & Huchra (1994) obtain
$-[{\rm O/H}] \simeq 2.73 + 0.33x + 0.202x^2 + 0.207x^3 + 0.333x^4$, where
$x = \log R_{23}$.

Figure 8.43 Nebular emission-line strengths in NGC 4303. Each panel shows a
ratio of line strengths as a function of radius. The top panel shows that
lines of N become relatively weaker with increasing galactocentric radius R,
while the middle panel shows that the ionization level of O increases with R.
The bottom panel shows that the O-abundance indicator that is defined by
equation (8.59) rises with R. [From data published in Henry et al. (1992)]


abundance of O falls from the center of NGC 4303 outwards. Similar results 
have been obtained for about three dozen disk galaxies — see Vila-Costas & 
Edmunds (1992) for references. 
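
As a rough illustration of how equation (8.59) and the calibration quoted in footnote 26 work together, the following Python sketch converts line intensities into an O abundance. The function names and the sample intensities are hypothetical, the logarithm is assumed to be base 10, and [O/H] follows whatever zero point the footnote's polynomial implies.

```python
import math

# Coefficients of the quartic quoted in footnote 26 (Zaritsky, Kennicutt &
# Huchra 1994): -[O/H] ~ 2.73 + 0.33 x + 0.202 x^2 + 0.207 x^3 + 0.333 x^4
ZKH_COEFFS = (2.73, 0.33, 0.202, 0.207, 0.333)

def r23(i_oii, i_oiii, i_hbeta):
    """Equation (8.59): summed O+ and O2+ line intensities relative to H-beta."""
    return (i_oii + i_oiii) / i_hbeta

def oxygen_abundance(r23_value):
    """Return [O/H] from R23 using the quartic above, with x = log10(R23)."""
    x = math.log10(r23_value)
    return -sum(c * x**k for k, c in enumerate(ZKH_COEFFS))

# Hypothetical line intensities (arbitrary units), chosen only for illustration.
inner = oxygen_abundance(r23(1.0, 0.5, 1.0))   # R23 = 1.5, small radius
outer = oxygen_abundance(r23(3.0, 5.0, 1.0))   # R23 = 8.0, large radius
print(inner, outer)
```

With this calibration a larger value of $R_{23}$ corresponds to a lower O abundance, so an outward rise in $R_{23}$ translates into an outward decline in [O/H].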

The radial variation of the spectral type of HII regions just described 
is evidence that the interstellar abundances of metals in a disk galaxy tend 
to fall with increasing radius. Unfortunately, quantifying this decline is not
easy. A full account of the relevant theory is beyond the scope of this book
(the interested reader is referred to the book by Osterbrock 1989), but
the basic principles are as follows.

The temperature of interstellar gas depends on its metallicity because
heavy-element ions and dust grains emit most of the radiation that cools
the ISM at temperatures below a million degrees. The temperature of the
ISM in turn controls the strengths of the emitted spectral lines by regulating
the frequency with which free electrons knock ions into the excited states
from which the different spectral lines are emitted. The other factor that
controls the observed nebular spectrum is the fraction of the atoms of a
given element which are in each ionization state. This clearly depends on
what fraction of the atoms is locked up in dust grains (see the discussion of
depletion in §8.1.1). It also depends on the hardness of the stellar radiation
field that ionizes the HII region because a harder radiation field will push
atoms into higher states of ionization. The hardness of the radiation field
is determined by (a) the nature of the nebula’s exciting stars, and (b) the
dust-to-gas ratio within the nebula. In particular, if the dust-to-gas ratio is
high, the hardest exciting photons are likely to be absorbed by dust rather
than by gas-phase atoms, and ionization levels will tend to be low. Hence
low ionization levels are a sign of metal-richness. Accordingly, the rise
with radius R in the ionization level of O suggested by the middle panel of
Figure 8.43 is consistent with the outwards decline in metal abundance that
is implied by the rise with R in the abundance parameter $R_{23}$ (bottom panel
of Figure 8.43).

Given the complexity of the physical processes that govern nebular spectra,
it is clear that extensive and high-quality observational data are required
for the determination of accurate abundances. In particular, it is highly
desirable to measure the relative intensities of a pair of lines such as those
of S⁺ at 407 nm and 672 nm, or those of O²⁺ at 436.3 nm and 500.7 nm: as we
saw in §8.1.3, from the ratio of the strengths of the members of such pairs
one can accurately estimate the electron temperature in the gas. Once the
temperature is known, reliable values for the abundances of ions whose line
intensities have been observed follow straightforwardly by equating the number
of collisional excitations of each excited state to the observed number of
radiative decays. Unfortunately, both of the temperature-sensitive line pairs
mentioned above involve a very weak line, whose intensity is often
unmeasurably small, especially from metal-rich, and therefore cool, nebulae.
Hence most analyses of nebular spectra determine the temperature from a model
of the nebula’s energy balance. The parameters of the model, which include
the effective temperature of the exciting stars and the extinction by dust,
are adjusted until the predicted spectrum agrees with the observed one.

In his seminal paper, Searle (1971) used model nebulae to conclude that 
his observations of normal Sc galaxies indicated that a modest increase in 
the O/H abundance ratio toward the galactic centers is accompanied by a 
substantial rise in the N/O abundance ratio; that is, he concluded that the 
nitrogen abundance relative to hydrogen rises more steeply toward galactic 
centers than does the oxygen abundance. Subsequent analyses, using either 
more refined model-fitting techniques or direct electron temperature mea- 
surements, have broadly confirmed Searle’s results. For NGC 4303 Henry et 
al. (1992) estimate 


[O/H] = —0.42R/R, + 0.64, 


[N/H] = —0.6R/R, + 0.55. (8.60) 
In particular, the O abundance in NGC 4303 falls by roughly an order of
magnitude over the measured range in R, while the N abundance falls by a
factor of order twenty in this range.

N is largely a secondary element, so the theory of §5.2.1 implies that
its abundance should be proportional to the square of the abundance of a
primary element such as O, and thus that [N/H] ∝ 2[O/H]. By contrast,
equation (8.60) implies that [N/H] ∝ 1.5[O/H]. In §10.4 we shall see that
abundances in stars near the Sun confirm that the N abundance rises
significantly more slowly than as the square of the O abundance.
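
The numbers quoted from equation (8.60) can be checked with a few lines of Python. This is only a sketch of the arithmetic: the variable `x` stands for the dimensionless scaled radius that appears in equation (8.60), and all names are ours.

```python
# Gradients from equation (8.60), in dex per unit of the scaled radius used there.
O_SLOPE, O_ZERO = -0.42, 0.64
N_SLOPE, N_ZERO = -0.6, 0.55

def o_h(x):
    """[O/H] at scaled radius x (the dimensionless radius of equation 8.60)."""
    return O_SLOPE * x + O_ZERO

def n_h(x):
    """[N/H] at the same scaled radius."""
    return N_SLOPE * x + N_ZERO

# Ratio of logarithmic gradients, d[N/H]/d[O/H].
slope_ratio = N_SLOPE / O_SLOPE

# Span in scaled radius over which O falls by one dex, and the N decline over it.
span = 1.0 / abs(O_SLOPE)
n_factor = 10 ** (n_h(0.0) - n_h(span))
print(slope_ratio, span, n_factor)
```

The ratio of the two slopes is about 1.4, which is the origin of the factor of roughly 1.5 quoted above, and over the span in which O drops by a factor of ten, N drops by a factor of a few tens.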

Vila-Costas & Edmunds (1992) summarize the literature of abundance
gradients within galactic disks and explore the degree to which the available
data can be understood in terms of models of chemical evolution such as
those discussed in §5.2. One clear result is that barred galaxies show much
smaller abundance gradients than do non-barred galaxies (Martin & Roy
1994). Figure 8.44 shows that another clear correlation is between the
absolute magnitude of a galaxy and the metallicity that one obtains by
extrapolating [O/H] within its disk to the galactic center: low-luminosity
galaxies tend to be metal-poorer than luminous galaxies (e.g., Miller & Hodge
1996). Since the asymptotic circular speed $v_c$ of a disk galaxy is strongly
correlated with its absolute magnitude by the Tully-Fisher relation (7.26), it
follows that [O/H] must be correlated with $v_c$. Indeed, Zaritsky, Kennicutt &
Huchra (1994) find that this correlation is tighter than that between [O/H]
and $M_B$.

In §5.3.1 we explored the predictions of the simplest model of chemical
evolution within an annulus of a disk galaxy: the closed-box model. This
model predicts that the metallicity is related to the fraction $f_{\rm gas}$ of
the local mass density that is in gaseous form by

$$Z = -p \ln f_{\rm gas}, \qquad (8.61)$$


where p is the nucleosynthetic yield [cf. equation (5.37)]. We have seen above
that in Sa and Sb galaxies, the gas fraction tends to be much lower near the
center than at large radii. Hence (8.61) suggests that the metallicity of the
interstellar gas in these galaxies should decrease from the center outward,
just as the observations suggest. However, Vila-Costas & Edmunds (1992) show
that the closed-box model cannot explain the data quantitatively, although
it predicts the observed trend qualitatively. Specifically, they estimated the
effective yield

$$p_{\rm eff} \equiv -\frac{Z}{\ln f_{\rm gas}} \qquad (8.62)$$

as a function of radius for each of the galaxies in their sample; if the
closed-box model applies, $p_{\rm eff}$ should be everywhere the same. The
derived values of $p_{\rm eff}$ are very uncertain because $f_{\rm gas}$ is hard
to estimate, for two reasons. First, the masses of stellar disks are extremely
ill determined because of uncertainty regarding the contribution of possible
massive halos to observed circular-speed curves (§§8.2.4 and 10.6). Second, a
significant fraction of the gas mass will typically be in molecular form and
be traced through observations of emission by CO. The factor X [equation
(8.28)] that connects the intensity of this emission to gas mass almost
certainly varies with metallicity, and this variation is not well understood.
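
The test applied by Vila-Costas & Edmunds can be illustrated with a minimal Python sketch of equations (8.61) and (8.62). The yield and the gas fractions below are hypothetical values chosen only to show the behavior; they are not taken from any galaxy.

```python
import math

def closed_box_metallicity(p, f_gas):
    """Equation (8.61): Z = -p ln(f_gas) for yield p and gas fraction f_gas."""
    return -p * math.log(f_gas)

def effective_yield(z, f_gas):
    """Equation (8.62): p_eff = -Z / ln(f_gas)."""
    return -z / math.log(f_gas)

# Hypothetical annuli running from a gas-poor inner disk to a gas-rich outer
# disk, all with the same true yield.
p_true = 0.01
for f_gas in (0.05, 0.2, 0.6):
    z = closed_box_metallicity(p_true, f_gas)
    print(f_gas, round(z, 4), round(effective_yield(z, f_gas), 4))
```

In a true closed box the recovered effective yield is the same at every gas fraction, while Z itself is highest where the gas fraction is lowest. A systematic variation of the measured effective yield with radius is therefore a sign that the closed-box assumptions fail.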

Vila-Costas & Edmunds found that, even allowing for the uncertainties
just mentioned, $p_{\rm eff}$ clearly varies systematically both within a
single galaxy and from galaxy to galaxy, in the sense that it tends to be
larger at small radii and at high metallicity. Specifically, dwarf galaxies
such as the SMC and LMC, which are metal poor [$Z({\rm LMC}) \sim 0.4Z_\odot$,
$Z({\rm SMC}) \sim 0.1Z_\odot$], imply $p_{\rm eff} \sim 0.005$, while the inner
disks of luminous galaxies require $p_{\rm eff} \sim 0.012$.

It is possible that the yield p varies with metallicity Z, for example
because the IMF (§5.1.9) depends upon Z. However, in Chapter 10 we shall
see that the observed metallicity distribution of stars near the Sun does not
favor this possibility. A more plausible explanation for the inward increase
of $p_{\rm eff}$ within galaxies is that gas drifts inwards through galactic
disks and at large radii is replaced by the accretion of metal-poor gas from
intergalactic space. By carrying metals inwards, this drift of gas through the
disk increases $p_{\rm eff}$ at small radii and reduces it at large radii. In
dwarf galaxies the inward drift of cold material within the disk may be
counteracted by a galactic wind or fountain which carries metal-rich material
high above the disk and then out to large radii.


8.2.7 Magnetic fields 


The magnetic-field structures of a few nearby disk galaxies have been probed
in several ways. The classical approach involves determining the polarization
of starlight that has suffered extinction by dust: in §3.7.1 we saw that
the Davis-Greenstein effect causes dust grains to align perpendicular to a
magnetic field B, and that this causes the electric vector of radiation from
obscured sources to be aligned with B. This approach suffers from the drawback
that radiation reflected by dust is polarized even in the absence of a
field B. Fortunately, the Davis-Greenstein effect causes the far-IR emission
from dust to be polarized perpendicular to B, and reflected starlight gives
rise to no complications at far-IR wavelengths.²⁷

Figure 8.45 Contours of constant $j_{\rm sync}$ and the direction of the
magnetic field in M51 superimposed on an optical image of the galaxy. The
tendency of B to run parallel to spiral arms is evident. [After Neininger &
Horellou (1996) courtesy of N. Neininger]

At radio frequencies magnetic fields have been probed by measuring the 
polarization of synchrotron emission from galactic disks and/or measuring 
the rotation measures of radio sources that lie behind the disks. As we saw 
in §8.1.4, synchrotron emission is polarized perpendicular to the direction of 
the local magnetic field, so from measurements of synchrotron radiation one 
may readily determine the direction of the projection of B onto the plane of 
the sky. Moreover, Figure 3.19 indicates that in the Milky Way the mean 
direction of B lies within the disk. If we assume that the same is true in 
external galaxies, we can infer the three-dimensional direction of B from 
synchrotron observations. 

Figure 8.45 shows contours of constant $j_{\rm sync}$ and the direction of the
magnetic field in M51 superimposed on an optical image of the galaxy. There
is a clear tendency of B to run parallel to the spiral arms. Studies of other
spiral galaxies show that this is the normal state of affairs (Beck et al.
1996). In the Andromeda galaxy, M31, which differs from M51 in lacking
grand-design spiral structure, B aligns with many small dust lanes and cloud
complexes, and at R ~ 20 kpc the galaxy is surrounded by a torus of field lines
(e.g., Ruzmaikin et al. 1990).

27 Hough (1996) reviews determinations of B from the polarizing effects of dust.

It is much harder to determine the magnitude of B than its direction. A
widely used method exploits the fact that the intensity $j_{\rm sync}$ of
synchrotron emission is proportional to the product $\mathcal{E}B^2$ of the
energy density $\mathcal{E}$ of synchrotron-emitting electrons and the square
of the strength of the magnetic field, including both ordered and random
components. $\mathcal{E}$ is estimated by assuming that the energy density of
all cosmic-ray particles (most of which are of too low energy to contribute to
the measured synchrotron radiation) is equal to the energy density
$\frac{1}{2}B^2/\mu_0$ of the magnetic field. This hypothesis of
equipartition of energy between magnetic field and cosmic-ray particles
is not self-evidently valid, but the derived field strength B is proportional
to only the fourth root of the measured quantity $j_{\rm sync}$, so significant
errors in the input parameters lead to relatively small errors in the
recovered value of B. For a sample of 146 late-type galaxies Fitt & Alexander
(1993) find that $B \simeq (1 \pm 0.4)$ nT. In spiral arms B tends to be larger
in magnitude but more disordered in direction than between spiral arms. In
§9.3.2 we shall see that these results are entirely consistent with our
understanding of the Galactic magnetic field.
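
The weak sensitivity of the equipartition estimate to its inputs follows from the fourth-root scaling, as the short Python sketch below shows. The constant `k` is a placeholder that lumps together everything other than B in the emissivity (an assumption made for illustration only), so just the ratio of the two field values is meaningful.

```python
def equipartition_field(j_sync, k=1.0):
    """Field strength implied by j_sync under equipartition.

    With j_sync proportional to E * B**2 and E proportional to B**2, one has
    j_sync = k * B**4, so B = (j_sync / k) ** 0.25. The constant k stands in
    for the electron-to-total cosmic-ray energy ratio, the path length and
    spectral factors; k = 1 is a placeholder, not a physical value.
    """
    return (j_sync / k) ** 0.25

b_ref = equipartition_field(1.0)
b_err = equipartition_field(10.0)   # input overestimated tenfold
print(b_err / b_ref)
```

A tenfold error in the measured emissivity, or equivalently in the lumped constant, changes the inferred field by less than a factor of two.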

Measurements of the rotation measures, $R_{\rm M}$, of background sources
provide a valuable check on the method of estimating B described above: from
$R_{\rm M}$ one can determine the integral of $n_e B_\parallel$ through the
disk [equation (8.35)], where $B_\parallel$ is the component of B parallel to
the line of sight. Since $R_{\rm M}$ is proportional to $B_\parallel$ rather
than proportional to $B^2$ as $j_{\rm sync}$ is, contributions to $R_{\rm M}$
from different points along a line of sight will have a tendency to cancel
unless B is highly ordered. Hence estimates of B that are derived from
measurements of $R_{\rm M}$ will be lower than those derived from
$j_{\rm sync}$ unless fields are highly ordered. If, by analogy with
observations of the Milky Way [equation (9.21)], one adopts
$n_e \sim 3 \times 10^4\,{\rm m}^{-3}$, then measurements of $R_{\rm M}$
imply values of B that are smaller than the values of B that are inferred from
$j_{\rm sync}$, just as we expect if B wiggles to the degree that Figure 3.19
suggests. This satisfyingly consistent result supports the hypothesis of
field-particle equipartition from which it has been derived.
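
The cancellation argument can be made concrete with a toy model in Python. It divides the line of sight into equal cells of uniform electron density and, as a deliberate simplification, uses the same signed field value in each cell for both estimates, ignoring the distinction between the components of B parallel and perpendicular to the line of sight. The function name and the cell values are hypothetical.

```python
import math

def field_estimates(b_parallel_cells):
    """Compare two line-of-sight field estimates for equal-length cells.

    b_parallel_cells holds the signed line-of-sight field in each cell, with
    uniform electron density assumed. A rotation-measure-like estimate averages
    the signed field; a synchrotron-like estimate depends on the mean of B**2.
    """
    n = len(b_parallel_cells)
    rm_like = abs(sum(b_parallel_cells)) / n
    sync_like = math.sqrt(sum(b * b for b in b_parallel_cells) / n)
    return rm_like, sync_like

ordered = field_estimates([1.0, 1.0, 1.0, 1.0, 1.0, 1.0])
tangled = field_estimates([1.0, -1.0, 1.0, 1.0, -1.0, 1.0])
print(ordered, tangled)
```

For the ordered field the two estimates agree, whereas reversing the field in a third of the cells leaves the synchrotron-like estimate unchanged and cuts the rotation-measure-like estimate to a third of its former value.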

It is interesting to compare the sign of $B_\parallel$ at points that lie at
equal galactocentric distances on opposite sides of a galaxy’s major axis. In
most galaxies these signs are opposite, as would be the case if field lines
simply encircled the galaxy. In a few cases the signs are found to be the
same. This result may indicate a fundamentally more complex field geometry, or
it may arise because (i) field lines move in and out as they run round a
galaxy, and (ii) the direction in which B encircles the galaxy is not the same
at all radii (in §9.3.2 we shall find that this is the case in the Milky Way).
If these two conditions are fulfilled, the sign of $B_\parallel$ will sometimes
be the same on opposite sides of the galaxy even though individual field lines
run around the disk in a fixed sense.


8.2.8 Star formation in disk galaxies 


We saw in §8.1.3 that the Hα luminosity of a galaxy provides a direct
measure of the number of ionizing photons that impact on its ISM. At distances
in excess of some tens of parsecs from the galactic nucleus the vast majority
of ionizing photons come from massive, young stars. If we assume some
distribution of these stars over spectral type, we can determine the total mass
of gas in these stars from the rate at which they produce ionizing photons.
If we assume, moreover, that the rate of formation of massive stars has been
approximately constant over the relatively short lifetimes of these objects
(≲ 10 Myr; see Table 5.2), we can infer the rate at which massive stars
are forming from the numbers currently present. Thus, observations of the Hα
luminosity of a galactic disk lead rather directly to the rate at which massive
stars are forming within it. In §5.4.2 we defined the stellar birthrate
parameter b as the ratio of the present star-formation rate to its value
averaged over the galactic lifetime, and saw that in models b is tightly
correlated with both the equivalent width EW(Hα) of the Hα line and B − V
color (Figure 5.22).

Table 8.2 Median values of the birthrate parameter for galactic disks

Type   Sa      Sab    Sb     Sbc    Sc     Scd/Sd   Sm/Im
b      <0.07   0.17   0.33   0.84   0.99   0.69     1.67

SOURCE: From data published in Kennicutt, Tamblyn & Congdon (1994)

Kennicutt, Tamblyn & Congdon (1994) estimated b for the disks of 210 
galaxies from these relations. They found that in Sa and Sab disks, b is 
always small (b ≲ 0.4). The spread of b values at a given Hubble type is
generally large, but there is a clear tendency for larger values of b to occur at 
later Hubble types. This trend can be clearly seen in Table 8.2, which gives 
median values of b for each Hubble type. The large scatter in measured b 
values presumably arises because in any given disk, the star-formation rate 
goes up and down in an irregular way. The median values listed in Table 8.2 
suggest that, averaged over a sufficient period, the disks of galaxies later 
than Sb have been forming stars at a fairly constant rate throughout their 
lifetimes. By contrast, star formation has effectively ceased in the disks of 
Sa and Sab galaxies. 
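
The definition of b and the trend in Table 8.2 can be sketched in Python as follows. The dictionary simply transcribes the table, and the example disk (its stellar mass, age and current rate) is hypothetical.

```python
def birthrate_parameter(sfr_now, stellar_mass_formed, age):
    """b = (present star-formation rate) / (lifetime-averaged rate)."""
    return sfr_now / (stellar_mass_formed / age)

# Median b values from Table 8.2 (the Sa entry is an upper limit).
MEDIAN_B = {"Sa": 0.07, "Sab": 0.17, "Sb": 0.33, "Sbc": 0.84,
            "Sc": 0.99, "Scd/Sd": 0.69, "Sm/Im": 1.67}

# Hypothetical disk: 5e10 Msun of stars formed over 10 Gyr, now forming
# 4 Msun per year.
b = birthrate_parameter(sfr_now=4.0, stellar_mass_formed=5e10, age=1e10)
print(b)
```

A value of b near unity, as for the median Sc disk, means that the present rate matches the lifetime average, while the small values for Sa and Sab disks mean that most of their stars formed at much higher rates in the past.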

From the position of their galaxies in the [B − V, EW(Hα)] plane,
Kennicutt, Tamblyn & Congdon (1994) argued that the IMF in these systems
has to be about as rich in massive stars as the Salpeter IMF predicts. In
particular, they suggest that if star formation followed the Scalo IMF, EW(Hα)
would be too small at a given B − V color. Unfortunately, this conclusion
is sensitive to assumptions that must be made about the history of star
formation in disks.

If one knows both the IMF and the rate at which massive stars form,
one can predict the rate at which gas is being locked up in long-lived stars,
and compare this to a disk’s present stock of interstellar gas. When proper
account is taken of the rate at which dying stars return gas to the ISM,
Kennicutt, Tamblyn & Congdon (1994) find that spirals can sustain their present
star-formation rates for 5-15 Gyr without accreting additional material.
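
The effect of gas return on the consumption time can be illustrated with a simple estimate in Python. The gas mass, star-formation rate and returned fraction below are hypothetical round numbers, and both rates are held constant, which is a simplification of the treatment in the work cited above.

```python
def gas_consumption_time(gas_mass, sfr, return_fraction):
    """Time (yr) to exhaust gas_mass (Msun) at star-formation rate sfr (Msun/yr).

    return_fraction is the fraction of the mass turned into stars that dying
    stars hand back to the ISM; only (1 - return_fraction) is locked up.
    Both rates are held constant, which is a simplifying assumption.
    """
    return gas_mass / (sfr * (1.0 - return_fraction))

# Hypothetical spiral: 1e10 Msun of gas, 3 Msun/yr, 40 per cent returned.
t_no_return = gas_consumption_time(1e10, 3.0, 0.0)
t_with_return = gas_consumption_time(1e10, 3.0, 0.4)
print(t_no_return / 1e9, t_with_return / 1e9)
```

Allowing for returned gas lengthens the consumption time by the factor 1/(1 − R), where R is the returned fraction.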

The same techniques that are used to determine the global star-formation
rates in galaxies of different Hubble types can be used to estimate the radial
distribution of star formation within a given galaxy. Figure 8.46 shows
the results of such an investigation for an Sb galaxy, NGC 2841 (cf. Figure
8.26), and an Sc galaxy, NGC 4254. Outside the core of each galaxy,
the surface density of Hα emission (open hexagons) is roughly proportional
to the surface brightness in blue continuum light (filled squares), just as we
would expect if the number of stars forming at each radius is proportional
to the number of stars that have already formed there. In both galaxies this
proportionality breaks down at small radii: here the star-formation rate is
currently peculiarly small.

Figure 8.46 The total gas density (triangles), the blue continuum surface
brightness (squares) and the Hα intensity (hexagons) in NGC 4254 and NGC 2841.
The zero-points of the vertical scales are arbitrary. [From data published in
Kennicutt (1989)]

Since stars form through the gravitational instability of the ISM, Schmidt
(1959) early speculated that the star-formation rate is proportional to a
power of the surface density of the ISM. Where the combined surface density
of atomic and molecular hydrogen is sufficiently high, Kennicutt (1989) finds
that the Hα intensities of every galaxy in a sample of seven can be fitted by
the Schmidt law

$$I({\rm H}\alpha) \propto \Sigma_{\rm gas}^{1.3}. \qquad (8.63)$$

The Hα intensity invariably falls below the value predicted by this law at
large radii, and the largest radius at which the law holds coincides, to within
the substantial uncertainties, with the radius at which the gas density falls
below ~ 0.63 times the critical value

$$\Sigma_{\rm crit} \equiv \frac{\kappa v_s}{\pi G}, \qquad (8.64)$$

where κ is the epicycle frequency (§3.2.3 of BT) and
$v_s \sim 6\,{\rm km\,s^{-1}}$ is the sound speed in the gas. In BT §6.2.3 it is
demonstrated that a gas disk is unstable to axisymmetric disturbances only
when its surface density exceeds $\Sigma_{\rm crit}$. Thus, the observations
suggest that the ISM is unstable even at slightly lower surface densities,
presumably because the gas represents only part of the mass of the disk.²⁸

Figure 8.47 Contours of constant X-ray surface brightness in NGC 720 overlaid
on the optical image of the galaxy from the Digital Sky Survey. Adjacent
contours are separated by a factor of 4 in X-ray surface brightness. [The
X-ray contours are from the analysis of ROSAT data by Buote & Canizares (1996)
courtesy of D. Buote]

Although the gas density rises above $0.63\Sigma_{\rm crit}$ in the
star-forming regions of disks, it does not do so by a big factor, even when
$\Sigma_{\rm crit}$ varies by more than an order of magnitude. This finding
suggests that if $\Sigma_{\rm gas}$ ever significantly exceeds
$\Sigma_{\rm crit}$, star formation depletes the ISM rapidly until
$\Sigma_{\rm gas} \sim \Sigma_{\rm crit}$.

In a few galaxies, especially early-type ones, $\Sigma_{\rm crit}$ rises more
rapidly near the center than does $\Sigma_{\rm gas}$, with the result that
$\Sigma_{\rm gas}$ falls below the threshold for star formation at both small
and large radii. In these cases there is usually a central depression in the
Hα profile, just as theory predicts.
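
To give a feeling for the numbers, the Python sketch below evaluates the star-formation threshold for an illustrative disk. It assumes that the critical surface density takes the standard gas-disk form κ v_s/(πG) implied by the stability criterion of BT §6.2.3, that the rotation curve is flat (so that κ = √2 v_c/R), and that v_c = 200 km/s at R = 8 kpc. These disk parameters and the function names are hypothetical.

```python
import math

G = 4.30091e-3  # gravitational constant in pc (km/s)^2 / Msun

def sigma_crit(kappa_kms_per_kpc, v_s_kms=6.0):
    """Critical surface density kappa * v_s / (pi * G), in Msun / pc^2."""
    kappa_per_pc = kappa_kms_per_kpc / 1000.0   # km/s/kpc -> km/s/pc
    return kappa_per_pc * v_s_kms / (math.pi * G)

def kappa_flat_rotation(v_c_kms, radius_kpc):
    """Epicycle frequency sqrt(2) * v_c / R for a flat rotation curve."""
    return math.sqrt(2.0) * v_c_kms / radius_kpc

# Hypothetical disk: v_c = 200 km/s at R = 8 kpc.
crit = sigma_crit(kappa_flat_rotation(200.0, 8.0))
threshold = 0.63 * crit
print(crit, threshold)
```

For these inputs the critical value is roughly 16 solar masses per square parsec and the empirical threshold about 10, comparable to the peak HI surface densities of late-type disks quoted in §8.2.5. Because κ falls as 1/R for a flat rotation curve, the threshold drops outward in proportion.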


8.3 The ISM in elliptical galaxies 


8.3.1 X-ray emitting plasma 


Whereas the bulk of the ISM in spiral galaxies is in the form of HI and H₂,
the ISM in ellipticals consists primarily of hot (T ≳ 10⁶ K) plasma. This
plasma produces X-rays by bremsstrahlung, bound-free, and line emission
processes. Figure 8.47 is an X-ray image of NGC 720 obtained with the
ROSAT satellite. From the X-ray spectrum of this object one infers that the
temperature of the plasma is $T \sim 7 \times 10^6$ K (Buote & Canizares 1997).
This temperature is rather higher than the characteristic kinetic temperature
that one associates with a population of stars with line-of-sight velocity
dispersion σ, $T_* = 2.9 \times 10^6 (\sigma/200\,{\rm km\,s^{-1}})^2$ K, since
for NGC 720 $\sigma \sim 200\,{\rm km\,s^{-1}}$ (Fried & Illingworth 1994). On
account of the high interstellar pressure to which this high temperature gives
rise, the plasma is approximately spherically distributed in the galaxy’s
potential well, rather than organized into a thin, centrifugally supported
disk as in a spiral galaxy.

28 For a quantitative discussion of how a stellar disk affects the stability
of an embedded gaseous one, see Jog & Solomon (1984).

Figure 8.48 X-ray and optical brightness profiles of NGC 720. The X-ray data
are shown as crosses, which indicate the radial averaging and uncertainty in
the data. These data came from ROSAT HRI and [...] normalized to agree at
R = 30 arcsec. The optical profile is a combined fit to the HST data from
Lauer et al. (1995) and the ground-based photometry of Peletier et al. (1990).
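
The coefficient in the expression for the stellar kinetic temperature quoted above can be checked numerically. The Python sketch below assumes that the temperature is defined by k T = μ m_p σ², with a mean particle mass μ = 0.6 proton masses as appropriate for fully ionized gas. That value of μ is our assumption, adopted because it reproduces the quoted coefficient.

```python
K_B = 1.380649e-23      # Boltzmann constant, J/K
M_P = 1.67262192e-27    # proton mass, kg

def stellar_kinetic_temperature(sigma_kms, mu=0.6):
    """Temperature T_* with k T_* = mu * m_p * sigma^2.

    mu = 0.6 (mean particle mass of a fully ionized plasma in proton masses)
    is an assumed value, chosen because it reproduces the 2.9e6 K coefficient.
    """
    sigma = sigma_kms * 1.0e3           # km/s -> m/s
    return mu * M_P * sigma**2 / K_B

t_star = stellar_kinetic_temperature(200.0)
beta = t_star / 7.0e6                   # ratio to the plasma temperature of NGC 720
print(t_star, beta)
```

For σ = 200 km/s this gives a stellar temperature close to 2.9 million kelvin, so the plasma in NGC 720 is hotter than its stars by a factor of between two and three.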

Figure 8.48 shows that in NGC 720 the X-ray surface brightness is almost
exactly proportional to the optical surface brightness. This coincidence
is characteristic of elliptical galaxies. At first glance it is rather
surprising, since the optical emissivity of a galaxy, $j_{\rm opt}$, is directly
proportional to the stellar density, $n_*$, whereas its X-ray emissivity,
$j_X$, is proportional to the square of the plasma density, $n_e$ (§8.1.2).
Thus, if $j_X \propto j_{\rm opt}$, then $n_e \propto n_*^{1/2}$: the gas
density falls off with radius more slowly than the stellar density, in just
such a way as to make the brightness profiles similar. This difference occurs
because the plasma temperature is higher than the kinetic temperature of the
stellar distribution. In fact, it is straightforward to show that if
$n_* \propto r^{-\alpha}$ then $n_e \propto r^{-\alpha\beta}$, where
$\beta = T_*/T$ is the ratio of the stellar and plasma temperatures (see
Problem 8.11). Thus, since generically $j_X \propto j_{\rm opt}$ in elliptical
galaxies, we know that β ~ 0.5 in these systems.
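
A brief sketch of the reasoning behind this scaling may be helpful. It assumes that both the stars and the plasma are isothermal and in equilibrium in the same spherical potential Φ, that the stellar velocity distribution is isotropic with dispersion σ, and that the stellar temperature is defined by $kT_* = \mu m_p \sigma^2$. These are simplifying assumptions made for illustration, not a full solution of Problem 8.11.

```latex
\begin{align}
\sigma^2 \frac{d\ln n_*}{dr} &= -\frac{d\Phi}{dr}, &
\frac{kT}{\mu m_p}\,\frac{d\ln n_e}{dr} &= -\frac{d\Phi}{dr}, \\
\Rightarrow\quad \frac{d\ln n_e}{d\ln n_*} &= \frac{\mu m_p \sigma^2}{kT}
  = \frac{T_*}{T} \equiv \beta, &
n_e &\propto n_*^{\beta} \propto r^{-\alpha\beta}.
\end{align}
```

Setting β = 1/2 then gives $n_e^2 \propto n_*$, which is the condition for the X-ray and optical emissivities to be proportional to one another.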

Figure 8.49 shows the degree to which the X-ray luminosities of the el- 
liptical galaxies that were detected by the Einstein X-ray observatory are 
correlated with (i) their optical, and (ii) radio-continuum luminosities. At 
low luminosities Lx and Lg are fairly tightly correlated and roughly pro- 
portional to one another. The X-ray luminosity at Lg < 10°” W is proba- 
bly dominated by the emission from mass-transfer binaries rather than the 
ISM²⁹; the spectra of low-luminosity galaxies resemble the rather hard
spectra of low-luminosity X-ray binaries in the Milky Way, while the spectra of
luminous galaxies are characteristic of cooler, more rarefied plasma. Above


29 X-ray binaries probably dominate the X-ray emissions of all disk galaxies
(Long & van Speybroeck 1983).
Figure 8.49 Correlations between the luminosities of E and S0 galaxies in the
optical (B), X-ray (Einstein) and radio (5 GHz) bands. [From data published in
Fabbiano, Gioia & Trinchieri (1989)]


$L_B$ ~ 10°” W, where the ISM is dominant, the mean X-ray luminosity 
increases roughly as $L_B^2$, but with considerable scatter. 

The right panel of Figure 8.49 shows that the radio-continuum and X- 
ray luminosities of elliptical galaxies are correlated. As in the case of the 
radio—IR correlation of Figure 8.22, the natural inference to draw from this 
correlation is that there is a causal connection between the X-ray emitting 
plasma and the radio-continuum emission. This point is highly controversial, 
however (e.g., Binney 1996). 

By modeling the radial X-ray brightness and temperature profiles one 
can infer the mass of the X-ray emitting plasma in an elliptical galaxy. The 
inferred masses range up to 101 Mo, so the most X-ray luminous ellipticals 
have as large a fraction of their mass in gas as a typical Sc galaxy. Other 
ellipticals, especially low-luminosity ones, have very small gas-fractions. We 
saw in §5.4 that the rate at which an elliptical’s dying stars pump gas into 
the ISM can be calculated quite reliably. One can easily show that if all this 
gas were to accumulate in the galaxy at the temperature $T \simeq 2T_*$, similar to 
that observed, the galaxy would lie close to the top of the $L_X$-$L_B$ correlation 
of Figure 8.49 (see Problem 8.12). 


8.3.2 Cool gas in ellipticals 


The great majority of elliptical galaxies are much poorer in cool gas than a 
spiral galaxy of comparable luminosity. This poverty, combined with the fact 
that comparatively few giant ellipticals lie conveniently close to the Milky 
Way, makes them hard to observe at 21 cm, at 2.6 mm or in the far infrared, and 
hinders quantification of their cool ISM content. 

The uppermost curve in Figure 8.41 summarizes the incidence of HI 
in elliptical galaxies. It shows that three quarters of elliptical galaxies 
Figure 8.50 The dust lane around 
NGC 5266. This negative image 
is based on a plate taken with the 
Anglo-Australian Telescope and has 
been processed to heighten the dust 
lane. [Courtesy of D. Malin] 


have $M_{\rm HI}/L_B < 1.5 \times 10^{-3}\,M_\odot\,L_\odot^{-1}$, while only 5% have 
$M_{\rm HI}/L_B > 0.05\,M_\odot\,L_\odot^{-1}$. 

CO has been detected in several ellipticals (Knapp 1990). Roughly 40% 
of a sample of bright ellipticals were detected by IRAS (Knapp et al. 1989), 
but mostly at very low values of $L_{60\,\mu{\rm m}}/L_B$. About 40% of bright ellipticals 
have dust lanes. These consist of cool, dusty gas, and the gas frequently 
shows H$\alpha$ or similar optical emission lines (Bertola 1987, Bertola 1990). 

The origin of cool gas in ellipticals is unclear. One would expect gas 
ejected by dying stars rapidly to join the pervasive hot plasma, although 
measurable IR or CO radiation might be emitted by shells of gas between 
their ejection by stars and their absorption into the general plasma. Some 
cool gas may be material that has cooled from the hot plasma. One would 
expect this gas to be very centrally concentrated. 

Observations of dust lanes in ellipticals such as that shown in Figure 8.50 
suggest, however, that they do not comprise material which has condensed 
out of the galaxy’s ISM. Notice that the dust lane of Figure 8.50 lies far 
out in the optical image and runs around the apparent minor-axis of the 
galaxy. Observations of emission lines from the lane’s cool gas show that 
the gas is rotating around the galaxy’s major axis, as one might expect from 
Figure 8.50. By contrast, stellar absorption-line spectra show that the stars 
in this galaxy rotate around the galaxy’s minor axis. Thus in NGC 5266 
the angular-momentum vectors of the dust and the stars are perpendicular. 
This proves to be the arrangement in about half of dust-lane ellipticals.$^{30}$ 
In other cases the angular momentum vectors appear to be inclined at an 
arbitrary angle. Sometimes they are anti-parallel, so that the stars circulate 


30 A word of caution is in order here regarding the statistics of dust-lane orientations: 
there is a danger that ellipticals that have dust lanes along their major axes will be 
classified as S0 galaxies. 
Figure 8.51 Galaxies with active nuclei are on average 1000 times 
more luminous at radio wavelengths than normal galaxies of the 
same IR luminosity. [After Knapp (1990)] 


one way and the gas another! Only in a small minority of cases are the two 
vectors parallel. This morphology is hard to reconcile with the gas having 
been ejected by the galaxy’s stars. Moreover, the presence of dust in the gas 
and the large radius of the dust lane are both hard to reconcile with the gas 
having condensed out of the hot plasma; dust grains would be rather quickly 
sputtered out of existence in the X-ray emitting plasma. Neither finding is 
strange if the gas has fallen in from outside. 

If dust lanes are formed from infalling material, there are two reasons 
why it is natural for strangely oriented dust lanes to be common in ellipticals 
but unknown amongst spirals. First, the conventional gaseous disk of a spiral 
will absorb any infalling material that comes in with an anomalous angular 
momentum vector — the orbital plane of such gas will intersect the disk’s 
plane, and at the points of intersection infalling material will collide and 
merge with more massive clouds in the disk. Second, it is widely believed 
that many elliptical galaxies have been formed in a merger. If one of the 
merging galaxies is a spiral, part of its disk is liable to be flung out in a tidal 
tail. Much of this tail will fall into the elliptical which evolves out of the 
merger remnant over a period of several Gyr. Thus of all galaxies ellipticals 
may be especially well placed to possess a steady stream of infalling dusty 
gas (Schweizer 1990). 

Figure 8.51 shows that although the range of galaxies that conform to 
the correlation shown in Figure 8.22 is broad, it is not universal, by showing 
that quasars (open circles) and radio galaxies (filled circles) lie on a different 
correlation line from that shown in Figure 8.22. These galaxies with active 
nuclei have roughly three orders of magnitude more radio luminosity per 
unit IR luminosity than the normal and star-bursting galaxies. It is natural 
that galaxies with active nuclei should lie to the right of normal galaxies in 
Figure 8.51: in their nuclei they have an additional powerful source of cosmic 
rays and thus radio luminosity. The interesting and poorly understood fact 
is that this luminosity is still correlated with infrared luminosity. 


8.4 Intergalactic gas 


The space between galaxies is not empty. The existence of intergalactic gas 
is important because cool gas is liable to fall into a neighbouring galaxy, 
while very hot gas can sweep the ISM out of a galaxy that moves through it. 
In either case intergalactic gas has the potential profoundly to modify the 
nature of nearby galaxies. 


8.4.1 The Magellanic Stream 


The Magellanic Stream consists of a series of six elongated HI clouds that 
are aligned with each other to form a swath of 21-cm emission that stretches 
over more than 60° in the southern sky. One end of this swath merges 
with the diffuse HI envelope around the Magellanic Clouds (Figure 8.52). 
This structure has been successfully modeled as gas that has escaped from 
the Small and Large Magellanic Clouds, and is now trailing the Clouds in 
their orbit about the Milky Way. This model, which was introduced by 
Murai & Fujimoto (1980), predicts that the Clouds are currently near the 
pericenter of their orbit and are moving eastwards. The proper motion of 
the Clouds was subsequently found to be $\mu_\alpha = (1.20 \pm 0.28)\,{\rm mas\,yr^{-1}}$ and 
$\mu_\delta = (0.26 \pm 0.27)\,{\rm mas\,yr^{-1}}$ (Jones, Klemola & Lin 1994), in 
agreement with Murai & Fujimoto’s model. 

Early 21-cm surveys of the Milky Way revealed HI emission at high latitudes 
and, at low latitudes, at velocities that differed by more than 50 km s$^{-1}$ 
from those associated with simple differential rotation (Verschuur 1975). The 
gas responsible for this emission does not form part of the Galaxy’s 
differentially rotating disk. There is a preponderance of these high-velocity clouds 
with negative heliocentric velocities, and less than ~ 3% of them are found 
at heliocentric velocities of more than 300 km s$^{-1}$ (Wakker 1991). Both of 
these results indicate that high-velocity clouds are associated either with the 
Milky Way or with the Local Group. 

The Local Group is not the only galaxy group that is known to contain 
intergalactic HI. A spectacular example of intergalactic gas is provided by 
the M81/M82/NGC 3077 group, in which neutral hydrogen of surface density 
Nyt > 107? m~? envelops several adjacent galaxies. 

We have seen that 21-cm measurements do not provide a good way of 
detecting gas whose total surface density ($N_{\rm HI} + N_{\rm H_2}$) is less than $\sim 10^{24}\,{\rm m^{-2}}$. 
Figure 8.52 The Magellanic 
Stream. The contour surrounding 
the Clouds is that on which the HI 
surface density equals $5 \times 10^{23}\,{\rm m^{-2}}$. 
[After Mathewson, Schwarz & 
Murray (1977)] 


Studies of absorption lines in the spectra of extragalactic sources enable us 
to detect much thinner layers of gas in the same way that studies of the 
ultraviolet spectra of bright Galactic stars enable us to probe the ISM (see 
§8.1.1). Unfortunately, no extragalactic source is bright enough to have been 
studied by the Copernicus satellite, so until the advent of the HST it was 
impossible to study nearby intergalactic gas as thoroughly as the ISM has 
been studied by this technique. In fact, astronomers had to choose between 
studying nearby intergalactic gas through lines of trace ions in particular 
ionization stages (as we saw in §8.1.1, the physical interpretation of such 
lines is problematic) and studying intergalactic gas at redshifts z > 2.5 large 
enough to allow the Ly$\alpha$ line to be redshifted into the optical bands. 

One very important result which did emerge from studies of absorption 
by ions such as C$^{3+}$ was that the gaseous disks of typical bright galaxies have 
to extend to $R \sim 30h^{-1}$ kpc. The simplest argument for this great extent 
involves determining the number of such lines per unit velocity interval in 
a typical quasar spectrum and calculating the number of galactic disks our 
line of sight would intersect over the corresponding distance in space if each 
bright galaxy had a disk of radius $\rho$ kpc. The number of lines observed and 
disks intersected agrees for $\rho \sim 100$ kpc (Spitzer & Bahcall 1969). This basic 
argument is confirmed by an experiment carried out by Bergeron & Boissé 
(1991): for each low-redshift absorption line in a quasar spectrum, try to 
find the responsible galaxy. From the redshift of the absorption line one can 
estimate the apparent magnitude of the galaxy. Then candidate galaxies are 
sought in the field around the quasar and finally their redshifts are measured. 
This work has been confirmed and extended by Steidel, Dickinson & Persson 
(1994), who concluded that QSO absorption lines are formed in disks that 
extend to $\sim 30h^{-1}(L/L^*)^{0.2}$ kpc. These disks belong to luminous ($L \sim L^*$) 
galaxies.$^{31}$ 

UV observations with HST make it possible to study the nearby intergalactic 
gas through Ly$\alpha$ absorption, which enables us to detect gas with 
column densities $N_{\rm HI} \gtrsim 10^{18}\,{\rm m^{-2}}$.$^{32}$ Spectra of nearby quasars reveal a 
surprisingly large number of clouds with column densities $10^{18}\,{\rm m^{-2}} < N_{\rm HI} <$ 
107° m$^{-2}$ (Bahcall et al. 1993). 

X-ray observations reveal that most rich clusters of galaxies are filled 
with plasma with temperatures $T > 10^7$ K. Such plasma within a cluster 
is called the intracluster medium. The mass of the intracluster medium 
is enormous: it exceeds the total mass in the luminous parts of the cluster 
galaxies by a factor of several. Since its mass is so great, at most a 
minority of this material can ever have been in a galaxy. The intracluster 
medium is not a simple primordial hydrogen—helium plasma, however: ob- 
servations of X-ray spectral lines reveal that it contains significant numbers 
of heavy-element ions. It seems likely that these heavy elements were formed 
in galaxies and then transferred to the intracluster medium either by being 
blown out of the galaxy by the cumulative effect of many supernovae, or by 
being stripped from galactic disks as their parent galaxy sped through the 
intracluster medium. 

What impact do these various detections of intergalactic gas have on the 
study of galaxies themselves? This question raises the further question: is 


31 See §4.1.3 for the definition of $L^*$. 
32 Column densities $N_{\rm HI} \gtrsim$ 107° m$^{-2}$ are required to generate detectable absorption 
lines of ions such as Mg$^+$. 
the observed gas entering or leaving galaxies? There is significant evidence 
that much of the gas observed in groups of galaxies has been stripped from 
galaxies. In the case of the Magellanic Stream, the model of Murai & Fujimoto 
(1980) indicates that dynamical interaction between the two Clouds 
and the Milky Way led to gas detaching itself from the Clouds to form the 
stream. Similarly, a great tail of HI near NGC 3628 in the Leo group of 
galaxies has been successfully linked by dynamical modeling to a stellar tail 
that is observed in this system (Haynes, Giovanelli & Roberts 1979). We 
have also seen that some part of the hot gas in clusters of galaxies has prob- 
ably been stripped from cluster galaxies. Yet one galaxy’s loss is likely to 
be another galaxy’s gain. In clusters of galaxies the hot plasma may be con- 
densing into stars in the giant galaxies that are often found at the centers of 
clusters. The gas of the Magellanic Stream may eventually stream into the 
disk of the Milky Way. Similarly gas that currently forms the polar rings 
around S0 galaxies is likely eventually to sink into the underlying galaxy. 
In all these cases the orientation and structure of the existing stellar disks will 
be profoundly affected by the acquisition of polar gas (see Problem 8.13). 
Finally, prolonged infall of streamers of gas into elliptical galaxies may sus- 
tain a low level of star formation within these galaxies, with significant effect 
upon the galaxy’s colors. 


Problems 


8.1 Calculate the width contributed to the quantity f(v) [defined by equa- 
tion (8.19)] by thermal motions of hydrogen atoms at temperature 80K. 


8.2 Show that the slope of a curve of growth (§8.1.1) tends to 0.5 when the 
equivalent width of the line is much greater than its Doppler width. 


8.3 The Arecibo telescope has a collecting area of $7 \times 10^4\,{\rm m^2}$, and is often 
used to observe the 21-cm line at a resolution of 5 km s$^{-1}$ per channel. 
Calculate the power received in a single channel when the optical depth along 
the line of sight is $\tau_\nu = 0.2$ and the material observed has spin temperature 
$T = 80$ K. 


8.4 Estimate the power received in a single channel of the type specified in 
the previous question by simply calculating the rate of spontaneous decays 
of excited atoms in the beam. Explain why this estimate is accurate even 
though the vast majority of photons emitted are absorbed before they reach 
the observer. (Note that this last statement is true even for the moderate 
value of $\tau_\nu$ specified.) 


8.5 Radio astronomers frequently report specific intensities, $I_\nu$, in units of 
mJy arcsec$^{-2}$. Show that if $I_\nu$ is measured in these units and the wavelength, 
$\lambda$, is in meters, then the antenna temperature is given by 
$T = 1.54 \times 10^4 \lambda^2 I_\nu$ K. 
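
The numerical coefficient in Problem 8.5 can be checked with a few lines of Python. This is only a sketch: it assumes that the antenna temperature is the Rayleigh-Jeans brightness temperature, $T = \lambda^2 I_\nu/(2k_{\rm B})$, and the function and constant names are illustrative.

```python
import math

K_B = 1.380649e-23                 # Boltzmann constant in J/K
MJY = 1e-29                        # 1 mJy in W m^-2 Hz^-1
ARCSEC = math.pi / (180 * 3600)    # 1 arcsec in radians

def antenna_temperature(i_nu_mjy_arcsec2, wavelength_m):
    """Rayleigh-Jeans brightness temperature T = lambda^2 I_nu / (2 k_B), in K."""
    i_nu_si = i_nu_mjy_arcsec2 * MJY / ARCSEC**2   # W m^-2 Hz^-1 sr^-1
    return wavelength_m**2 * i_nu_si / (2 * K_B)

# Temperature for I_nu = 1 mJy arcsec^-2 at a wavelength of 1 m.
coefficient = antenna_temperature(1.0, 1.0)
print(f"{coefficient:.3e}")
```

The conversion only changes units, so the printed value is the coefficient multiplying $\lambda^2 I_\nu$ in the problem.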
8.6 A galaxy at a distance $D$ Mpc is observed to have 21-cm flux $S(v)$ Jy 
at velocity $v$ km s$^{-1}$. Given that the 21-cm line is optically thin, show that 
the mass of HI in this galaxy is $M_{\rm HI}/M_\odot = 2.36 \times 10^5 D^2 \int {\rm d}v\, S$. 


8.7 Assuming the average absorption cross-section per dust grain, $\kappa_\nu$ [equation 
(8.42)], varies as $\lambda^{-\alpha}$, and that all grains have a common temperature 
$T$, show that the total emissivity is proportional to $T^{4+\alpha}$. For $\alpha = 1.5$, find 
the value of $hc/(k_{\rm B}T\lambda)$ at which the dust’s emissivity, $\kappa_\nu B_\nu(T)$, peaks. 


8.8 Calculate the rate per unit surface area at which a grain is heated 
by impacts of H atoms; assume that impacts are totally inelastic but do 
not cause H atoms to stick. Find the grain temperature at which radiative 
cooling would be in equilibrium with such collisional heating. Explain why 
this temperature is much lower than the observed temperatures of interstellar 
dust. 


8.9 The total gaseous column density in a galactic disk is 

$$N(R,\phi) = N_0[1 + 0.2\cos(kR + 2\phi)]\,e^{-R/R_d}, \tag{8.65}$$

where $k = 3/R_d$. Plot the HI column density as a function of $R$ for $\phi = 0$ and 
$\phi = 90^\circ$ under the assumption that the ionized layers above and below the 
disk have uniform column density $0.1N_0$. In the light of these plots, 
comment on the finding that at large galactocentric radii 21-cm observations 
yield much more regular velocity contours than contours of $N_{\rm HI}$. 


8.10 Consider gas that is ejected from stars in an elliptical galaxy in 
which the stars move isotropically with one-dimensional velocity dispersion $\sigma$. 
Given that radiative losses are unimportant and that the speeds of individual 
stellar winds are negligible compared with $\sigma$, show that the equilibrium 
temperature of interstellar gas in the galaxy is $T_* = 2.8 \times 10^6 (\sigma/200\,{\rm km\,s^{-1}})^2$ K. 
(Assume that the mean molecular weight in a fully ionized plasma is $\mu = 0.6$; 
see Problem 5.2.) 


8.11 Suppose that the stellar density in the elliptical galaxy of Problem 8.10 
falls with radius as $n_* \propto r^{-\alpha}$. Show that the density of the galaxy’s 
isothermal ISM will vary as $n \propto r^{-\alpha\beta}$, where $\beta = T_*/T$. 


8.12 Assume that gas accumulates for a Hubble time in the galaxy of 
Problem 8.10 at a temperature $T \simeq 2T_*$, similar to that observed. Show that the 
galaxy should lie close to the top of the $L_X$-$L_B$ correlation of Figure 8.49. 


8.13 The mass of HI in the Magellanic Stream is probably ~ 10° Mo. 
Show that this gas probably has more angular momentum about the Galactic 
center than all the gas in the Milky Way. 


9 
The Milky Way’s ISM 


In §8.1 we discussed the techniques used to track interstellar material and to 
determine its physical characteristics. We drew upon many results of studies 
of the Milky Way’s ISM because we can study nearby interstellar material in 
very much more detail than the interstellar media of external galaxies. In 
fact many of the techniques mentioned in §8.1 can currently only be used 
to study the local ISM. One might have hoped that from this great wealth 
of information on the properties of the local ISM would have flowed a clear 
and definitive picture. Sadly this is not the case; studies of the Milky Way’s 
ISM suffer badly from the syndrome of not being able to see the wood for 
the trees. 

The problem is two-fold. First, the wealth of the available information 
gets in the way of sweeping generalizations of the sort that may be plausibly 
made about external galaxies, about which we know so much less. Second, 
we have the misfortune to be located in the plane of the Milky Way, and 
suffer dreadfully from the particular bugbear of astronomy: ignorance of the 
distance to which we are peering. In this chapter our aim is to synthesize our 
knowledge of the local ISM into a more-or-less coherent, three-dimensional 
picture. We start with an analysis of the kinematics of Galactic rotation, 
since this provides us with much of our knowledge of the distances to gas 
clouds. 
Figure 9.1 Geometry of a disk of 
material in circular rotation. 


9.1 The kinematics of differential rotation 


We saw in §8.2 that line-observations give rise to a ‘data cube,’ and that, 
for an optically thin line, this cube specifies the amount of material moving 
at each velocity down each line of sight. When using line observations of 
material that is confined to a thin layer near the plane to study the large- 
scale structure of the Milky Way, it is natural to average the data for a small 
range of values of galactic latitude b around b = 0. The slice through the data 
cube that is isolated by this averaging procedure is conveniently displayed as 
a longitude-velocity plot [$(l,v)$ plot] like that shown in Figure 9.13 below. 
Our first task is to learn to interpret such plots. 

Our discussion is significantly simplified if we work with velocities rela- 
tive to the local standard of rest (LSR). This is the current velocity of a 
fictional particle that moves around the plane of the Milky Way on the closed 
orbit in the plane that passes through the present location of the Sun.$^1$ If 
the Milky Way is axisymmetric, this orbit is circular, otherwise it is oval. 
In §10.6 we shall see how the Sun’s velocity relative to the LSR may be 
determined. Clearly, once it is known, it is trivial to convert any measured 
heliocentric velocity into a velocity relative to the LSR. In this chapter we 
shall work exclusively with velocities relative to the LSR. 


9.1.1 The naive $(l,v)$ plot 


Consider the $(l,v)$ plot that we would observe if all the gas were in circular 
rotation. Let $\mathbf{R}$ be the position vector of material with respect to the Galactic 
center, and let the angular velocity of rotation of this material be $\boldsymbol{\Omega}(R)$. 
The velocity of this material is then $\mathbf{v}_c = \boldsymbol{\Omega} \times \mathbf{R}$. When observed from the 
particle which defines the LSR, whose rotation velocity is $\mathbf{v}_0 = \boldsymbol{\Omega}(R_0) \times \mathbf{R}_0$, 
the line-of-sight velocity, $v_{\rm los}$, of this material is the projection of the velocity 
difference $\mathbf{v}_c - \mathbf{v}_0$ on to the vector $(\mathbf{R} - \mathbf{R}_0)$ that runs from the Sun to the 

1 It is conceivable that more than one orbit could fulfill our defining condition for the 
LSR. In this case it would be natural to choose the most nearly circular orbit. 
Figure 9.2 The traces in the $(l,v)$ plot of three rings. The radii of the 
rings are $r = 0.9R_0$ (full curve), $r = 0.1R_0$ (dotted curve) and $r = 1.5R_0$ 
(dashed curve). The circular speed in the disk is unity at all $R$. 
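material: 

$$v_{\rm los} = \frac{\mathbf{R} - \mathbf{R}_0}{|\mathbf{R} - \mathbf{R}_0|} \cdot \left[\boldsymbol{\Omega}(R) \times \mathbf{R} - \boldsymbol{\Omega}(R_0) \times \mathbf{R}_0\right]. \tag{9.1}$$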


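Using the vector identities $\mathbf{a} \cdot (\mathbf{a} \times \mathbf{b}) = 0$ and $\mathbf{a} \cdot (\mathbf{b} \times \mathbf{c}) = \mathbf{b} \cdot (\mathbf{c} \times \mathbf{a})$, we 
can rewrite equation (9.1) as 

$$v_{\rm los} = \frac{[\boldsymbol{\Omega}(R) - \boldsymbol{\Omega}(R_0)] \cdot (\mathbf{R}_0 \times \mathbf{R})}{|\mathbf{R} - \mathbf{R}_0|}. \tag{9.2}$$

In the notation of Figure 9.1, $\mathbf{R}_0 \times \mathbf{R} = -R_0 R \sin\alpha\,\hat{\mathbf{n}}$, where $\hat{\mathbf{n}}$ is the unit 
vector that points perpendicular to the disk. Also, by the law of sines, 
$\sin\alpha/|\mathbf{R} - \mathbf{R}_0| = \sin l/R$. Finally, since the Milky Way rotates in the sense 
shown in Figure 9.1 when viewed from the northern hemisphere, we have 
$\boldsymbol{\Omega} = -|\boldsymbol{\Omega}|\hat{\mathbf{n}}$. Inserting these results into equation (9.2), we have finally 

$$v_{\rm los}(l) = [\Omega(R) - \Omega(R_0)]\,R_0 \sin l. \tag{9.3}$$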
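
The equivalence of the vector projection and the scalar formula (9.3) can be checked numerically. The Python sketch below rests on stated assumptions: the Galactic center is placed at the origin, the Sun at $(-R_0, 0)$, longitude is measured from the direction of the center toward the direction of rotation, and the circular speed follows a hypothetical power law. The function names are illustrative.

```python
import math

def omega(R, v0=1.0, R0=1.0, index=0.0):
    """Angular speed for an assumed circular speed v_c = v0 (R/R0)^index."""
    return v0 * (R / R0) ** index / R

def v_los_vector(l_deg, d, v0=1.0, R0=1.0, index=0.0):
    """Return (R, v_los) by projecting the velocity difference on the line of sight."""
    l = math.radians(l_deg)
    ux, uy = math.cos(l), math.sin(l)        # unit vector from the Sun at longitude l
    sx, sy = -R0, 0.0                        # position of the Sun
    px, py = sx + d * ux, sy + d * uy        # position of the material
    R = math.hypot(px, py)
    w, w0 = omega(R, v0, R0, index), omega(R0, v0, R0, index)
    # Clockwise rotation seen from the north gives v = |Omega| (y, -x).
    dvx = w * py - w0 * sy
    dvy = -w * px + w0 * sx
    return R, dvx * ux + dvy * uy

def v_los_formula(l_deg, R, v0=1.0, R0=1.0, index=0.0):
    """Line-of-sight velocity from the scalar relation [Omega(R) - Omega(R0)] R0 sin l."""
    dw = omega(R, v0, R0, index) - omega(R0, v0, R0, index)
    return dw * R0 * math.sin(math.radians(l_deg))

for l_deg, d in [(20.0, 0.3), (75.0, 1.2), (150.0, 2.5), (-40.0, 0.8)]:
    R, v_vec = v_los_vector(l_deg, d, index=0.1)
    print(l_deg, round(R, 3), round(v_vec, 6), round(v_los_formula(l_deg, R, index=0.1), 6))
```

The two velocity columns agree for every line of sight, inside and outside the solar circle, because the scalar formula depends on position only through $R$ and $l$.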


Now imagine that all the gas in the Milky Way were concentrated into 
a ring of radius $R$. Then equation (9.3) states that, along any line of sight 
that intersects the ring, we will detect material at just one velocity, and, 
moreover, that this velocity will be proportional to $\sin l$. Figure 9.2 shows 
the traces in the $(l,v)$ plot of three such rings. Each is a section of a sine 
curve. If $R/R_0$ is significantly smaller than unity, the section is small and 
approximately straight; quantitatively, it runs from $l = -l_{\max}$ to $l = l_{\max}$, 
where $l_{\max} = \arcsin(R/R_0)$ is half the angular extent of the ring. If $R/R_0$ 
is less than but of order unity, the section is distinctly curved. If the ring 
lies outside the solar circle, then it traces a full sine curve in the $(l,v)$ plot. 
For any value of $R/R_0$ the slope of the trace at the origin is proportional to 
the difference $\Omega(R) - \Omega(R_0)$. For any plausible circular-speed curve, $\Omega(R)$ 
increases inwards, and therefore this slope is largest for the smallest rings. 
As $R/R_0$ increases from small values towards unity, this slope continuously 
declines, and is zero for $R/R_0 = 1$; points on the solar circle do not move 
relative to the LSR. For $R/R_0 > 1$ the slope of the sine curve near the 
Figure 9.3 Grayscale of optical depth in the $(l,v)$ plane. As 
for Figure 8.12, the number density of hydrogen atoms has been 
assumed to equal 2 x 10°m~? at $R_0$ and to vary with $R$ as 
$\exp(-R/2R_0)$. The circular speed is given by $v_c(R) = (R/R_0)^{0.1} \times 210$ km s$^{-1}$. 
The sine curve $v_{\rm los} = -v_c(R_0)\sin l$ (shown dashed) lies 
just outside the boundary of the occupied band. Figure 9.4 shows a 
section of the surface $\tau(l,v)$ along the vertical dotted line $l = 20^\circ$. 


origin is opposite to that for $R/R_0 < 1$, and the absolute value of the slope 
increases steadily as $R/R_0$ increases. Since, for any plausible circular-speed 
curve, $\Omega(R) \to 0$ as $R \to \infty$, it is clear from equation (9.3) that the amplitude 
of the sine curve tends in the limit $R \to \infty$ to minus the circular speed at 
the solar circle, $R_0\Omega(R_0)$. 
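
The behavior of ring traces described above can be reproduced with a short Python sketch. It assumes unit circular speed at all radii, as in Figure 9.2; the function name and the sampling of longitudes are illustrative choices.

```python
import math

def ring_trace(R, R0=1.0, v_c=lambda r: 1.0, n=5):
    """Sample (l in degrees, v_los) along the (l, v) trace of a ring of radius R."""
    # Rings inside the solar circle are seen only for |l| <= arcsin(R/R0).
    l_max = math.degrees(math.asin(R / R0)) if R < R0 else 180.0
    d_omega = v_c(R) / R - v_c(R0) / R0
    points = []
    for i in range(n):
        l = -l_max + 2.0 * l_max * i / (n - 1)
        points.append((l, d_omega * R0 * math.sin(math.radians(l))))
    return points

inner = ring_trace(0.5)      # a short section of a sine curve, |l| <= 30 degrees
solar = ring_trace(1.0)      # the solar circle: no motion relative to the LSR
distant = ring_trace(1e6)    # amplitude approaches minus the local circular speed
print(inner[-1], solar[-1], distant[3])
```

For a very large ring the sampled velocity at $l = 90^\circ$ approaches $-v_c(R_0)$, the limiting amplitude discussed in the text.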

The Milky Way’s disk may be considered to be made up of infinitely 
many rings, so much of any observed $(l,v)$ plot is occupied. Material 
interior to the solar circle should occupy the quarters $0 < l < 90^\circ$, $v_{\rm los} > 0$ and 
$-90^\circ < l < 0^\circ$, $v_{\rm los} < 0$ at top left and bottom right of Figure 9.2. Gas 
outside the solar circle should occupy the remaining two quarters of the figure. 
We shall see that this theoretical division of the $(l,v)$ plane is only partially 
confirmed by observation, and we speak of material that lies outside the 
quarters specified above as being at forbidden velocities. We mentioned 
above that the central slope of the curve associated with a ring of radius 
$r$ can be large when $r$ is small, but is never greater than $R_0\Omega(R_0)$ when 
$r > R_0$. Hence for $l > 0$ velocities less than $-R_0\Omega(R_0)\sin l$ are strongly 
forbidden in the sense that material should not be found at such velocities 
regardless of whether it lies inside or outside the solar circle. On the other 
side of the Galactic center there is an equivalent range of strongly forbidden 
positive velocities. 

Figure 9.3 shows the magnitude of the optical depth $\tau$ of the 21-cm line 
in the $(l,v)$ plane for a model disk in which all material moves on perfectly 
circular orbits. As we anticipated in the discussion above, $\tau$ is non-zero only 
in two broad arcs that are bounded on one side by sections of a sine curve. 
When we point a radio telescope to the Milky Way at some longitude $l_1$, 
the resulting spectrum will show emission at velocities corresponding to the 
intersection in Figure 9.3 of the vertical line $l = l_1$ with the populated band. 
The vertical dotted line in Figure 9.3 and the plot of optical depth versus 
$v_{\rm los}$ shown in Figure 9.4 illustrate the case $l_1 = 20^\circ$. Since this line lies in 
the first quadrant ($0 < l_1 < 90^\circ$), it runs through the occupied band from 
$v_{\rm los} = -v_c(R_0)\sin l_1$ below to 

$$v_{\rm los}^{(t)}(l_1) = [\Omega(r_t) - \Omega_0]\,R_0 \sin l_1 \tag{9.4}$$

above, where $r_t$ is the radius of the smallest ring (with the largest value of $\Omega$) 
that can be seen along that line of sight. Clearly the line of sight is tangent 
to this ring, and simple geometry demonstrates that 

$$r_t = R_0 \sin l_1. \tag{9.5}$$

The point at which the line of sight touches the ring of radius $r_t$ is called 
the tangent point and the quantity $v_{\rm los}^{(t)}$ is called the terminal velocity. 
Empirically, the terminal velocity can be determined by examining an 
observational analog of Figure 9.4: it is the velocity of the abrupt decrease in 
emission at $v > 0$ for $l > 0$ and $v < 0$ for $l < 0$. In Figure 9.4 $v_{\rm los}^{(t)}$ lies near 
$0.6v_c(R_0)$. Combining equations (9.4) and (9.5) we have 

$$v_c(r_t) = v_{\rm los}^{(t)}(l_1) + v_c(R_0)\sin l_1. \tag{9.6}$$

Since $v_{\rm los}^{(t)}$ is in principle readily determined for any value of $l_1$ between 0 and 
$90^\circ$, this relation enables us to determine the rotation curve $v_c(R)$ for $R < R_0$ 
provided some means can be found to determine $v_c(R_0)$. The data for $l < 0$ 
Figure 9.5 Lines of sight at $|l| < 90^\circ$ cut each 
ring of the inner disk twice but rings of the outer 
disk only once. At $l > 0$ the inner rings contribute 
emission at $v > 0$, while outer rings contribute 
emission at $v < 0$. 


and $l > 0$ yield independent determinations of $v_c(R)$, and comparison of 
these two determinations provides a valuable check on the validity of the 
derived circular-speed curve. 
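
The tangent-point method can be illustrated with a round trip in Python: a model rotation curve predicts the terminal velocity through equations (9.4) and (9.5), and equation (9.6) then recovers the circular speed at the tangent point. This is a sketch under stated assumptions: the power-law curve $v_c = 210\,(R/R_0)^{0.1}$ km s$^{-1}$ is the model used for Figure 9.3, the value $R_0 = 8$ kpc is purely illustrative, and the function names are hypothetical.

```python
import math

def terminal_velocity(l_deg, v_c, R0):
    """Model terminal velocity from equations (9.4) and (9.5), for 0 < l < 90 deg."""
    s = math.sin(math.radians(l_deg))
    r_t = R0 * s                                   # tangent-point radius
    return (v_c(r_t) / r_t - v_c(R0) / R0) * R0 * s

def rotation_curve_point(l_deg, v_term, v_c_R0, R0):
    """Apply equation (9.6): return (r_t, v_c(r_t)) from a terminal velocity."""
    s = math.sin(math.radians(l_deg))
    return R0 * s, v_term + v_c_R0 * s

R0 = 8.0                                           # kpc, an assumed value
model = lambda R: 210.0 * (R / R0) ** 0.1          # km/s, assumed model curve

v_t = terminal_velocity(20.0, model, R0)
r_t, v_rec = rotation_curve_point(20.0, v_t, model(R0), R0)
print(round(v_t / model(R0), 3), round(r_t, 3), round(v_rec, 2))
```

In practice the terminal velocity comes from the observed edge of the emission rather than from a model, and the recovered curve inherits any error in the adopted $v_c(R_0)$.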

Determining $v_c(R_0)$ is extremely difficult and we shall defer full 
discussion of this problem to §10.6. However, we have already seen that $v_c(R_0)$ is 
in principle simply the amplitude of the sine curve that forms one boundary 
of the occupied band in Figure 9.3. We shall discuss this idea further in 
§9.2.1. 


Radii and distances from the $(l,v)$ plot Equation (9.6) provides 
a means of determining the circular-speed curve $v_c(R)$ without any prior 
knowledge of the distances to the material that fills the $(l,v)$ plot from 
which the circular-speed curve is determined. After we have determined 
the circular-speed curve and therefore the function $\Omega(R)$, we can hope to 
infer the radius of material observed at any given value of $v_{\rm los}$ by solving 
equation (9.3) for $r$. Since $\Omega(R)$ is almost certain to be a monotonic function, 
$r$ is in principle uniquely determined by $(l, v_{\rm los})$. Of course, deviations 
of the measured values of $v_{\rm los}$ from the true values will lead to errors in the 
values of $r$ inferred from these measurements. The resulting errors in $r$ are 
liable to be large for material that lies outside the solar circle, because in 
this region an infinite range in $r$ corresponds to a finite, and, at many values 
of $l$, narrow range in $v_{\rm los}$. Consequently a small error in the measured value 
of $v_{\rm los}$ will lead to a large error in the recovered value of $r$. Inside the solar 
circle the ranges in $r$ and $v_{\rm los}$ are both finite, so the fractional error in $r$ will 
not be large if $v_{\rm los}$ is accurately known. In this connection, it should be noted 
that the error in $v_{\rm los}$ is likely to be dominated by the effects of non-circular 
velocities to be discussed below, rather than by inaccuracies in determining 
the central frequency of the observed line profile. 
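
Solving equation (9.3) for the radius can be sketched in Python with a simple bisection. The sketch assumes that $\Omega(R)$ decreases monotonically outwards; the function name, the search range and the flat rotation curve used in the example are illustrative choices rather than part of the method described above.

```python
import math

def radius_from_velocity(l_deg, v_los, v_c, R0, r_max_factor=50.0, tol=1e-10):
    """Solve v_los = [Omega(R) - Omega(R0)] R0 sin l for R by bisection."""
    s = math.sin(math.radians(l_deg))
    if abs(s) < 1e-12:
        raise ValueError("no kinematic information where sin l = 0")
    omega = lambda r: v_c(r) / r
    target = omega(R0) + v_los / (R0 * s)          # required value of Omega(R)
    lo, hi = 1e-6 * R0, r_max_factor * R0          # assumed search range
    if not (omega(hi) <= target <= omega(lo)):
        raise ValueError("velocity outside the range allowed by the model")
    while hi - lo > tol * R0:
        mid = 0.5 * (lo + hi)
        if omega(mid) > target:                    # Omega too large: move outwards
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

flat = lambda r: 1.0                               # unit circular speed at all R
print(radius_from_velocity(30.0, 1.0 / 3.0, flat, 1.0))   # inside the solar circle
print(radius_from_velocity(30.0, -0.40, flat, 1.0),
      radius_from_velocity(30.0, -0.45, flat, 1.0))       # outside the solar circle
```

In the second example a change of only 0.05 in $v_{\rm los}$ doubles the recovered radius, which illustrates why radii outside the solar circle are so sensitive to velocity errors. Strongly forbidden velocities raise an error because no radius in the model can produce them.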

Once we have determined $r$ as described above, we often wish to go on 
to determine the distance $d$ of the observed material from the Sun. Any 
given line of sight intersects rings that lie outside the solar circle just once. 
So $d$ follows immediately from $r$ when the observed material lies in the bottom 
left or top right quarters of Figure 9.3. By contrast, any given line of sight 
intersects rings that lie inside the solar circle either twice or not at all (see 
Figure 9.5). Consequently, when material lies in the top left or bottom right 
quarters of Figure 9.3, two values of $d$ are possible. It is sometimes possible 
to resolve the ambiguity between these values by seeing how far the emission 
in question extends in $|b|$: if the nearer value of $d$ is the correct one, the 
material will subtend a bigger solid angle at the Sun, and therefore extend 
further in $|b|$ than if the larger value of $d$ is correct (Schmidt 1957). 
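
The distance ambiguity follows from the law of cosines, $r^2 = R_0^2 + d^2 - 2R_0 d\cos l$, which gives $d = R_0\cos l \pm (r^2 - R_0^2\sin^2 l)^{1/2}$. The Python sketch below returns the physically allowed roots; the function name is illustrative and distances are in units of $R_0$ in the examples.

```python
import math

def distances(l_deg, r, R0):
    """Heliocentric distances at which the line of sight at longitude l meets radius r."""
    l = math.radians(l_deg)
    disc = r**2 - (R0 * math.sin(l))**2
    if disc < 0:
        return []                                  # the line of sight misses the ring
    root = math.sqrt(disc)
    candidates = [R0 * math.cos(l) - root, R0 * math.cos(l) + root]
    return [d for d in candidates if d > 0]        # keep points in front of the Sun

print(distances(30.0, 0.6, 1.0))    # inner ring: near and far distances
print(distances(30.0, 1.5, 1.0))    # outer ring: a single distance
print(distances(60.0, 0.5, 1.0))    # ring smaller than the tangent radius: none
```

At the tangent point itself the two roots coincide, so the ambiguity disappears for material at the terminal velocity.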


9.1.2 Non-circular motion and the $(l,v)$ plot 


We shall see that observed $(l,v)$ plots deviate significantly from the naive 
$(l,v)$ plot described above, so we must now look at the implications for the 
$(l,v)$ plot of more complex motions in the disk than simple circular rotation. 
There are four topics to discuss: (i) axisymmetric radial expansion; (ii) oval 
distortions; (iii) spiral structure; (iv) random motions. Since we know that 
many disk galaxies are barred and/or display spiral structure, the motivation 
for discussing topics (ii) and (iii) is self-evident. Similarly, it is obvious that 
gas clouds must have at least small random motions, so it is clear that 
we must understand how these affect the (l,v) plot. The motivation for 
discussing topic (i), axisymmetric expansion, is rather different: this is the 
simplest possible deviation from perfect circular motion and serves as a useful 
introduction to the other topics. Moreover, on account of its simplicity, it 
has often been suggested as an explanation of structures in observed (l, v) 
plots (e.g., Uchida, Morris & Bally 1994). 


Axisymmetric expansion We have seen that a circularly-rotating ring 
interior to the solar circle traces a section of a sine curve in the (l, v) plot. 
Now let this ring be expanding radially as well as rotating. The added expansion 
velocity runs perpendicular to our line of sight at the tangent points, and 
hence the observed values of $v_{\rm los}$ at these points are unaffected by expansion. 
Thus the end points of the ring's trace in the (l, v) plot are unchanged by the 
addition of expansion. By contrast, the line of sight through the Galactic 
center cuts the ring at points at which the added expansion velocity runs 
parallel to the line of sight. At the nearer intersection of the line $l = 0$ with 
the ring, the expansion velocity will cause $v_{\rm los}$ to be negative, while at the 
further intersection of this line with the ring, expansion causes $v_{\rm los}$ to become 
positive. Along lines of sight either side of $l = 0$ the single value of $v_{\rm los}$ 
expected for pure circular motion is similarly split into two values, the larger 
one deriving from the further intersection of the ring and the line of sight. 
Arguing from continuity, it is now clear that expansion causes the section of 
a sine curve predicted by circular rotation to open up into a curved loop (see 
Figure 9.6). 

Figure 9.7 In the epicycle approximation [equations (9.7)] an elliptical 
orbit (large full curve) is formed by adding anti-clockwise motion on an 
elliptical epicycle to clockwise motion around the dotted circle of radius 
$R_g$. The small full curve shows the location of the epicycle at $t = 0$; 
the center of this ellipse moves clockwise around the circle as the 
object moves anti-clockwise around the epicycle. 

Figure 9.8 The trace in the (l, v) plot of the elliptical orbit shown in 
Figure 9.7 when viewed down the sloping dotted line in that figure. 
The distance from the viewing point to the center of the orbit is six times 
the radius of the dotted circle in Figure 9.7. 

If an expanding ring is larger than the solar circle, its expansion is 
immediately apparent because it makes the average value of $v_{\rm los}$ around the ring 
positive (see Problem 9.1). 
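
The splitting of the sine curve into a loop can be checked numerically. The sketch below is illustrative only: the function name `ring_vlos`, the flat circular-speed curve, and the defaults `R0 = 8.5` kpc and `vc = 220` km/s are assumptions made here, not values taken from the text. For a ring of radius R < R0 it adds the line-of-sight component of a radial expansion speed `v_exp` to the circular-rotation velocity at the near and far intersections.

```python
import math

def ring_vlos(l_deg, R, v_exp, R0=8.5, vc=220.0):
    """Line-of-sight velocities (near, far) in km/s at longitude l_deg for a
    ring of radius R (kpc) that rotates at speed vc and expands at v_exp.

    Assumes a flat rotation curve, Omega(R) = vc / R (an illustrative choice).
    Returns None if the line of sight misses the ring.
    """
    l = math.radians(l_deg)
    s = R0 * math.sin(l) / R               # sine of the angle at the ring
    if abs(s) > 1.0 + 1e-9:
        return None
    # Circular rotation: [Omega(R) - Omega(R0)] R0 sin l.
    v_rot = vc * (R0 / R - 1.0) * math.sin(l)
    # Projection of the outward radial direction on the line of sight:
    # -proj at the near intersection, +proj at the far one, 0 at the tangent point.
    proj = math.sqrt(max(0.0, 1.0 - s * s))
    return v_rot - v_exp * proj, v_rot + v_exp * proj
```

At l = 0 the two values are -v_exp and +v_exp, and at the tangent point they coincide with the pure-rotation value, which reproduces the behavior argued from continuity above.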


Oval distortions We now suppose that the ring under observation is oval. 
The ellipticity of the ring is obviously liable to change the angle it subtends 
on the sky, but more important than this change in angle are the changes in 
velocity that are associated with the ellipticity. Clearly material circulating 
on the ring has to move in and out as it goes around, but the speed with 
which it circulates around the ring also varies from one point to another, and 
we have to draw on dynamical theory to get these changes right. 

We treat the problem in the weak-bar approximation discussed in §3.3 
of BT. In this approximation, the gravitational potential, $\Phi(R, \phi)$, in which 
objects orbit is the sum of two terms, a dominant axisymmetric term, $\Phi_0(R)$, 
and a weaker barred term of the form $\Phi_b(R)\cos[2(\phi - \Omega_b t)]$. Here $\Phi_b$ 
determines the strength of the bar and $\Omega_b$ is the pattern speed of the bar; that 
is, the angular velocity with which the barred potential rotates like a rigid 
body. The coordinates in position and velocity of an orbiting object are, 
like the potential, written as sums of two terms, one due to $\Phi_0$ and one due 
to $\Phi_b$. In the frame that rotates with the figure of the potential, equations 
(3-120a) and (3-117) of BT yield for motion of the type of interest here 


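$$
\begin{aligned}
R &= R_g + C\cos[2(\Omega - \Omega_b)t], \\
\phi &= -(\Omega - \Omega_b)t + \frac{D}{R_g}\sin[2(\Omega - \Omega_b)t], \\
v_R &= -2(\Omega - \Omega_b)C\sin[2(\Omega - \Omega_b)t], \\
v_\phi &= R\dot\phi \simeq -(\Omega - \Omega_b)\{R_g + (C - 2D)\cos[2(\Omega - \Omega_b)t]\},
\end{aligned}
\tag{9.7}
$$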


where C and D are positive constants determined by the potential. The 
terms in equations (9.7) that are independent of C and D describe clockwise 
motion around a circle of radius $R_g$ at angular velocity $\Omega - \Omega_b$ in the 
rotating frame. The point $R = R_g$, $\phi = -(\Omega - \Omega_b)t$ is called the guiding 
center of the orbit. The terms in 
equations (9.7) that depend on C or D describe anti-clockwise motion around 
an elliptical epicycle. The orbit is a compound of clockwise motion around 
a circle and anti-clockwise motion about the epicycle, which is centered on 
the guiding center and has principal axes aligned with the local radial and 
tangential directions (see Figure 9.7). The orbit is at pericenter (closest to 
the Galactic center) when the epicycle displaces the object inwards, and at 
apocenter (furthest from the Galactic center) when the epicycle displaces 
the object outwards. Since the epicycle is traversed in the opposite sense to 
the circle, the contributions to $v_\phi$ from circle and epicycle add at pericenter 
and subtract at apocenter. 

The axis ratio of the epicycle is determined by the constants C and D 
and thus by the potential. The details need not bother us here; we need 
only note that usually D > C, with the result that the epicycle is elongated 
in the tangential rather than in the radial direction — see Figure 9.7. In 
consequence, the ellipticity of the orbit affects $v_\phi$ more than it does $v_R$. 
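
Equations (9.7) are simple enough to evaluate directly. The following sketch is an illustration under assumed inputs: the function name `epicycle_state` and any numerical values of C, D, the angular velocity and the pattern speed fed to it are hypothetical. It returns R, phi, v_R and v_phi at time t in the rotating frame, so that the statement about pericenter and apocenter can be verified.

```python
import math

def epicycle_state(t, Rg, C, D, Omega, Omega_b):
    """Evaluate equations (9.7) at time t in the frame rotating with the bar.

    Returns (R, phi, v_R, v_phi). The constants C and D are taken as given;
    in practice they follow from the potential.
    """
    w = Omega - Omega_b                     # angular velocity in the rotating frame
    phase = 2.0 * w * t
    R = Rg + C * math.cos(phase)
    phi = -w * t + (D / Rg) * math.sin(phase)
    v_R = -2.0 * w * C * math.sin(phase)
    v_phi = -w * (Rg + (C - 2.0 * D) * math.cos(phase))
    return R, phi, v_R, v_phi
```

With these expressions the orbit is at apocenter at t = 0 and at pericenter when the phase equals pi; provided 2D > C, the magnitude of v_phi is larger at pericenter than at apocenter.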

The effect of the epicycle upon the (l, v) plot will clearly depend on 
the angle $\phi_0$ between the Sun-center line and the major axis of the orbit. 
If this is 0 or 90°, the epicycle merely changes the extent of the orbit's 
trace in the (l, v) plot; if $\phi_0 = 0$, the trace extends to smaller values of $l$ 
and larger values of $v_{\rm los}$ than if the orbit were circular, and vice versa if 
$\phi_0 = 90°$. For general viewing angle $\phi_0$, the trace becomes a closed curve 
that is reminiscent of a cross-section through an aerofoil (see Figure 9.8). For 
the viewing direction shown by the dotted line in Figure 9.7 ($\phi_0 = 20°$), the 
lower tangent point generates the blunt end of the aerofoil, while the upper 
tangent point generates the curve's sharp end. For $\phi_0 \simeq 70°$ the positions 
of the sharp and blunt ends of the curve in the (l, v) plot are reversed. 

Figure 9.9 Spiral arms are formed when the directions of the long axes 
of oval orbits described by equations (9.7) change systematically with radius. 
Here the angle between the horizontal and an oval's long axis is 
proportional to the logarithm of the length of the oval's long axis. The 
circular speed is assumed independent of radius and the peak value of $v_R$ 
is for any star 10% of $v_\phi$ in the rotating frame of the spiral. The blob 
at the bottom marks the location of the 'sun' at which the (l, v) plot of 
Figure 9.10 would be obtained. One of the spiral arms is decorated with a 
series of triangles. 


Spiral structure The central idea of the theory of spiral structure is that 
spiral arms will form if stars and gas clouds move on elliptical orbits whose 
major-axis positions vary smoothly as a function of radius. Figure 9.9 illus- 
trates this idea, which is discussed in detail in §6.2 of BT. Figure 9.9 shows 
a number of oval orbits similar to that of Figure 9.7. Each oval should be 
imagined to be uniformly populated with gas clouds. Because the directions 
of the long axes of the ovals rotate by more than 540° from the smallest to 
the largest ovals, the ovals are in some places much closer to one another 
than in other places. If each oval were uniformly populated by gas clouds, 
the surface density of clouds would be higher where the ovals approach one 
another than elsewhere. In Figure 9.9 it is apparent that the crowded parts 
of the disk form two spiral arms. 
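
The crowding of nested ovals can be illustrated with a toy calculation. In the sketch below everything numerical is an assumption made for illustration: the axis ratio `q = 0.9`, the winding constant `k = 4` (long-axis angle equal to k ln a), and the function names are not taken from the text or from Figure 9.9. It measures the radial gap between two neighboring ovals as a function of azimuth, in units of the gap that concentric circles would have.

```python
import math

def oval_radius(theta, a, q=0.9, k=4.0):
    """Radius at azimuth theta of an ellipse with semi-major axis a, axis
    ratio q, and long axis at position angle k * ln(a) (assumed winding law)."""
    u = theta - k * math.log(a)
    return a * q / math.sqrt((q * math.cos(u)) ** 2 + math.sin(u) ** 2)

def relative_spacing(theta, a, eps=1e-4, q=0.9, k=4.0):
    """Radial gap between the ovals with semi-major axes a and a*(1+eps),
    divided by a*eps, the gap for circles. Values below 1 mean crowding."""
    gap = oval_radius(theta, a * (1.0 + eps), q, k) - oval_radius(theta, a, q, k)
    return gap / (a * eps)
```

Because the spacing depends on azimuth only through an angle that enters with period pi, the crowded regions repeat twice per turn, which is the origin of the two arms; with these assumed parameters the gap ranges from roughly half to well above the value for circles.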

When we use the epicycle approximation to model the kinematics of a 
disk of gas that has a well-defined spiral pattern, we obtain an (l, v) plot like 
that of Figure 9.10. Each oval in Figure 9.9 yields a long, thin, diagonally 
oriented trace in Figure 9.10. As one progresses from small ovals to large 
ones, the ovals' traces in the (l, v) plot vary from nearly vertical to nearly 
horizontal as their extents in $l$ increase and their extents in $v_{\rm los}$ decrease. 
Simultaneously, the elongation of the trace varies, in the same way that the 
shape of the trace in Figure 9.8 varies with $\phi_0$. In Figure 9.10 variation in 
the shapes of the (l, v) traces leads to crowding along curves that fan out 
from the origin. Along some of these crowded curves triangles are plotted. 
These mark the images in the (l, v) plot of points in Figure 9.9 that lie on 
one of the spiral arms. Thus in the real world, material that is seen on a 
ridge in the (l, v) plot lies on a spiral arm. Notice that in Figure 9.10 there 
is a bump in the terminal-velocity curve wherever the latter is touched by 
one of these ridges. 

Figure 9.10 The (l, v) plot obtained by viewing the disk shown in Figure 9.9 from the blob 
at the bottom of that figure. The spiral arms in Figure 9.9 give rise to strong crowding in 
the (l, v) plot along curves which fan out from the origin. Triangles mark the images in 
this (l, v) plot of the points in Figure 9.9 that are also marked by triangles. 


When observational (l, v) plots first became available, considerable effort 
went into tracing the Milky Way's spiral arms as follows (e.g., Kerr & 
Westerhout 1965). Crowded regions in the (l, v) plot similar to those seen 
in Figure 9.10 were identified as spiral arms. These arms were mapped back 
into real space by finding the real-space points that would map to them if the 
disk were in circular rotation. However, spiral structure affects the velocities 
of clouds just as much as their real-space coordinates, so the existence of 
spiral structure in the disk is incompatible with material in the disk being 
on circular orbits. Burton (1971) pointed out that the assumption of circular 
orbits can lead to serious errors in the real-space positions assigned to 
the clouds that are responsible for emission at (l, v). These errors are most 
serious when the cloud lies near the tangent point, because then the value 
of $v_{\rm los}$ in circular rotation varies very slowly with distance down the line of 
sight, with the consequence that a large displacement in real space will be 
required to compensate for a small contribution to $v_{\rm los}$ from spiral structure. 
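
The sensitivity of the inferred position to small velocity perturbations near the tangent point can be estimated to first order. The sketch below is illustrative: the function names, the flat rotation curve, and the defaults `R0 = 8.5` kpc and `vc = 220` km/s are assumptions made here. It converts a velocity perturbation dv into a distance error by dividing by the numerical gradient of the circular-rotation velocity along the line of sight.

```python
import math

def vlos_circular(d, l_deg, R0=8.5, vc=220.0):
    """Circular-rotation line-of-sight velocity (km/s) of gas at distance d
    (kpc) along longitude l_deg, for an assumed flat rotation curve."""
    l = math.radians(l_deg)
    R = math.sqrt(R0 * R0 + d * d - 2.0 * R0 * d * math.cos(l))
    return vc * (R0 / R - 1.0) * math.sin(l)

def distance_shift(d, l_deg, dv, step=1e-3, R0=8.5, vc=220.0):
    """First-order error in the inferred distance (kpc) caused by a velocity
    perturbation dv (km/s): dv divided by the local gradient d(v_los)/dd."""
    grad = (vlos_circular(d + step, l_deg, R0, vc)
            - vlos_circular(d - step, l_deg, R0, vc)) / (2.0 * step)
    if grad == 0.0:
        return math.inf                 # exactly at the tangent point
    return dv / grad
```

At l = 30° the tangent point lies at d = R0 cos l, about 7.4 kpc under these assumptions; a 5 km/s perturbation displaces a cloud at d = 7 kpc several times further than one at d = 3 kpc.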

This argument indicates that spiral arms detected in (l, v) plots need to 
be modeled dynamically as in Figure 9.10, rather than with the simple kinematic 
approach described above. In particular, the density contrast between 
arm and inter-arm regions is greatly over-estimated by naive kinematic modeling, 
as is hinted at by Figures 9.9 and 9.10; the amplitude of the arms is 
larger in the (l, v) plot than in real space, which suggests that the velocity 
shifts associated with spiral structure tend to enhance the strengths of arms 
in (l, v) plots. 

In the light of this discussion, it is surprising to discover that naive 
kinematic modeling leads to accurate maps of the ridge-lines of arms in real 
space. Figure 9.11 illustrates this result by showing the contribution to $v_{\rm los}$ 
from spiral structure for some of the points in Figure 9.10: each triangle 
indicates the location in the (l, v) plot of some point in the disk when spiral 
structure is present, while the corresponding locations in the absence of spiral 
structure lie at the further ends of the lines that are attached to the triangles. 
Many (but not all) of the points that lie along spiral arms appear to have no 
attached lines because the true value of $v_{\rm los}$ differs negligibly from the value 
for circular rotation. Hence the real-space positions of these points can be 
accurately recovered under the assumption of circular rotation, and, when 
thus recovered, will faithfully trace the spiral arms in real space. 

In Figure 9.11 there are in the neighborhood of each spiral arm many 
points that have long lines attached, indicating that for the associated clouds 
the true value of $v_{\rm los}$ differs significantly from that associated with circular 
orbits. Consequently, the real-space locations of these clouds cannot be accurately 
recovered under the assumption of circular rotation. It turns out that 
the errors in their positions made by assuming they are on circular orbits 
nudge the recovered real-space points towards the ridges of the nearby arms, 
thus enhancing the recovered arm/inter-arm contrast as mentioned above. 


Random motions From §8.2 we know that clouds do not move on per- 
fectly circular orbits because the clouds at any given location have velocities 
that spread over a few km s$^{-1}$. Moreover, within a given cloud there is a 
spread in the velocities of individual H atoms. Qualitatively it is easy to 
see that this velocity spread will modify (l, v) plots by smoothing them in 
velocity. Such smoothing is potentially important wherever the brightness 
temperature is varying rapidly with velocity, as it does, for example, near 
the terminal velocity. 

For simplicity we assume that in each small volume $d^3\mathbf{x}$ of the disk the 
distribution of atomic velocities is Maxwellian; the volume should be large 
enough to encompass several clouds but small enough to be unresolved by 
the telescope. The number of atoms in the volume element with velocities 
in $d^3\mathbf{v}$ is then 

$$
d^6N = \frac{n(\mathbf{x})\,d^3\mathbf{x}\,d^3\mathbf{v}}{(2\pi\sigma^2)^{3/2}}
\exp\left(-\frac{|\mathbf{v} - \mathbf{V}|^2}{2\sigma^2}\right). \tag{9.8}
$$

Here $\sigma$ is the atoms' velocity dispersion (which may in principle be a function 
of $\mathbf{x}$) and $\mathbf{V}(\mathbf{x})$ is the mean velocity of the atoms in $d^3\mathbf{x}$. Now if $\mathbf{V}(\mathbf{x})$ is the 
circular velocity at $\mathbf{x}$, we know by equation (9.3) that an atom that moves 
with this velocity has line-of-sight velocity 

$$
v_{\rm los}^{(0)}(\mathbf{x}) = \{\Omega[R(\mathbf{x})] - \Omega(R_0)\}\,R_0 \sin l. \tag{9.9}
$$

Figure 9.11 The triangles show the locations of a selection of 
points on the curves of Figure 9.10. If the disk were in pure circu- 
lar rotation, material at the corresponding real-space points would 
have the velocities that lie at the ends of the lines that are attached 
to each triangle. Thus the length of each line gives an indication 
of the magnitude of the error in the real-space position that one 
would obtain for that cloud by assuming it was on a circular orbit. 


It follows that an atom with a random velocity $\delta\mathbf{v} = \mathbf{v} - \mathbf{V}$ has line-of-sight 
velocity 

$$
v_{\rm los} = v_{\rm los}^{(0)}(z) + \delta v_z, \tag{9.10}
$$


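where we have oriented our coordinate system such that the x and y axes 
run perpendicular to the line of sight and the z axis runs parallel to it. We 
obtain the surface density of atoms in the distance interval $dz$ and with 
line-of-sight velocities within $d\delta v_z$ of the value of $v_{\rm los}$ given by equation (9.10), 
by dividing equation (9.8) by $dx\,dy$ and integrating the result over $d\delta v_x\,d\delta v_y$: 

$$
\begin{aligned}
d^2N[v_{\rm los}^{(0)}(z) + \delta v_z]
 &= n(z)\,dz\,d\delta v_z \int d\delta v_x \int d\delta v_y\,
    \frac{e^{-|\delta\mathbf{v}|^2/2\sigma^2}}{(2\pi\sigma^2)^{3/2}} \\
 &= dz\,n(z)\,d\delta v_z\,\frac{e^{-\delta v_z^2/2\sigma^2}}{\sqrt{2\pi\sigma^2}}.
\end{aligned}
\tag{9.11}
$$

Finally, we find the total column density of atoms with any given line-of-sight 
velocity $v_{\rm los}$ by summing the contributions $dN$ over increments in the path 
length $dz$ with $\delta v_z$ chosen such that $v_{\rm los}^{(0)}(z) + \delta v_z = v_{\rm los}$: 

$$
dN(v_{\rm los}) = dv_{\rm los} \int dz\,n(z)\,
\frac{\exp\{-[v_{\rm los} - v_{\rm los}^{(0)}(z)]^2/2\sigma^2\}}{\sqrt{2\pi\sigma^2}}, \tag{9.12}
$$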
where we have used $d\delta v_z = dv_{\rm los}$. Comparison of Figures 9.4 and 9.12 
shows for one longitude, $l = 20°$, the effect of random velocities on optical 
depth $\tau$, which, by equation (8.16), is proportional to $dN/dv_{\rm los}$: both figures 
are for the same distribution of hydrogen within the disk, but Figure 9.4 
shows $\tau$ for the case of negligible random velocity, while Figure 9.12 shows 
$\tau$ for $\sigma = 0.04\,v_c(R_0) \simeq 8.4$ km s$^{-1}$. It can be seen that random velocities 
significantly reduce the peaks in $\tau$. 
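
The smoothing described by equation (9.12) is easy to reproduce numerically. The sketch below is a minimal illustration, not the calculation behind Figure 9.12: the function name `dN_dv`, the uniform density, the flat rotation curve, the defaults `R0 = 8.5` kpc and `vc = 220` km/s, and the path length `d_max` are all assumptions made here. It evaluates the integral over z with the midpoint rule.

```python
import math

def dN_dv(v, l_deg, sigma, n=1.0, R0=8.5, vc=220.0, d_max=40.0, steps=2000):
    """dN/dv_los at line-of-sight velocity v (km/s), from equation (9.12),
    for a uniform density n along a path of length d_max (kpc).

    Assumes a flat rotation curve; the z integral uses the midpoint rule.
    """
    l = math.radians(l_deg)
    dz = d_max / steps
    norm = 1.0 / math.sqrt(2.0 * math.pi * sigma * sigma)
    total = 0.0
    for i in range(steps):
        z = (i + 0.5) * dz
        R = math.sqrt(R0 * R0 + z * z - 2.0 * R0 * z * math.cos(l))
        v0 = vc * (R0 / R - 1.0) * math.sin(l)     # equation (9.9)
        total += n * norm * math.exp(-((v - v0) ** 2) / (2.0 * sigma * sigma)) * dz
    return total
```

Two properties are worth checking with such a sketch: integrating dN/dv over all velocities recovers the total column n times d_max, whatever the value of sigma, and the peak near the terminal velocity is lower for the larger dispersion.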

In the limit $\sigma \to 0$, the fraction in equation (9.12) tends to a Dirac 
$\delta$-function. We can then use a standard formula² for a $\delta$-function of a function 
to evaluate the integral over z: 

$$
\frac{dN(v_{\rm los})}{dv_{\rm los}} =
\sum_{z_i\ {\rm s.t.}\ v_{\rm los}^{(0)}(z_i) = v_{\rm los}}
\frac{n(z_i)}{|dv_{\rm los}^{(0)}/dz_i|}. \tag{9.13}
$$

Figure 9.3 was plotted by substituting values of N from this equation into 
equation (8.20). The presence of $|dv_{\rm los}^{(0)}/dz_i|$ in the denominator of equation 
(9.13) has the important implication that points at which the velocity gradient 
is small can make very large contributions to N. At the tangent point, 
$|dv_{\rm los}^{(0)}/dz_i|$ actually vanishes and equation (9.13) fails. Hence, the optical 
depth near the tangent velocity is sensitive to the magnitude of the random 
velocities in the gas, no matter how small these are. 
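
The cold-gas limit, equation (9.13), can be evaluated in closed form when the rotation curve is simple. The sketch below is illustrative only: the name `cold_dN_dv`, the flat rotation curve, the uniform density, and the defaults `R0 = 8.5` kpc and `vc = 220` km/s are assumptions made here. It returns one term n/|dv/dz| for each point on the line of sight whose circular-rotation velocity equals v, for longitudes that are not multiples of 180°.

```python
import math

def cold_dN_dv(v, l_deg, n=1.0, R0=8.5, vc=220.0):
    """Terms n(z_i) / |dv0/dz| of equation (9.13) at velocity v (km/s), one
    per point z_i on the line of sight with v0(z_i) = v.

    Assumes a flat rotation curve and uniform density n. Valid for longitudes
    with sin(l) != 0. Returns [] where no point matches, including exactly at
    the terminal velocity, where the formula fails.
    """
    l = math.radians(l_deg)
    sin_l, cos_l = math.sin(l), math.cos(l)
    ratio = v / (vc * sin_l) + 1.0          # R0 / R, from equation (9.9)
    if ratio <= 0.0:
        return []
    R = R0 / ratio
    disc = R * R - (R0 * sin_l) ** 2
    if disc <= 0.0:
        return []
    root = math.sqrt(disc)
    terms = []
    for z in (R0 * cos_l - root, R0 * cos_l + root):
        if z > 0.0:
            grad = vc * R0 * abs(sin_l) * abs(z - R0 * cos_l) / R ** 3
            terms.append(n / grad)
    return terms
```

At l = 20° this gives two terms for 0 < v below the terminal velocity, one term for v < 0, and none beyond the terminal velocity; the sum of the terms grows without bound as v approaches the terminal velocity, which is the failure of equation (9.13) noted above.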


² For any function f(x) which never touches the x axis we have 
$\delta[f(x)] = \sum_{x_i\ {\rm s.t.}\ f(x_i) = 0} \delta(x - x_i)/|f'(x_i)|$. 


9.2 The large-scale distribution of HI and CO 


The disk of the Milky Way falls naturally into three parts: the outer disk at 
$R > R_0$; the middle disk, 3 kpc $< R < R_0$; and the central disk, $R < 3$ kpc. 
The central disk, which is complex and only partially understood, is discussed 
in §9.4. In this section we focus on the middle and outer parts of the disk. 

The main distinction between the middle and the outer disk relates 
to the techniques used to study them: at $R < R_0$ we concentrate on the 
neighborhood of the terminal-velocity curve in the (l, v) plane, and thus 
study the disk primarily at the tangent points. At $R > R_0$ there are no 
analogs of tangent points, and we are heavily reliant on the rather uncertain 
determination of the Milky Way’s circular-speed curve. 

However, besides this operational distinction, there prove to be objective 
differences between the middle and the outer disk. The Sun happens to lie 
near the outer edge of the part of the disk that contains most of the Milky 
Way’s molecular gas and luminous star formation. Also, we shall see that 
inside the solar circle the disk is extremely flat, and by $1.5R_0$ it is significantly 
warped. 


9.2.1 The 21-cm line in emission 


The HI in the Milky Way has been surveyed many times in the last half- 
century. Table 9.1 lists the characteristics of the surveys that are of more 
than historical interest. Since aperture-synthesis telescopes are insensitive to 
faint, diffusely-distributed emission, all these surveys have been done with 
single-dish instruments. It would, moreover, be extremely time-consuming 
and of limited scientific interest to map the entire Milky Way with the angular 
resolution characteristic of the largest single-dish instruments. Consequently, 
these surveys have employed relatively small dishes of diameter ~ 25m. The 
angular resolution of the surveys is thus of order a degree, with the conse- 
quence that they each contain in excess of ten thousand resolution elements 
and require of order one year of observing time for their construction. 

An ideal radio telescope would have a simple and compact ‘beam,’ or 
pattern of sensitivity on the sky. That is, when pointed in a given direction, 
it would respond sensitively to a source that lay in a narrow cone about that 
direction, and be highly insensitive to sources elsewhere. Unfortunately this 
ideal is hard to realize in practice, and most real telescopes have complex 
beams that vary significantly with the orientation of the telescope with re- 
spect to the ground, surrounding buildings, and so on. Radio astronomers 
frequently adopt a simple model of a telescope’s beam and then consider the 
telescope’s response to sources that lie outside this beam to arise from ‘leak- 
age’ of radiation into the beam. At low flux levels the uncertainties in most 
of the surveys listed in Table 9.1 are dominated by the leakage of radiation 
into the adopted telescope beam rather than by thermal noise in the tele- 
scope and receiver. That is, the quality of the published data is determined 

Table 9.1 HI surveys of the Milky Way 

Survey               Δθ     l                    b                    v (km s^-1)       T_min 

Weaver & Williams    36′    10° to 250°          −10° to 10°          −100 to 100       1.7 K 
(1973, 1974)                Δl = 0.5°            Δb = 0.25°           Δv = 2.1 (6.3) 

Heiles & Habing      36′    all l at δ > −30°    all b at |b| > 10°   −92 to 75         1.2 K 
(1974)                      Δl = 0.3°/cos b      Δb = 0.6°            Δv = 2.1 (6.3) 

Bania & Lockman      4′     32° to 64°           −3° to 3°            −250 to 250       0.6 K 
(1984)                      Δl = 1.3°            Δb = 2′              Δv = 1.03 

Burton (1985)        21′    all l at δ > −46°    all b at |b| > 20°   −250 to 250       0.2 K 
                            Δl = 1°              Δb = 1°              Δv = 1 (2.1) 

Kerr et al. (1986)   48′    240° to 350°         −10° to 10°          −150 to 150       0.8 K 
                            Δl = 0.5°            Δb = 0.25°           Δv = 2 

Stark et al. (1992)  150′   all l at δ > −30°    all b                −300 to 300       0.1 K 
                            Δl = 0.5°/cos b      Δb = 0.5°            Δv = 5.3 (2.1) 

Leiden-Dwingeloo     36′    all l at δ > −30°    all b                −460 to 460       0.07 K 
                            Δl = 0.5°/cos b      Δb = 0.5°            Δv = 1 (2.1) 

NOTES: Δθ gives the beam diameter and T_min the sensitivity of each survey. 
SOURCE: Adapted from Burton (1992) 


by the accuracy with which it has been possible to determine the telescope’s 
beam and thus to determine where the radiation that was detected was ac- 
tually coming from. This problem of beam determination is most acute in 
directions in which the HI column density is small, but which are close to 
directions in which the column density is large. The survey of Stark et al. 
(1992), which was done with the horn antenna with which Penzias and Wil- 
son discovered the cosmic background radiation in 1965, is least affected by 
radiation leakage because the antenna was originally designed to have the 
simplest possible, and therefore most easily determined, beam. 

Figure 9.13 is the (l, v) plot for $b = 0$ from the Leiden-Dwingeloo-IAR 
survey of the Galaxy in the 21-cm line. We recognize several features from 
the theoretical discussion of §9.1.1. In particular, the curves which bound the 
band of significant emission in Figure 9.13 are similar in shape to those which 
bound the occupied zone of Figure 9.3. Both figures show strongly discontinuous 
behavior near $l = 0$, with the bounding curves showing marked horns 
either side of the origin: in Figure 9.13 there are horns at $v \sim 140$ km s$^{-1}$ 
near $l = 15°$ and $v \sim -150$ km s$^{-1}$ near $l = -15°$. As one moves away from 
the Galactic center, the observed terminal velocities fall steeply from the values 
associated with the horns to near zero at $l = \pm 90°$, just as expected from 
Figure 9.3. Moreover, in Figure 9.13 the boundary at ($l > 0$, $v < 0$) and at 
($l < 0$, $v > 0$) can be approximated by a sine curve, just as in Figure 9.3. 

Quantitatively, there are important differences between Figures 9.13 
and 9.3. For example, in Figure 9.13 the horns of emission at small values 
of $|l|$ are much fainter, relative to the peak intensity at, say, 30°, than 
are the corresponding horns in Figure 9.3. The obvious interpretation of 
the faintness of the horns in Figure 9.13 is that there is very little HI at 
radii $R < 3$ kpc, which correspond to $|l| < 20°$. Figure 9.3, by contrast, was 
calculated for a disk in which HI continues to the very center. 

Figure 9.13 Brightness temperature arising from the 21-cm line in the Galactic plane 
($b = 0$), plotted as a function of Galactic longitude, $l$, and line-of-sight velocity, $v$. The 
data come from the Leiden-Dwingeloo Survey (Hartmann & Burton 1997) and the 
complementary southern hemisphere IAR survey by Bajaja et al. They were kindly provided 
by D. Hartmann. 

Another significant quantitative difference between Figures 9.13 and 9.3 
concerns the amplitudes of the sine curves that best fit the boundaries of the 
band of emission at ($l > 0$, $v < 0$) and at ($l < 0$, $v > 0$). In Figure 9.3 
this amplitude is ~ 85% of the circular speed at the solar radius, whereas in 
Figure 9.13 it is ~ 130 km s$^{-1}$, which is only ~ 60% of the local circular speed 
given in Table 10.7. Does this unexpectedly small value indicate that the 
observations have not been pushed to sufficiently high sensitivity to reveal 
faint emission beyond the apparent boundary in Figure 9.13, or is there really 
negligible emission beyond a sine curve of amplitude ~ 130 km s$^{-1}$? Knapp, 
Tremaine & Gunn (1978) have investigated this question and concluded that 
there really is no significant emission beyond the fitted sine curve, presumably 
because the Milky Way's HI disk does not extend beyond $R \sim 21$ kpc (see 
Problem 9.4). 
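
The link between the amplitude of the bounding sine curve and the edge of the HI disk can be sketched in one line. The estimate below is an illustration under two assumptions that are not stated in the text: a flat circular-speed curve, $v_c(R) = v_c(R_0)$, and $R_0 \simeq 8.5$ kpc. With these, equation (9.9) gives for material at $R > R_0$ a sine curve whose amplitude $A$ depends only on $R_0/R$:

```latex
v_{\rm los} = \{\Omega(R) - \Omega(R_0)\}\,R_0 \sin l
            = -v_c\left(1 - \frac{R_0}{R}\right)\sin l
\quad\Longrightarrow\quad
\frac{A}{v_c} = 1 - \frac{R_0}{R_{\rm edge}},
\qquad
R_{\rm edge} = \frac{R_0}{1 - A/v_c}
             \simeq \frac{8.5\ {\rm kpc}}{1 - 0.6} \simeq 21\ {\rm kpc}.
```

Under the same assumptions an amplitude of 85% of $v_c$ would correspond to a disk extending to nearly $7R_0$.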

A significant feature in Figure 9.13 that has a counterpart in Figure 9.3 
is the ridge along $v \sim 0$ at $-90° < l < 90°$. In Figure 9.3, this ridge is 
associated with a jump in the optical depth by a factor of 2 (see Figure 9.4). 
Figure 9.5 explains the origin of this jump: for $0 < l < 90°$, two identical 
points in the disk contribute to emission at $v > 0$, whereas only one 
point contributes to emission at $v < 0$. The appearance of this jump in a 
brightness-temperature map such as Figure 9.13 demonstrates that $T_B$ depends 
strongly on the optical depth $\tau$ even near $v = 0$, where the emission is 
strongest. From this sensitivity of $T_B$ to $\tau$ it follows that in most directions 
the disk is not very optically thick in the 21-cm line. 

Figure 9.14 The curve studded with squares shows the 2 K contour that bounds the 
zone of emission in the upper-right quadrant of the (l, v) plot for 21-cm emission averaged 
over $|b| < 10°$. The other curve shows the 2 K contour that bounds the emission in the 
lower-left quadrant of the (l, v) plane after inversion through (l, v) = (0, 0). The systematic 
displacement of one curve with respect to the other is evidence that the Milky Way's outer 
HI disk is not axisymmetric. [After Blitz & Spergel (1991a) from data kindly provided by 
L. Blitz] 


The maximum brightness temperature $T_B^{\rm max}(l)$ observed in the plane 
at longitude $l$ peaks at ~ 135 K for some lines of sight near $l = 0$, and rarely 
falls below ~ 80 K. While there is considerable scatter in $T_B^{\rm max}(l)$ between 
neighboring lines of sight, large values typically occur near $l = 0$, $l = 180°$ 
and $l = \pm 75°$, where emission tends to be concentrated into small velocity 
ranges and therefore the disk is most nearly optically thick (Burton 1992). 
Since $T_B$ should approach the spin temperature $T_{\rm spin}$ defined in Box 8.3 at 
high optical depths, these observations suggest that $T_{\rm spin} \simeq 135$ K. 

If the disk were axisymmetric and in simple circular rotation, any observed 
(l, v) plot would be symmetric under inversion through the origin, 
that is under $l \to -l = 360° - l$ and $v \to -v$. Figure 9.14 reveals an 
intriguing breakdown of this symmetry. Each curve in the figure shows the 
location of a 2 K contour in the (l, v) plot for 21-cm emission averaged over 
$|b| < 10°$. The curve studded with squares shows the contour that bounds 
the zone of emission in the upper right part of the (l, v) plane, while that 
studded with triangles shows the result of inverting through (l, v) = (0, 0) 
the contour that bounds the zone of emission in the lower left part of the 
(l, v) plane. If the zone of emission in the (l, v) plane were symmetrical as in 
Figure 9.3, the two curves would lie on top of each other. In reality the curve 
for $l < 0$ is displaced to the left with respect to the curve for $l > 0$. Blitz 
& Spergel (1991a) have argued from this phenomenon that the outer Milky 
Way is oval, and that the LSR is currently receding from the Galactic center 
on an oval orbit. Kuijken & Tremaine (1994) argue that a more plausible 
interpretation of the data is that the outer parts of the Milky Way, like those 
of so many external galaxies, are lop-sided (cf. §8.2.3). 


Table 9.2 CO surveys of the Milky Way 

Survey           Beam    l                  b                   v (km s^-1)    Sensitivity 

Clemens et al.   0.8′    8° to 90°          −1.05° to 1.05°     −50 to 150     1.2 K 
(1986)                   Δl = 0.125°        Δb = 0.05°          Δv = 1 

Dame et al.      8.7′    −180° to 180°      −3.3° to 3.3°       −80 to 80      0.2 K 
(1987)                   Δl = 0.5°          Δb = 0.5°           Δv = 1.3 

Stark et al.     0.6′    −5° to 122°        −1° to 1°           −80 to 80      0.3 K 
(1988)                   Δl = 3′            Δb = 3′             Δv = 0.7 

SOURCE: Adapted from Burton (1992) 


Measuring the spin temperature While the estimate $T_{\rm spin} \simeq 135$ K 
obtained above is interesting, studies of 21-cm absorption in the spectra of 
extragalactic radio sources reveal that it is rather crude. From equations 
(8.18) and (8.16) we may show that the intensity of 21-cm absorption in 
the spectrum of a continuum source that is observed through a foreground 
hydrogen cloud always depends sensitively on the value of $T_{\rm spin}$ in the cloud. 
In equation (8.16) we equate $T_B(0)$ with the brightness temperature of the 
background source. Typically this temperature will be very much larger 
than the spin temperature in the ISM, with the consequence that, when the 
beam includes the background source, we may neglect the second term on the 
right-hand side of equation (8.16) and approximate the measured brightness 
temperature by 

$$
T_{\rm on} = T_{\rm source}\,e^{-\tau_\nu}. \tag{9.14}
$$

This equation, taken with the assumption that $T_{\rm source}$ is a weak function 
of $\nu$ in the neighborhood of $\nu = 1.42$ GHz, enables us to determine $\tau_\nu$ as a 
function of $\nu$. Applied to a line of sight slightly away from the background 
source along which $T_B(0) = 0$, equation (8.16) yields 

$$
T_{\rm spin} = \frac{T_B}{1 - e^{-\tau_\nu}}. \tag{9.15}
$$

If the angular size of the cloud of foreground hydrogen is much greater than 
the beam, $\tau_\nu$ will not vary significantly as we slew the beam to a position 
that just excludes the source. Hence we may use our previously determined 
values of $\tau_\nu$ in equation (9.15) to derive $T_{\rm spin}$ from the measured value of $T_B$ 
for an off-source line of sight. 

The values of $T_{\rm spin}$ that are obtained by the procedure just described are 
found to vary systematically with $\tau$: the more optically thick a line of sight 
is in the 21-cm line, the lower is the recovered value of $T_{\rm spin}$. Specifically, 
clouds with peak optical depth $\tau > 0.2$ yield 20 K $\lesssim T_{\rm spin} \lesssim$ 110 K with a 
mean value ~ 80 K. As the peak optical depth decreases towards 0.01, the 
mean value of $T_{\rm spin}$ rises to ~ 300 K (Spitzer 1978, Kulkarni & Heiles 1988). 
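
The two-step procedure of equations (9.14) and (9.15) can be written out as follows. This is a minimal sketch: the function names `optical_depth` and `spin_temperature` and any numbers passed to them are hypothetical, and the formulas carry the same assumptions as the text, namely a background source much brighter than the foreground gas and a cloud much larger than the beam.

```python
import math

def optical_depth(T_on, T_source):
    """Optical depth from the on-source measurement, equation (9.14):
    T_on = T_source * exp(-tau)."""
    return -math.log(T_on / T_source)

def spin_temperature(T_off, tau):
    """Spin temperature from the off-source brightness temperature and the
    optical depth found on source, equation (9.15)."""
    return T_off / (1.0 - math.exp(-tau))
```

For small optical depth equation (9.15) reduces to $T_{\rm spin} \simeq T_B/\tau$, so the recovered spin temperature is then only as accurate as the small measured value of $\tau$.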

Figure 9.15 An (l, v) plot of the Milky Way in the 2.6-mm line of $^{12}$CO. The grayscale 
shows the intensity of the line averaged over a ~ 4° strip centered on the Galactic plane. 
[After Dame et al. (1987), but incorporating more recent, higher sensitivity data kindly 
provided by T. Dame] 


9.2.2 CO lines in emission 


In §8.1 we saw that the most important probes of molecular gas in galaxies 
are the rotation lines of various isotopes of CO, and especially the 2.6-mm 
line of $^{12}$CO. Table 9.2 lists some important surveys of the Milky Way in this 
line. The velocity resolution and sensitivity of these surveys are comparable 
to those of the 21-cm surveys listed in Table 9.1. By contrast, the angular 
resolution of the CO surveys is significantly higher than that of the 21-cm 
surveys. This difference is a consequence of the fact that a dish of given size 
yields nearly two orders of magnitude greater angular resolution at 2.6 mm 
than at 21 cm, and good sensitivity cannot be obtained with a very small 
dish. At the high angular resolution of the CO surveys, the Milky Way 
contains so many resolution elements that observing them all would require 
an unobtainable allocation of telescope time. Hence none of the surveys listed 
in Table 9.2 covers more than a small fraction of the sky. The best coverage 
is provided by the survey of Dame et al. (1987), who went to some trouble to 
use unusually small dishes. Fortunately, CO is more strongly confined to the 
plane than HI, and it is unlikely that the Dame et al. survey missed significant 
CO emission. 
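
The scaling of angular resolution with wavelength can be checked with a diffraction estimate. The sketch below is illustrative: the function name `beam_arcmin` is hypothetical and the factor 1.22 (the first null of a uniformly illuminated circular aperture) is an assumed convention, since the beam of a real dish depends on how it is illuminated.

```python
import math

def beam_arcmin(wavelength_m, dish_diameter_m, factor=1.22):
    """Diffraction-limited beam size in arcminutes, factor * lambda / D.
    The factor 1.22 is an assumed convention, not a measured beam."""
    return math.degrees(factor * wavelength_m / dish_diameter_m) * 60.0
```

For a 25-m dish at 21 cm this gives a beam of about 35′, in line with the 36′ beams listed in Table 9.1, and for a fixed dish the ratio of the beams at 21 cm and 2.6 mm is about 80, the "nearly two orders of magnitude" quoted above.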

Figure 9.15 shows the (l, v) plot of the Milky Way in the 2.6-mm line of 
$^{12}$CO from Dame et al. (1987). The emission is more patchy than the 21-cm 
emission shown in Figure 9.13, but certain prominent features are apparent 
in both plots. In particular, in both plots the envelopes at ($l > 0$, $v > 0$) 
and at ($l < 0$, $v < 0$) decline to the origin from prominent horns, although 
the horns of the CO emission are at larger longitudes $|l| \sim 25°$, and slightly 
smaller velocities $|v| \sim 130$ km s$^{-1}$. These horns and the diagonal swath of 
emission that joins them are often interpreted as arising from a molecular 
ring of radius $R_0 \sin 25° \simeq 3.6$ kpc, which we discuss further below. 

Figure 9.16 Terminal velocities as a function of $|l|$ for HI 
(open symbols) and CO (filled symbols). Data for $l > 0$ from 
Bania & Lockman (1984) (open triangles), Weaver & Williams 
(1973, 1974) (open squares) and Malhotra (1995) (filled triangles); 
for $l < 0$ from Kerr et al. (1986) (open pentagons). The curve is a 
fit to the data by the sum of a linear function of $|l|$ and a cosine 
series in $|l|$. [From data kindly supplied by S. Malhotra] 

At positive longitudes the terminal-velocity curve in Figure 9.15 seems
to fall to zero before l = 90° is reached, although at l < 0 the
terminal-velocity curve reaches v = 0 near l = -90° as expected. There is
very little emission in the regions 90° < l < 180°, v < 0 and
-180° < l < -90°, v > 0 associated with gas that lies beyond the solar
circle, although there is emission near v = 0 at all longitudes. This
emission presumably comes from the solar circle and probably from the solar
neighborhood. What emission there is at R > R_0 seems to be associated with
the ridges of intense 21-cm emission in Figure 9.13 that were identified
with spiral arms. In a detailed study of CO emission from just outside the
solar circle, Digel et al. (1996) found that the intensity of CO emission
is at least 25 times smaller between spiral arms than in the arms.


9.2.3 The Milky Way’s circular-speed curve 


The 21-cm and CO-line data reviewed above provide the key to determining
the circular speed v_c(R), and hence the radial distribution of mass, in the
Milky Way. In §9.1.1 we saw that for R < R_0 we determine v_c(R) by
estimating the terminal velocities v_los^(t)(l)³ from the (l,v) plot and
inserting them into equation (9.6). The terminal velocities determined from
21-cm and CO line data are in good general agreement with one another.
Figure 9.16 shows the quality of this agreement: both the general trend and
small-scale variations in v_los^(t) with l are reproduced in both the HI and
CO data. The small-scale fluctuations of v_los^(t) about its smooth
underlying trend with l have amplitude of order 7 km s^-1. This value may be
considered to be an estimate of the influence of the Milky Way’s spiral
structure on orbital velocities.

3 See Celnik, Rohlfs & Braunsfurth (1979) and Malhotra (1995) for details of
how v_los^(t)(l) can be estimated from the data.

The differences in Figure 9.16 between the values of v_los^(t)(l) for HI and
CO at fixed l are distributed as a Gaussian centred on zero and with a
dispersion of 4.1 km s^-1 (Burton 1992). This value provides an estimate of
the dispersion in the random tangential velocities of interstellar clouds.

The values of v_los^(t) determined at ±l generally agree to better than
10 km s^-1 (Kerr & Rogers 1964, Blitz & Spergel 1991a). While more accurate
agreement is not expected if each side is independently perturbed by
spiral structure of amplitude 7 km s^-1, it is perhaps surprising that there
is a tendency for v_los^(t) to be systematically lower for l in the range
(-30°, -50°) than in the corresponding range of positive longitudes.
Table 9.3 gives mean values of v_los^(t) for 0 < |l| < 90°, and Figure 9.17
plots the resulting circular-speed curves for two values of v_c(R_0). Notice
that the larger the assumed value of v_c(R_0), the larger is the predicted
gradient dv_c/dR at the Sun. This behavior reflects the fact that equation
(9.6) expresses v_c as the sum of a declining function of l and a rising
function of l whose coefficient is v_c(R_0).

Table 9.3. Mean terminal velocities from 21-cm data

l                      15°   20°   30°   40°   50°   60°   70°
v_los^(t) / km s^-1    148   125   103    73    56    33    14

SOURCE: Data plotted in Figure 9.16

Figure 9.17 The circular-speed curves that follow from the tangent
velocities given by the curve in Figure 9.16 and equation (9.6)
for v_c(R_0) = 180 km s^-1 (full curve) and v_c(R_0) = 220 km s^-1
(dashed curve).
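The construction of Figure 9.17 from Table 9.3 can be sketched numerically. Equation (9.6) is not reproduced in this section, so the sketch assumes it has the usual tangent-point form v_c(R_0 sin l) = v_los^(t)(l) + v_c(R_0) sin l, which matches the description above of a declining term plus a rising term with coefficient v_c(R_0). The value R_0 = 8.5 kpc is the one adopted elsewhere in this chapter; the function and variable names are illustrative.

```python
import math

R0_KPC = 8.5  # solar radius adopted elsewhere in this chapter

# Table 9.3: longitude (degrees) -> mean terminal velocity (km/s)
TABLE_9_3 = {15: 148.0, 20: 125.0, 30: 103.0, 40: 73.0,
             50: 56.0, 60: 33.0, 70: 14.0}


def circular_speed_curve(vc_sun, r0=R0_KPC, table=TABLE_9_3):
    """Return [(R in kpc, v_c in km/s)] from the tangent-point relation.

    Assumes equation (9.6) reads
        v_c(R_0 sin l) = v_los^(t)(l) + v_c(R_0) sin l.
    """
    curve = []
    for l_deg in sorted(table):
        sin_l = math.sin(math.radians(l_deg))
        curve.append((r0 * sin_l, table[l_deg] + vc_sun * sin_l))
    return curve


curve_180 = circular_speed_curve(180.0)
curve_220 = circular_speed_curve(220.0)
for (r, v_lo), (_, v_hi) in zip(curve_180, curve_220):
    print(f"R = {r:4.2f} kpc: v_c = {v_lo:5.1f} (180) or {v_hi:5.1f} (220) km/s")
```

Comparing the last two points of each curve shows the effect noted in the text: the curve built with the larger v_c(R_0) has the larger (less negative) gradient as the Sun is approached.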

Beyond the solar circle we cannot determine R in equation (9.3)
kinematically as we do when using equation (9.6), and must devise an
alternative way of estimating the Galactocentric radius R of each object for
which we measure v_los. The usual procedure is to determine the distance d
of the object from the Sun, and to obtain R from the law of cosines,


    R^2 = R_0^2 + d^2 - 2 R_0 d \cos l.    (9.16)


The standard technique is to determine the distance to a young cluster by
main-sequence fitting (§6.1.2) and then to measure the cluster’s velocity from
radio-line observations of associated molecular gas; see Brand & Blitz (1993)
for references. An alternative approach involves observations of Cepheid
variables (§5.1.10), which simultaneously yield both distances and velocities
(e.g., Pont et al. 1997). Planetary nebulae and carbon stars have also been
used as tracers of the outer circular-speed curve; see the review by Fich &
Tremaine (1991) for references to the literature.

Interstellar absorption gives rise to significant errors in the distances d
to tracers of Galactic rotation at R > R_0 and it is important to understand
how distance errors bias the circular-speed curve that one derives from such
tracers. To this end, we define


    W \equiv \frac{v_{\rm los}}{\sin l} = R_0 [\Omega(R) - \Omega(R_0)].    (9.17)


Here v_los is the line-of-sight component of the relative velocity between
the LSR at the Sun and at the location of a tracer that lies distance d from
the Sun at longitude l, and we have used equation (9.3). Since l is measured
precisely, errors in W simply reflect errors in v_los, which are usually
dominated by the peculiar velocities of tracers rather than by measurement
errors. Equation (9.17) shows that what we measure is a simple multiple of
the difference between the circular frequency at the estimated radius R of
the tracer and at the Sun. Hence, an error in the distance d of a tracer,
which translates through equation (9.16) into an error in R, leads us to
assign the measured frequency difference W/R_0 to the wrong radius R. In
particular, tracers that in reality all lie on a ring of a single radius,
and therefore lead to similar measured values of W, will appear to be
distributed throughout an annulus whose width depends on the magnitude of
the errors in their distances; the measurements of the velocities of such
tracers will falsely imply that Ω(R) is constant through this annulus and
therefore that v_c(R) ∝ R.
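The bias described above can be illustrated with a small numerical experiment. This is a sketch under stated assumptions: the tracers, their longitude (l = 150°) and the ±20% distance errors are hypothetical, and R_0 = 8.5 kpc and v_c(R_0) = 220 km s^-1 are values used elsewhere in the chapter. Inverting equation (9.17) gives v_c(R) = (R/R_0)[W + v_c(R_0)], so a fixed W assigned to different radii yields v_c ∝ R.

```python
import math

R0 = 8.5        # kpc, value adopted in the chapter
VC_SUN = 220.0  # km/s, one of the values of v_c(R_0) used in the chapter


def galactocentric_radius(d, l_deg, r0=R0):
    """Equation (9.16): law of cosines for the Sun, centre, tracer triangle."""
    l = math.radians(l_deg)
    return math.sqrt(r0**2 + d**2 - 2.0 * r0 * d * math.cos(l))


def distance_to_ring(r_ring, l_deg, r0=R0):
    """Heliocentric distance to a ring of radius r_ring > r0 at longitude l."""
    l = math.radians(l_deg)
    return r0 * math.cos(l) + math.sqrt(r_ring**2 - (r0 * math.sin(l))**2)


def circular_speed_from_w(w, r, r0=R0, vc_sun=VC_SUN):
    """Invert equation (9.17): v_c(R) = (R / R_0) * [W + v_c(R_0)]."""
    return (r / r0) * (w + vc_sun)


# Hypothetical tracers on a ring at 1.6 R_0 whose circular speed is
# 8 km/s below v_c(R_0), all observed at l = 150 degrees.
r_true = 1.6 * R0
w_ring = R0 * ((VC_SUN - 8.0) / r_true - VC_SUN / R0)
d_true = distance_to_ring(r_true, 150.0)

apparent = []
for scale in (0.8, 1.0, 1.2):  # assumed fractional distance errors
    r_est = galactocentric_radius(scale * d_true, 150.0)
    apparent.append((r_est, circular_speed_from_w(w_ring, r_est)))

for r_est, vc in apparent:
    print(f"R/R0 = {r_est / R0:.2f}  ->  inferred v_c = {vc:.0f} km/s")
```

All three tracers share the same W, so the inferred speeds lie along a line v_c ∝ R even though the true ring has a single radius and a single circular speed.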
Figure 9.18 The top panel shows 98 estimates of W(R) [equation (9.17)]
from Brand & Blitz (1993). The lower panel shows the values of v_c(R) that
one derives from these data. The horizontal axis is R/R_0. [After Binney &
Dehnen (1997)]


The top panel of Figure 9.18 shows measurements of W for 99 tracers
from Brand & Blitz (1993). Most errors in R are uncomfortably large,
especially at R ≳ 1.2 R_0, while the errors in W are rather small even though
they include both the measurement errors and a contribution of 7 km s^-1 from
the estimated peculiar velocities of the tracers. The top panel of Figure 9.18
suggests that W falls steeply at R_0 < R ≲ 1.2 R_0 and then falls gradually or
not at all. The lower panel of Figure 9.18 shows the values of v_c that
correspond through equation (9.17) to the data shown in the top panel. For the
reason explained above, the errors in d now give rise to diagonal error bars
along which v_c ∝ R. Partly on account of the slope of these error bars, this
panel conveys the impression that the circular-speed curve rises steadily at
R ≳ 1.2 R_0 after falling at R ≲ 1.2 R_0. This behavior of v_c(R) is
dynamically implausible, however, and Binney & Dehnen (1997) argue that in a
more plausible interpretation of the data, most of the points at
R > 1.25 R_0 are for objects that lie around a ring of radius R ~ 1.6 R_0
whose circular speed is lower than that at R_0 by ~ 8 km s^-1.

If the circular speed of the Galaxy were falling off in Keplerian fashion,
v_c(R) ∝ R^(-1/2) at R > R_0, it would fall by > 50 km s^-1 between R_0
and 1.6 R_0. Such a rapid decline is ruled out even by the very uncertain
data shown in Figure 9.18. In §8.2.4 we saw that the circular-speed curves
of external galaxies are generally flat or rising to the last observed point.
Therefore it would be surprising if Figure 9.18 did not tell a similar story.
Historically, however, the discovery by Jackson, Fitzgerald & Moffat (1979)
that the rotation curve to R ~ 1.6 R_0 does not decline beyond R_0 as the
classic galaxy model of Schmidt (1965) requires, played a significant role in
undermining the traditional belief that light would be a good tracer of mass.

Figure 9.19 Full curve: the average density of neutral hydrogen atoms
in an annulus around the Galactic center, versus radius of the annulus.
Dashed curve: the corresponding surface density of molecular gas for
the assumption that X is given by equation (8.28). R_0 has been assumed
to be 8.5 kpc and emission from the innermost 1.5 kpc is not shown. The
horizontal axis is R/kpc. [After Dame (1993) from data kindly provided by
T. Dame]


9.2.4 Radial distributions of HI and CO 


As we explained in §9.1.1, once the Milky Way’s circular-speed curve has
been determined, we can immediately deduce the radius R that is responsible
for emission at any point in the (l,v) plane. Then if the disk is optically
thin at the given l and v, equation (8.20) relates the measured brightness
temperature T_B(l,v) to the column density N, and this is in turn related
to the atomic density in the disk, n(R), by equation (9.13). The relation
between T_B(l,v) and n(R) is significantly more complex at points in the
(l,v) plane that are optically thick, but at least a lower limit on n(R) may
be set by proceeding as if the disk were optically thin. If an estimate of the
optical depth can be obtained, equation (8.21) can be used to obtain the
column density N, from which n(R) follows as before.

In §8.1.4 we argued that it is probable that the optically-thick cores of
molecular clouds do not strongly shadow one another. In this case N(l,v) is
proportional to the measured brightness temperature T_B(l,v) and equation
(9.13) can again be used to derive n(R).

The full curve in Figure 9.19 shows the azimuthally-averaged distribution
of HI in the Milky Way that one obtains as described above. Interior to
R ~ 4 kpc the density falls from a wide plateau to near zero inside
1.5 kpc. Beyond R_0 the density remains large to R ~ 16 kpc, where it turns
sharply downwards. Notice that the mass of HI is proportional to the
integral ∫ n_H R dR, in which values of n_H are weighted by R. Consequently,
the proportion of the Milky Way’s HI that lies at R > R_0 is larger than
casual inspection of Figure 9.19 would suggest: Figure 9.19 implies that 80%
of the ~ 4.3 × 10^9 M_⊙ of Galactic HI lies beyond R_0.

Figure 9.20 The dots show the estimated positions within the Galaxy
of giant molecular clouds; the Galactic center lies at the coordinate
origin and the Sun’s location is marked by ⊙. A logarithmic
spiral with a pitch angle of 10° delineates the Sagittarius-Carina
arm. A short section of a similar logarithmic spiral suggests the
location of the Orion-Cygnus arm. A dotted circle of radius 0.4 R_0
emphasizes the elongated distribution of clouds in the molecular
ring. [From data kindly provided by P. Solomon and data published
in Grabelsky et al. (1988)]
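The effect of the R-weighting in ∫ n_H R dR can be seen with a deliberately crude model. The sketch below is not the measured profile of Figure 9.19: it assumes a hypothetical density that is constant between 4 and 16 kpc (the plateau limits quoted in the text) and zero elsewhere, and the function names are illustrative.

```python
def annulus_mass_weight(n_of_r, r_in, r_out, steps=10000):
    """Midpoint-rule estimate of the integral of n(R) R dR over [r_in, r_out]."""
    dr = (r_out - r_in) / steps
    total = 0.0
    for i in range(steps):
        r = r_in + (i + 0.5) * dr
        total += n_of_r(r) * r * dr
    return total


def toy_hi_density(r_kpc):
    """Hypothetical profile: flat plateau from 4 to 16 kpc, zero elsewhere."""
    return 1.0 if 4.0 <= r_kpc <= 16.0 else 0.0


R0_KPC = 8.5  # solar radius adopted in the chapter

outer = annulus_mass_weight(toy_hi_density, R0_KPC, 16.0)
total = annulus_mass_weight(toy_hi_density, 4.0, 16.0)
fraction_beyond_r0 = outer / total

# For comparison: the fraction of the plateau's radial extent beyond R_0,
# which is what the eye tends to read off a plot of density against R.
radial_extent_fraction = (16.0 - R0_KPC) / (16.0 - 4.0)

print(f"mass fraction beyond R0: {fraction_beyond_r0:.2f}")
print(f"radial extent beyond R0: {radial_extent_fraction:.2f}")
```

Even this flat toy profile places roughly three quarters of the mass beyond R_0, although only about five eighths of the plateau's radial extent lies there, which is the sense of the remark in the text.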

The dashed curve in Figure 9.19 is the corresponding plot for molecular
hydrogen. The most striking feature of this curve is the implication that H_2
is almost entirely confined to R < R_0: 77% of the ~ 10^9 M_⊙ of molecular
material in the Galaxy appears to lie inside R_0. This narrower confinement
of H_2 relative to HI is in line with the results for external galaxies
reviewed in §8.2.2.

The Milky Way seems to be one of the minority of galaxies that have
a central depression in their H_2 disks. In fact, at l > 0 a large fraction of
the H_2 appears to be concentrated into a ring of mean radius ~ 4.5 kpc and
full width at half maximum ~ 2 kpc. The data for l < 0 suggest that the H_2
distribution is bimodal, peaking around R ~ 2.5 kpc and R ~ 6 kpc. It has
been suggested that the H_2 may not be concentrated into a ring as much as
into two giant spiral arms, and that these arms may be connected with the
central bar to be discussed below (Dame 1993).


9.2.5 Evidence for spiral structure


Some of the clearest evidence of spiral structure in the Milky Way comes from
(l,v) plots of emission by CO. In §9.6.1 we shall see that catalogs of giant
molecular clouds can be constructed by analyzing such plots. Figure 9.20
shows the result of plotting the positions in the Galaxy of the most massive
clouds in two such catalogs for a constant circular speed, v_c = 220 km s^-1,
and R_0 = 8.5 kpc in the way described in §9.1.2. Two factors complicate the
interpretation of this figure. First, it is based on the questionable
assumption that gas moves on circular orbits. Second, the catalogs from which
it is constructed by no means cover the entire Galaxy. In particular, at
positive longitudes coverage is confined to R < R_0, while at negative
longitudes only R > R_0 is covered. Consequently, the figure surely omits some
features that are important in the Galaxy. Nevertheless, the figure clearly
shows that giant molecular clouds have a tendency to concentrate into
elongated structures.

At Galactocentric radii R ~ ½ R_0, a broad roughly elliptical ridge in the
density of molecular clouds is apparent towards the left of Figure 9.20. This
feature, which accounts for roughly half of the molecular gas in the Galaxy,
is identical with the molecular ring discussed above. While the molecular
ring may really be elliptical as the figure suggests, one should not feel much
confidence in Figure 9.20 at small radii because in §§9.4 and 10.1 we shall
see that the inner few kiloparsecs of the Milky Way are dominated by a bar.
Consequently, clouds at radii as small as ½ R_0 are probably on significantly
non-circular orbits; Figure 9.20, by contrast, is based on the assumption
of perfectly circular orbits. In particular, it is perfectly possible that the
molecular ring is in reality a pair of massive spiral arms as we suggested at
the end of the last subsection.

Figure 9.20 suggests that the Sun lies near the outer edge of a spiral
arm that can be traced through nearly one turn about the Galaxy. The
ridge-line of this spiral arm is quite accurately represented by the dotted
curve in Figure 9.20, which is a logarithmic spiral with a 10° pitch angle.
For historical reasons, the part of this arm that lies inside R_0 and at
positive longitudes is called the Sagittarius arm, while the part that lies at
negative longitudes is called the Carina arm.
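For readers who wish to draw such a curve, a logarithmic spiral of pitch angle p satisfies R(φ) = R_ref exp[(φ - φ_ref) tan p]. The sketch below is generic geometry rather than the fit used for Figure 9.20; the reference radius and azimuth are hypothetical inputs, and the function names are illustrative.

```python
import math


def log_spiral_radius(phi_rad, r_ref, phi_ref_rad=0.0, pitch_deg=10.0):
    """Radius of a logarithmic spiral that passes through (r_ref, phi_ref).

    The pitch angle is the angle between the spiral and the local circle,
    so that (1/R) dR/dphi = tan(pitch).
    """
    tan_p = math.tan(math.radians(pitch_deg))
    return r_ref * math.exp((phi_rad - phi_ref_rad) * tan_p)


def log_spiral_xy(phi_rad, r_ref, phi_ref_rad=0.0, pitch_deg=10.0):
    """Cartesian coordinates of the spiral, with the Galactic center at the origin."""
    r = log_spiral_radius(phi_rad, r_ref, phi_ref_rad, pitch_deg)
    return r * math.cos(phi_rad), r * math.sin(phi_rad)


# Growth in radius over one full turn for a 10-degree pitch angle.
growth_per_turn = log_spiral_radius(2.0 * math.pi, 1.0) / log_spiral_radius(0.0, 1.0)
print(f"radius grows by a factor {growth_per_turn:.2f} per turn")
```

With a 10° pitch angle the radius grows by a factor of about three per turn, so an arm traced through nearly one turn spans a wide range of Galactocentric radius.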

Another, less conspicuous, spiral arm is just evident at the bottom left
of Figure 9.20; a short section of a logarithmic spiral very approximately
follows a chain of clouds. These clouds form part of the Orion-Cygnus arm,
which probably passes just outside the Sun. Unfortunately, this arm lies
near the edge of the region covered by the catalogs from which the figure is
constructed, so no physical significance attaches to the complete absence of
clouds further from the center than this arm. In §10.4.5 we shall see that
young star clusters are concentrated along three ridges: the
Sagittarius-Carina arm, the Orion-Cygnus arm, and a third arm, the Perseus
arm, that passes the Sun significantly further out.

To search for spiral structure well outside R_0 we must turn from the
distribution of molecular clouds, which are scarce at large radii, to (l,v)
plots for HI emission. Careful examination of Figure 9.13 reveals that the
terminal-velocity curve at (0 < l < 90°, v > 0) and at (0 > l > -90°, v < 0)
is bumpy, and points of unusually high intensity seem to delineate loops
reminiscent of those generated by spiral structure in Figure 9.10. The upper
panel of Figure 9.21 shows ridge-lines of spiral arms in the HI (l,v) plot
that have been formed by connecting local maxima in T_B. The lower panel
shows the result of mapping these arms from (l,v) space to real space under
the assumption of circular orbits discussed in §9.1.2. A series of trailing
spiral arms is apparent. One of these is clearly coincident with the Carina
arm. Another arm passes about 2 kpc from the Sun in the direction of the
anticenter. This is the Perseus arm.

Figure 9.21 Mapping of spiral arms from the (l,v) plot to real space.
The top panel shows a section of Figure 9.13, with points marking
the most prominent linear features. The lower panel shows how these
points map into real space. This mapping assumes that the hydrogen
follows circular orbits at v_c = 220 km s^-1, and that R_0 = 8.5 kpc.
The Galactic center lies at the origin and the Sun’s location is
marked by ⊙.
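The mapping from the (l,v) plane to real space can be sketched for the outer disk, where it is single-valued. The sketch assumes circular orbits, a flat curve v_c = 220 km s^-1 and R_0 = 8.5 kpc, as in Figure 9.21; with a flat curve equation (9.3) becomes v_los = v_c sin l (R_0/R - 1). The function names and the choice of axes (Galactic center at the origin, Sun at (0, -R_0), l = 0 along +y) are illustrative conventions, not those of the figure.

```python
import math

R0 = 8.5     # kpc
VC = 220.0   # km/s, flat circular-speed curve


def radius_from_vlos(l_deg, v_los, r0=R0, vc=VC):
    """Galactocentric radius for a flat rotation curve, from equation (9.3)."""
    sin_l = math.sin(math.radians(l_deg))
    if sin_l == 0.0:
        raise ValueError("v_los carries no radial information at sin(l) = 0")
    denom = 1.0 + v_los / (vc * sin_l)
    if denom <= 0.0:
        raise ValueError("velocity not reachable with this rotation curve")
    return r0 / denom


def outer_disk_position(l_deg, v_los, r0=R0, vc=VC):
    """Map an (l, v) point with R > R_0 to Cartesian (x, y) in kpc."""
    r = radius_from_vlos(l_deg, v_los, r0, vc)
    if r <= r0:
        raise ValueError("point lies inside the solar circle; distance is ambiguous")
    l = math.radians(l_deg)
    # Single positive root of the law of cosines (9.16) for R > R_0.
    d = r0 * math.cos(l) + math.sqrt(r**2 - (r0 * math.sin(l))**2)
    return d * math.sin(l), -r0 + d * math.cos(l)


x, y = outer_disk_position(150.0, -30.0)
print(f"R = {math.hypot(x, y):.1f} kpc at (x, y) = ({x:.1f}, {y:.1f}) kpc")
```

Inside the solar circle the same velocity corresponds to two distances, which is why the sketch refuses points with R ≤ R_0 rather than guessing.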


9.2.6 Vertical distributions of HI and CO 


Hitherto we have concentrated entirely on that part of the (l,b,v) data cube
that lies near the plane b = 0. Now we ask what can be learnt from the
variation of observed brightness temperature with b. Let (R, φ, z) be a system
of Galactocentric cylindrical coordinates oriented such that the plane z = 0
is identical with the plane b = 0. We wish to determine the value z_c of z
at which the density of some tracer of the ISM such as CO emission peaks
for given values of (R, φ). We call z_c the central value of z at (R, φ). In
addition, we want to know how far in z one must move from (R, φ, z_c) for
the density of the given tracer to fall to a half of its value at z = z_c. We
call this distance the half-thickness at half-intensity of the tracer, z_1/2.

Since the HI and CO layers turn out to be very thin, it is not surprising to
find that the mean velocity of the ISM at (R, φ, z) lies near the circular
speed at the corresponding point (R, φ, 0) in the plane (Lockman 1984). That
is, the ISM approximately rotates as if it were fixed on a nested sequence of
rigid cylinders, which are all rotating about the same axis through the
Galactic center. In this approximation, every feature in the (l,v) plane
generalizes to a feature in the full (l,b,v) data cube. In particular, the
terminal-velocity curve generalizes to a terminal-velocity surface.

Since we wish to determine the extent of gas in z from measurements
of its extent in b, any ambiguity in the distance from which observed line
emission comes is highly undesirable. In §9.1.1 we saw that, for a disk in
simple circular rotation, a given point in the (l,v) plane corresponds to a
unique location in the disk at R < R_0 only if (l,v) lies on the
terminal-velocity curve. Consequently, the vertical structure of the ISM at
R < R_0 is best determined by focusing on points in the (l,b,v) data cube that
lie near the terminal-velocity surfaces. Malhotra (1995) presents such an
analysis.

Beyond the solar circle, each (l,b,v) point corresponds to at most one
location in the Milky Way at R > R_0 and the determination of vertical
structure from the observed distribution in b is relatively straightforward
once the circular-speed curve has been determined.

At the start of this section we remarked that the disk of the Milky Way
falls into three parts: the inner (R < 3 kpc), middle (3 kpc < R < R_0), and
outer (R > R_0) disk. We postpone discussion of the vertical structure of the
disk at R < 3 kpc to §9.4.


The middle disk The middle disk, which contains most of the molecular
gas and star formation, is remarkably flat. More precisely, when one
determines the central value of z for some tracer of the ISM such as CO, one
finds that z_c(R, φ) defines a surface that is very nearly a plane for
3 kpc < R < R_0. In 1958, the Galactic coordinate system (l,b) was reoriented
such that this plane was well approximated by the plane z = 0 (§2.1.2).

Figure 9.22 shows the degree to which the HI and CO layers deviate
from the plane z = 0 by plotting, for HI and CO at 3 kpc < R < R_0, values
of z_c that have been estimated from the l > 0 tangent points. These oscillate
around zero with rms amplitude ~ 30 pc. Since 30 pc is less than 1% of R_0,
the middle disk is extremely flat. The oscillations apparent in Figure 9.22
appear to be coherent, however, in the sense that the values of z_c for
adjacent data points are highly correlated. Moreover, the two layers oscillate
roughly in phase. Hence, it would seem that some dynamical process corrugates
the disk. It is unclear what this process is. A full description of
corrugations in the disk can be found in Spicker & Feitzinger (1986).
Figure 9.22 The middle disk is very nearly planar. At a given
radius R, z_c is the value of z at which the emission peaks for HI
(squares) and CO (triangles). [After Malhotra (1994) and Malhotra
(1995) from data kindly provided by S. Malhotra]

Figure 9.23 The half-height at half-intensity of the HI (squares)
and CO (triangles) layers. Open symbols are for l > 0 and filled
symbols are for l < 0. [After Malhotra (1994) and Malhotra (1995)
from data kindly provided by S. Malhotra]


Figure 9.23 shows the half-thicknesses at half-intensity z_1/2 of the HI
and CO layers. The thickness of the CO layer is seen to increase slowly with
R from ~ 35 pc at R = 3 kpc to ~ 70 pc at R = R_0. The HI layer is about
three times as thick at a given radius. Thus in the middle disk, the ISM
forms an extremely thin layer.

The thickness of the HI and CO layers reflects a balance between the
tendency of gravity to pull clouds into the equatorial plane and the tendency
of the energy densities associated with the interstellar magnetic field, cosmic
rays and the random motions of clouds to push clouds up out of the plane.
By modeling the effect of random motions on the structure of the (l,v) plot
near the terminal-velocity curve (see §9.1.2), Malhotra (1995) has shown that
the line-of-sight motions of HI clouds may be characterized by a velocity
dispersion σ_HI = 9 ± 1 km s^-1 independent of R for 0.3 ≲ R/R_0 < 1. This
dispersion is in good agreement with the line-of-sight velocity dispersions of
HI clouds observed at high latitudes |b| (Merrifield 1993) and also with the
measurements of the 21-cm line widths of face-on galaxies that were cited
in §8.1.4.⁴ Of course, in the Milky Way the line of sight usually lies within
the disk, while for a face-on galaxy it lies perpendicular to the disk. Thus
it seems likely that the velocity dispersion of clouds in all galaxies is nearly
isotropic.

If the pull of gravity on HI clouds in the Milky Way towards the plane
were balanced only by a velocity dispersion of 9 km s^-1 perpendicular to the
plane, it is easy to show that the HI layer would be roughly one third as
thick as it actually is (see Problem 9.7). This calculation demonstrates the
importance of the magnetic and cosmic-ray pressures for the dynamics of the
HI layer.

In §9.2.3 we saw that the random velocities of CO clouds are of order
7 km s^-1. Since the random velocities of clouds contribute to the thickness
of a gas layer as the square of their dispersion, from the dispersions of the HI
and CO clouds alone we would predict that the HI layer would be nearly twice
as thick as the CO layer, just as the observations indicate. Since magnetic
and cosmic-ray pressures play important roles, this calculation is clearly
too naive. The correct interpretation of the harmony between the relative
dispersions and layer-thicknesses of the HI and CO clouds is probably that
the velocity dispersions of the two types of cloud are in dynamical equilibrium
with the magnetic and cosmic-ray forces on the clouds, with the result that
the total pressure from all three sources is proportional to the square of the
velocity dispersion.
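The naive estimate in this paragraph can be written out explicitly. The line below simply takes the scaling stated in the text (thickness proportional to the square of the dispersion) together with the quoted dispersions of 9 km s^-1 for HI and roughly 7 km s^-1 for CO; the superscripts on z_1/2 are notation introduced here for illustration.

```latex
\frac{z_{1/2}^{\rm HI}}{z_{1/2}^{\rm CO}}
  \simeq \left(\frac{\sigma_{\rm HI}}{\sigma_{\rm CO}}\right)^{2}
  = \left(\frac{9\ {\rm km\,s^{-1}}}{7\ {\rm km\,s^{-1}}}\right)^{2}
  \simeq 1.65
```

A ratio of about 1.65 is the "nearly twice" of the text; as the paragraph itself stresses, the estimate ignores magnetic and cosmic-ray pressure and should not be read as more than a consistency check.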


The outer disk In §8.2.4 we saw that the majority of galactic disks are
warped at large radii. The disk of the Milky Way is no exception, and was, in
fact, the first warped disk to be discovered: its warp was apparent as soon as
21-cm data became available for the southern hemisphere (Burke 1957, Kerr
1957, Westerhout 1957). Our location in the Milky Way proves to be ideally
suited for the study of the warp, and it has been reanalyzed several times
since the pioneering investigations of Oort, Kerr & Westerhout (1958) and
Gum, Kerr & Westerhout (1960), most recently by Diplas & Savage (1991),
who employed the HI survey of Stark et al. (1992).

4 Figure 5.18 shows that the cooling rate climbs steeply near T = 10^4 K.
Consequently, it is very hard to heat the ISM above this temperature and much
of the diffuse ISM is at temperatures near 10^4 K. At this temperature the
sound speed in the ISM lies close to 10 km s^-1. Velocities in excess of the
sound speed are strongly damped by shocks.

The structure of the warp is most easily understood by first transforming
the data cube of a 21-cm survey from (l,b,v) coordinates, which are natural
for the observations, into the usual Galactocentric polar coordinate system
(R, φ, z) (see Problem 9.8). In this transformation velocities are converted
to radii using equation (9.3) in conjunction with an assumed circular-speed
curve v_c(R) = RΩ(R). This step involves the implicit assumption that the
ISM rotates as if it were made up of a series of concentric rigid cylinders. This
assumption is very unlikely to be precisely valid where the disk is strongly
warped, but the errors in the derived values of (R, φ, z) to which it gives rise
are probably insignificant: the motions that are generated by the warp should
lie nearly perpendicular to the plane, and therefore make no contribution to
v_los.

The classical analyses of the warp assumed that beyond R_0 the circular
speed was slowly falling, while modern analyses assume that v_c is flat
or slowly rising to the largest observed radii. Changing the shape of the
assumed rotation curve changes the scale associated with the observed warp,
as the following argument shows. Equation (9.17) is used to infer R from a
measured value of W, and the slower Ω decreases with R, the larger will be
the inferred value of R for given W. Consequently, the radial scale of the
warp was smaller in earlier work than in more recent analyses.
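The dependence of the inferred scale on the assumed rotation curve can be made concrete with a one-parameter family of curves. The power law v_c(R) = v_c(R_0) (R/R_0)^(-α) used below is a hypothetical parametrization chosen so that equation (9.17) can be inverted in closed form; α = 0 is a flat curve and α > 0 a falling one. The function name and the sample value W = -110 km s^-1 are illustrative.

```python
R0 = 8.5        # kpc
VC_SUN = 220.0  # km/s


def radius_from_w(w, alpha=0.0, r0=R0, vc_sun=VC_SUN):
    """Invert equation (9.17) for v_c(R) = vc_sun * (R / r0) ** (-alpha).

    With this (hypothetical) power law,
        W = vc_sun * [(R / r0) ** (-(1 + alpha)) - 1],
    so R = r0 * (1 + W / vc_sun) ** (-1 / (1 + alpha)).
    """
    x = 1.0 + w / vc_sun
    if x <= 0.0:
        raise ValueError("W is too negative for this family of rotation curves")
    return r0 * x ** (-1.0 / (1.0 + alpha))


w_sample = -110.0  # km/s, an illustrative outer-disk value
r_flat = radius_from_w(w_sample, alpha=0.0)
r_falling = radius_from_w(w_sample, alpha=0.1)
print(f"flat curve: R = {r_flat:.1f} kpc; falling curve: R = {r_falling:.1f} kpc")
```

For the same measured W the flat curve places the gas at a larger radius than the falling curve, which is why the warp appears radially more extended in modern analyses than in the classical ones.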

Figure 9.24 shows the intersections of the HI layer with four cylinders
that have R equal to 12, 16, 20 and 24 kpc. The figure, in which the azimuthal
coordinate φ is plotted horizontally, and z vertically, is for R_0 = 8.5 kpc
and a circular-speed curve that is perfectly flat at v_c = 220 km s^-1 for
R > R_0. The warp is just discernible at R = 12 kpc, and becomes pronounced
by R = 16 kpc. For R < 17 kpc the warp is well approximated by a simple
sine curve with an amplitude that rises linearly in R. Beyond 17 kpc, a
pronounced asymmetry develops between φ > 0 and φ < 0. For R > 11 kpc
a reasonable approximation to the figure of the warp is given by


    \frac{z_c}{\rm kpc} = \frac{R/{\rm kpc} - 11}{6} \sin\phi
        + 0.3 \left( \frac{R/{\rm kpc} - 11}{6} \right)^2 (1 - \cos 2\phi).    (9.18)


In particular, at φ > 0, R = 24 kpc the HI layer rises more than 4 kpc out
of the plane z = 0 defined by the middle disk, while at φ < 0 the HI layer
approximately lies within the plane z = 0. The Sun happens to lie near
the warp’s line of nodes, that is, the line in which the warped outer disk
intersects the plane of the inner Galaxy.
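Equation (9.18) is easy to evaluate numerically. The sketch below assumes the reading z_c/kpc = x sin φ + 0.3 x² (1 - cos 2φ) with x = (R/kpc - 11)/6; returning zero for R ≤ 11 kpc is an added convention, since the text quotes the formula only for R > 11 kpc. The function name is illustrative.

```python
import math


def warp_height_kpc(r_kpc, phi_deg):
    """Height z_c (kpc) of the HI layer from equation (9.18)."""
    if r_kpc <= 11.0:
        # Convention adopted here: treat the disk as flat inside 11 kpc,
        # where equation (9.18) is not quoted.
        return 0.0
    x = (r_kpc - 11.0) / 6.0
    phi = math.radians(phi_deg)
    return x * math.sin(phi) + 0.3 * x * x * (1.0 - math.cos(2.0 * phi))


for r in (12.0, 16.0, 20.0, 24.0):  # the four cylinders of Figure 9.24
    up = warp_height_kpc(r, 90.0)
    down = warp_height_kpc(r, -90.0)
    print(f"R = {r:4.1f} kpc: z_c(+90 deg) = {up:5.2f} kpc, "
          f"z_c(-90 deg) = {down:5.2f} kpc")
```

At R = 24 kpc the formula gives a rise of nearly 5 kpc on the φ > 0 side but well under 1 kpc on the φ < 0 side, in line with the asymmetry described above; the second term is what bends the φ < 0 side back towards the plane.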

Mapping the warp in CO is a painful process for two reasons: (i) in
the outer disk CO clouds are sparse and have small CO luminosities; (ii)
the warp is expected to carry clouds well out of the great circle b = 0, with
the result that a thorough survey must cover a large area of the sky. This
requirement is especially difficult to satisfy with a dish of the size needed
to achieve high sensitivity, because this will measure emission from a small
region with each pointing. On account of these difficulties, the warp has
been mapped much less extensively in CO than in HI, but the available data
suggest that the CO layer has the same shape as the HI layer (Grabelsky et
al. 1987, Digel et al. 1991). Wouterloot et al. (1990) circumvented the need
for a systematic survey in CO by searching in the IRAS catalogue for IR
sources that are embedded in molecular clouds. They obtained CO spectra
in the directions of these objects, and thus deduced their velocities, from
which kinematic distances followed in the usual way. With this technique
they were able to trace the warp to R ~ 20 kpc, and measure the thickness
of the CO layer to this distance.

Figure 9.24 In each panel the grayscale shows the HI density on a cylinder
of radius R with the Galactic center on the cylinder’s axis. The horizontal
axis is the azimuthal coordinate φ and the vertical axis is z.

Figure 9.25 The half-thickness at half-maximum intensity of the HI and CO layers,
showing how the increase in thickness with radius continues beyond R = R_0. Results from
various data sets are shown for the HI layer thickness; the solid line and error bars indicate
the average and standard error of these results. [Figure kindly provided by R. Olling]

The warp has also been traced in relatively nearby OB stars (Miyamoto, 
Yoshizawa & Suzuki 1988, Smart & Lattanzi 1996), red-giant stars (Carney 
& Seitzer 1993), discrete sources in the IRAS catalogue (Djorgovski & Sosin 
1989) and diffuse IR radiation from stars (§10.1). There is still no generally 
accepted explanation for the cause of the warp (Binney 1992). 

The increase with R in the thickness of the HI layer that we encountered
above for R < R_0 continues, and indeed gathers pace, at R > R_0. Figure 9.25
illustrates this phenomenon: z_1/2 for HI rises from ~ 250 pc at R_0 to
~ 600 pc at 2R_0 and to even larger values further out. The variation in z_1/2
with R is of considerable dynamical interest because z_1/2 is dictated by the
hydrostatic balance between the pull of gravity toward the Galactic plane and
the countervailing pressure within the ISM. Thus, z_1/2(R) provides a direct
measure of the total mass near the plane of the Galaxy as a function of radius
(Olling 1995). As we saw in §9.2.3, the Milky Way’s rotation curve yields the
total mass as a function of radius in the Galaxy; combining this information
with the measurement of how much mass lies close to the Galactic plane then
allows us to measure the shape of the mass distribution. For R > R_0, where
most of the mass cannot be explained by the visible components, this analysis
gives a direct measure of the shape of the massive dark halo. The procedure
is somewhat complicated by the uncertain contribution to pressure support
from cosmic rays and magnetic fields, and the warp in the gas layer, but the
analysis suggests that the dark halo of the Milky Way is slightly oblate, with
a shortest-to-longest axis ratio of ~ 0.8 (Olling & Merrifield 1998).

Figure 9.26 The circular-speed curve (top) and the half-thickness
at half intensity (below) of the HI layer as determined by Merrifield
(1992) using only the 21-cm data of Weaver & Williams (1973,1974)
and Kerr et al. (1986).

Before we leave the topic of the thickness of the HI layer, we should 
mention a method of determining z1/2(R) and vc(R) outside the solar circle 
without knowledge of the distances to individual objects. Merrifield (1992) 
shows that if the disk is in circular rotation and z1/2 depends only on R and 
not on azimuth φ, one can determine both z1/2(R) and vc(R) from 21-cm 
data alone. The essence of this method is as follows: we pick a particular 
value of angular velocity, Ωi, and at each point (l, b) on the sky we determine 
the intensity of HI emission at velocity vlos = (Ωi − Ω0) R0 sin l. By equation 
(9.3), this slice through the (l, b, vlos) data cube originates from material with 
angular velocity Ωi. So long as Ω(R) is a monotonic function, Ωi corresponds 
to a single radius Ri, with the result that all emission in this slice through 
the data cube originates from a cylindrical shell with an (as-yet unknown) 
radius Ri. 

If Ri < R0, we lie outside the shell, so the emission is confined to a 
finite range in l. We can determine Ri from the extent of this range, and the 
analysis is essentially identical to the tangent point method of §9.1.1. 

If Ri > R0, we lie inside the cylindrical shell of emitting gas, so the 
emission extends to all l. From these data we may determine for each l the 
distance in |b| at which the emission falls to half its peak intensity. This 
quantity is a function b1/2(Ωi, l) of the chosen angular velocity and l. The 
reason b1/2 varies systematically with l is that the distance d to the ring 
of emission varies with Galactic longitude, and so the angle subtended by 
z1/2(Ri) varies with l. Quantitatively, 


$$
b_{1/2}(\Omega_i, l) = \arctan\!\left(\frac{z_{1/2}}{d}\right)
  = \arctan\!\left(\frac{z_{1/2}(R_i)}{R_0\cos l + \sqrt{R_i^2 - R_0^2\sin^2 l}}\right),
  \qquad (9.19)
$$


where we have used equation (9.16). The quantities z1/2(Ri) and Ri(Ωi) can 
be estimated by varying them until equation (9.19) most nearly predicts the 
observationally determined function b1/2(Ωi, l). Of course, once R(Ω) has 
been determined by calculating Ri for a range of values of Ωi, vc(R) = RΩ(R) 
follows trivially. 
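
The fitting step can be illustrated with a minimal Python sketch. It evaluates equation (9.19) and recovers Ri and z1/2 for a single shell by a brute-force least-squares search. The value R0 = 8 kpc, the grid ranges and the synthetic "observations" are assumptions made for illustration only; they are not the data or the fitting procedure of Merrifield (1992).

```python
import math

def b_half(z_half, R_i, l_deg, R0=8.0):
    """Equation (9.19): angular half-thickness (degrees) of a shell with R_i > R0.

    z_half, R_i and R0 are in kpc; R0 = 8 kpc is an assumed value.
    """
    l = math.radians(l_deg)
    # Distance to the shell along longitude l, for an observer inside the shell.
    d = R0 * math.cos(l) + math.sqrt(R_i**2 - (R0 * math.sin(l))**2)
    return math.degrees(math.atan(z_half / d))

def fit_ring(longitudes, observed, R0=8.0):
    """Grid search for the (R_i, z_half) pair that best reproduces b_1/2(l)."""
    best = None
    for i in range(1, 121):              # R_i from R0 + 0.1 to R0 + 12 kpc
        R_i = R0 + 0.1 * i
        for j in range(1, 201):          # z_half from 0.01 to 2.00 kpc
            z = 0.01 * j
            chi2 = sum((b_half(z, R_i, l, R0) - b)**2
                       for l, b in zip(longitudes, observed))
            if best is None or chi2 < best[0]:
                best = (chi2, R_i, z)
    return best[1], best[2]

# Synthetic test: a shell at R_i = 12 kpc with z_1/2 = 0.45 kpc.
longitudes = list(range(0, 360, 20))
synthetic = [b_half(0.45, 12.0, l) for l in longitudes]
R_fit, z_fit = fit_ring(longitudes, synthetic)
print(R_fit, z_fit)
```

Because d depends on l only through Ri, the shape of b1/2(l) fixes Ri while its amplitude fixes z1/2, which is why both quantities can be recovered from a single velocity slice.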

Figure 9.26 shows the circular-speed curve and disk-thickness profile 
obtained by this method. The method breaks down for R ≳ 2R0 because the 
HI layer is too patchy at these large distances for b1/2 to be well defined in 
the HI data. Out to these radii the values of vc(R) and z1/2(R) compare 
reasonably well with those shown in Figures 9.18 and 9.25. There is, 
however, a noticeable discrepancy between the circular-speed curves derived by 
these different methods. Kuijken & Tremaine (1994) pointed out that this 
discrepancy is expected if the outer disk of the Milky Way is slightly elliptical 
in shape rather than circular, as we have assumed. 


9.3 Other tracers of the ISM 


9.3.1 Diffuse infrared emission 


As we mentioned in §8.1.6, our knowledge of the infrared appearance of the 
Universe in general, and the Milky Way in particular, was revolutionized 
in the 1980s by the IRAS satellite. In addition to cataloging ~ 250000 
point sources, IRAS detected diffuse emission over almost the entire sky. In 
the 1990s these measurements were confirmed and extended to longer 
wavelengths by the DIRBE experiment aboard the COBE satellite. Boulanger & 
Pérault (1987) analyzed the implications of the IRAS data for the Galaxy’s 
diffuse infrared emission, while Boulanger et al. (1996) analyzed the COBE 
data. 

In Box 3.2 we saw that if we model the distribution of dust in the disk 
by a layer of uniform thickness, the column density of dust along the line 
of sight is proportional to csc|b|. Hence if the diffuse IR radiation detected 
by IRAS and COBE is emitted by dust, we might expect its intensity, when 
averaged over longitude l, to be proportional to csc|b|. The lower panel of 
Figure 9.27 tests this hypothesis by plotting against csc|b| the l-averaged 
100 μm flux density measured by COBE. In preparing this plot, Boulanger 
et al. (1996) omitted data for directions to the Magellanic Clouds and to a 
few prominent molecular clouds, and they subtracted the contribution 
of dust in the solar system (‘zodiacal emission’; see §4.2.1). 

For csc|b| ≳ 3 (|b| ≲ 20°) all curves fall nearly linearly in Figure 9.27 in 
accordance with the prediction of the slab model of Box 3.2. For csc|b| ≲ 2.2 
(|b| ≳ 30°), the flux densities in the two longer wavebands lie slightly above 
straight lines fitted to the data for csc|b| ≳ 3. Thus at all latitudes the data 
for these longer wavebands are in reasonable agreement with the prediction of 
the model in which the detected IR radiation is emitted by a thin, uniform 

Figure 9.28 Plot showing how the small-scale fluctuations in 
100 μm emission (as measured by the COBE satellite) are 
correlated with those in the integrated flux from the 21-cm line (as 
measured in the Leiden–Dwingeloo survey). The small-scale 
fluctuations have been calculated by subtracting a smooth model from 
the infrared and HI data. The grayscale shows the number of 0.5°-
square pixels on the sky where the residual fluctuations have the 
indicated values. The line shows the mean relation between 100 μm 
and 21-cm flux determined from Figure 9.27. [After Boulanger & 
Pérault (1987) using data kindly provided by F. Boulanger and 
G. Lagache] 


dust layer such as that assumed in Box 3.2. 
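
As an illustration of how the slab prediction can be tested, the following Python sketch fits a straight line to intensity as a function of csc|b| by ordinary least squares. The latitudes and intensities are synthetic values invented for the example, not the COBE measurements, and the function name is hypothetical.

```python
import math

def fit_csc_law(latitudes_deg, intensities):
    """Least-squares fit of I = slope * csc|b| + offset; returns (slope, offset)."""
    x = [1.0 / math.sin(math.radians(abs(b))) for b in latitudes_deg]
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(intensities) / n
    sxx = sum((xi - mean_x)**2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, intensities))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

# Synthetic slab: intensity 2.0 per unit csc|b| plus an isotropic offset of 0.5
# (arbitrary units; both numbers are assumptions for illustration).
latitudes = [10, 15, 20, 30, 45, 60, 90]
intensities = [2.0 / math.sin(math.radians(b)) + 0.5 for b in latitudes]
slope, offset = fit_csc_law(latitudes, intensities)
print(slope, offset)
```

In such a fit the slope measures the emission of the layer seen perpendicular to the plane, while a non-zero offset would indicate an isotropic component that does not follow the slab geometry.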


The distribution of the diffuse IR radiation shown in Figure 9.27 
strongly suggests that this radiation comes from the Galactic plane, but 
do unresolved stellar sources contribute significantly to it? Boulanger & 
Pérault (1987) argue that the answer to this question is ‘no’ because even in 
the 12 μm waveband, to which stars (being hot) would contribute most, the 
integrated radiation from all resolved stars amounts to only 8% of the 
observed diffuse flux density, and the emission of unresolved stars is estimated 
to account for < 3% of the diffuse flux density. Thus the ISM accounts for 
~ 90% of the IR luminosity of the Milky Way. 

This conclusion is reinforced by comparison of the 100 μm flux-density 
profile shown in the lower panel of Figure 9.27 with the analogous profile 
for the 21-cm line that is shown in the top panel. The shapes of these 
two profiles are similar. Figure 9.28 strongly reinforces this conclusion by 
plotting for each 0.5°-square pixel on the sky, the difference between the 
observed 100 μm flux density and the mean flux density at the pixel’s latitude 
b, versus the analogous quantity for the 21-cm line. The diagonal elongation 
of the densely populated part of the figure demonstrates that a pixel that 
is brighter at 100 μm than normal for its latitude, is overwhelmingly likely 
to yield a stronger than average 21-cm signal. The diagonal straight line in 
Figure 9.28 shows the relationship between 100 μm flux and 21-cm flux that 
one derives from the l-averaged data plotted in Figure 9.27. In so far as this 
line lies close to the ridge-line of the densely-populated part of Figure 9.28, 
there is a universal relationship between 21-cm line strength and 100 μm flux 
density. If we use equation (8.22) to convert 21-cm line strengths into column 
densities of hydrogen atoms, we may express this relationship as 


$$
j_{100\,\mu\mathrm{m}} = 6.66 \times 10^{-44}\ \mathrm{W\,Hz^{-1}\,atom^{-1}}. \qquad (9.20)
$$


That is, a box of ISM that contains nH hydrogen atoms on the average 
radiates 1.07 × 10⁻⁴³ nH W Hz⁻¹ at 100 μm. In particular, there is no room 
for doubt that the ISM is as responsible for the diffuse 100 μm emission as 
it is for the 21-cm line. 

We saw in §8.1.6 that the brightness of the ISM in the shorter IRAS 
wavebands came as a considerable surprise because the importance of small 
grains had not been appreciated prior to the launch of the IRAS satellite. 
It is still true, however, that most of the luminosity of the ISM emerges at 
λ > 60 μm: roughly half of the luminosity observed by IRAS falls in the 
100 μm band, with only a sixth in each of the other three bands. COBE was 
sensitive at the longer wavelengths, where most of the flux from the ISM is 
expected to lie. Sodroski et al. (1994) presented a preliminary analysis of the 
data for |b| < 10°. From a comparison of the 140 μm and 240 μm fluxes for 
different fields with the corresponding fluxes in the 21-cm and CO lines and 
in 5 GHz radio-continuum radiation, they concluded that the larger grains 
that dominate the far-IR fluxes have temperatures ~ 20 K and together 
radiate about 1.1 × 10¹⁰ L⊙ in the far-IR. For comparison, the Milky Way’s 
V-band luminosity is about LV = 1.4 × 10¹⁰ L⊙. Since the dust only emits 
because it is heated by stars, we conclude that interstellar dust reprocesses 
nearly a half of the energy produced by stars. 

Figure 9.29 shows the variation with l of the far-IR luminosity per unit 
gas mass determined by Sodroski et al. (1994). Near the solar circle (l < 90°) 
this is ~ 1.5–3 L⊙/M⊙, but it is significantly greater further in.⁵ With data 
for only two wavebands, Sodroski et al. (1994) could not map the temperature 
of the dust with precision, but they do conclude that its temperature rises 


⁵ Boulanger & Pérault (1987) estimated a similar total IR luminosity from the 
shorter-wavelength IRAS data. 

Figure 9.29 Far-IR luminosity per unit gas mass as a function of longitude l. [After 
Sodroski et al. (1994) from data kindly provided by T. Sodroski] 


from ~ 18 K in the anticenter direction to ~ 21 K near the center. From more 
extensive COBE data Boulanger et al. (1996) measured the dust temperature 
to be 17.5 K. Since the emissivity of dust depends on temperature roughly as 
T⁶ (see Problem 8.7), these temperatures imply that the emissivities of dust 
near the center and beyond the solar circle should be in the ratio ~ 2.5 : 1, 
which is crudely consistent with Figure 9.29. 
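
The quoted ratio is a one-line calculation, sketched here in Python under the assumption that the emissivity scales as the sixth power of temperature; the function name is hypothetical.

```python
def emissivity_ratio(T_warm, T_cool, index=6):
    """Ratio of dust emissivities for an assumed power-law dependence T**index."""
    return (T_warm / T_cool) ** index

# Dust at ~21 K near the center versus ~18 K toward the anticenter.
ratio = emissivity_ratio(21.0, 18.0)
print(round(ratio, 2))  # 2.52
```

A temperature contrast of under 20% therefore suffices to change the emissivity by a factor of about 2.5.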

On small scales the Galaxy’s IR emission is irregular, and much of it is 
concentrated into wisps or streaks on the sky. Low et al. (1984) dubbed this 
wispy component of the emission infrared cirrus. Equation (9.20) gives 
the mean 100 μm emissivity per interstellar hydrogen atom. On small scales 
this emissivity varies by up to a factor of 3. It tends to be highest in the 
vicinity of OB associations. A natural explanation of this phenomenon is 
that the emissivity at any point depends on the local flux of UV photons, 
which dominate the supply of energy to grains (see Problem 8.8). 


9.3.2 Pulsars and the Galactic magnetic field 


Measurements of the dispersion measure Dm (§8.1.4) along lines of sight to 
pulsars that have known distances have been used to determine the mean 
value of the free-electron density, ne, in the disk. In most cases the distance 
to the pulsar is estimated from the structure of 21-cm absorption in its 
spectrum. In a few cases the pulsar is associated with a nebula whose distance 
can be determined by some other method.⁶ A model of the variation of ne 
with Galactocentric radius R and vertical position z within the Galaxy that 
accounts for the available data is 


$$
n_e(R, z) = \left(2.5 + 1.5\,e^{-|z|/70\,\mathrm{pc}}\right)
  \left(\frac{2}{1 + R/R_0}\right) \times 10^{4}\,\mathrm{m^{-3}}. \qquad (9.21)
$$
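
A model of this kind can be turned into dispersion measures, and inverted to give distances, with a few lines of Python. The sketch below reads equation (9.21) as n_e = (2.5 + 1.5 e^{-|z|/70 pc}) [2/(1 + R/R0)] × 10⁴ m⁻³ and assumes R0 = 8 kpc and a Sun located in the plane; the function names, step counts and the bisection scheme are illustrative choices, not a published procedure.

```python
import math

def n_e(R_kpc, z_pc, R0_kpc=8.0):
    """Free-electron density in m^-3, reading equation (9.21) as stated above."""
    vertical = 2.5 + 1.5 * math.exp(-abs(z_pc) / 70.0)
    radial = 2.0 / (1.0 + R_kpc / R0_kpc)
    return vertical * radial * 1e4

def dispersion_measure(l_deg, b_deg, d_kpc, R0_kpc=8.0, steps=2000):
    """Integral of n_e from the Sun out to distance d, in pc cm^-3 (midpoint rule)."""
    l, b = math.radians(l_deg), math.radians(b_deg)
    ds = d_kpc / steps
    total = 0.0
    for k in range(steps):
        s = (k + 0.5) * ds
        x = s * math.cos(b)                       # distance projected onto the plane
        R = math.sqrt(R0_kpc**2 + x**2 - 2.0 * R0_kpc * x * math.cos(l))
        total += n_e(R, 1000.0 * s * math.sin(b), R0_kpc) * ds
    return total * 1e-6 * 1000.0                  # m^-3 kpc -> cm^-3 pc

def distance_from_dm(l_deg, b_deg, dm_target, d_max_kpc=30.0):
    """Invert the monotonic relation D_m(d) by bisection; returns d in kpc."""
    lo, hi = 0.0, d_max_kpc
    for _ in range(40):
        mid = 0.5 * (lo + hi)
        if dispersion_measure(l_deg, b_deg, mid) < dm_target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

dm_pole = dispersion_measure(0.0, 90.0, 1.0)      # 1 kpc toward the Galactic pole
d_back = distance_from_dm(0.0, 90.0, dm_pole)
print(dm_pole, d_back)
```

Because n_e is positive everywhere, D_m increases monotonically with distance along any line of sight, which is what makes the inversion well defined.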


⁶ The distances to most pulsars are determined from their dispersion measures by 
assuming the validity of a model such as that given by equation (9.21). 

The Galactic magnetic field B has been probed by all the methods that 
have been used to probe magnetic fields in external galaxies (§8.2.7). In 
particular, the tendency of the Davis–Greenstein effect (§3.7.1) to align the 
electric vector of radiation from obscured stars parallel to B, has been used 
to obtain Figure 3.19, which shows the direction of the Galactic magnetic 
field near the Sun. Averaged over kiloparsec scales, B usually points within 
the plane. On smaller scales, plumes of magnetic field rise tens of parsecs 
above the plane in a way reminiscent of magnetic loops observed in the 
chromosphere of the Sun. 

In §8.1.4 we saw that when both the dispersion measure and the rotation 
measure of a pulsar are known, one can determine the mean value along the 
line of sight (weighted by ne) of the component of B that is parallel to the 
line of sight. Such measurements confirm that the Galactic magnetic field 
is highly organized on kiloparsec scales and also enable us to determine the 
magnitude of the field. The mean field runs azimuthally around the Galaxy. 
At the solar radius and beyond, the field points in the direction of Galactic 
rotation. Roughly 500 pc inside R0, the direction of the field reverses, so 
that it points in the opposite direction to Galactic rotation in the Sagittarius 
spiral arm (§9.2.1). At a Galactocentric radius of ~ 5.5 kpc, B seems again 
to reverse its direction (Rand & Lyne 1994). 

As Figure 3.19 makes clear, on small scales B wiggles. The importance 
of the wiggles is usually quantified in terms of a picture in which the field is a 
superposition of an ordered large-scale component and a random small-scale 
component. The ratio of the RMS values of the ordered and small-scale fields 
is a measure of the importance of wiggles. Measurements of Dm and Rm for 
pulsars are sensitive only to the ordered field. They indicate that this has 
magnitude ~ 0.2 nT near the Sun, rising to ~ 0.6 nT at R = 4.5 kpc (Rand 
& Lyne 1994). 

The magnitude of the disordered component of B is much harder to 
estimate. Probably the most reliable estimates are obtained from the typical 
intensity jsync of Galactic synchrotron emission by the argument given in 
§8.2.7. Berkhuijsen (1998) finds that B decreases from ~ 1nT at r = $ Ro 
through ~ 0.6nT at Ro to ~ 0.3nT at R = 2Ro. Taken with the estimates 
of the magnitude of the ordered component of B cited above, these values 
for B imply that the random component of B is at least two and a half times 
larger than the ordered field. As in external galaxies, it seems that B is 
most highly ordered between spiral arms and becomes relatively disordered 
in spiral arms. 

In a few cases there is convincing evidence that B is larger in molecular 
clouds than in lower-density interstellar space. For example, from a study of 
the polarization of 21-cm radiation from the Orion B cloud, van der Werf et 
al. (1993) concluded that, as one probes deeper into the cloud and ny rises 
from 10° m~* to 10"! m~°, B increases from ~ 3nT to 6nT. 

The strongest interstellar magnetic field known occurs in the filaments 
of synchrotron emission that are pictured in Figure 9.30. These filaments are 

Figure 9.30 A square 85 pc on a side mapped in 1.65 GHz 
continuum emission. The plane runs from top left to bottom right and 
the radio source Sgr A, which contains the Galactic center, is visible 
as a big blob of emission just below and to the right of the center of 
the panel. About 30 pc from Sgr A several filaments of synchrotron 
emission extend perpendicular to the disk for a few parsecs. [After 
Yusef-Zadeh (1989) using data at 
http://imagelib.ncsa.uiuc.edu/project/preview/95.FY.01.01] 


located about 30 pc from the Galactic center and extend perpendicular to 
the Galactic plane for a few parsecs. From the brightness of the synchrotron 
emission in them one infers B ~ 100 nT. Since magnetic stresses scale as 
B², such an intense field would exert forces that are 10000 times larger than 
those exerted by B at a typical point in the ISM, where B ≲ 1 nT. These 
objects are not fully understood, but their straightness probably reflects 
the ability of a dynamically dominant magnetic field to resist the tendency 
of interstellar turbulence to tangle field lines. A few cases of filaments of 
synchrotron emission that run perpendicular to the plane are known nearer 
to the Sun (Wieringa et al. 1993). 


9.3.3 Diffuse Hα radiation 


Fabry-Perot spectrometers detect diffuse Hα radiation at all Galactic 
latitudes (Reynolds 1993). The intensity of this radiation varies irregularly over 
the sky, but the average intensity within a window that is several degrees in 
size is roughly proportional to csc|b|, where b is the latitude of the window’s 
center. This scaling with csc|b| suggests that the radiation originates in a 
layer centred on the plane (see Box 3.2). The production of the observed Hα 
emission within this layer requires a large power input to the gas of the layer: 
when the data are analyzed along the lines of §8.1.3, one finds that the 
minimum power requirement is ~ 10⁻⁷ W m⁻² of the disk, or ~ 4 × 10³⁴ W for the 
disk out to a Galactocentric distance of 12 kpc. It is interesting to compare 
this power requirement with the average power input by supernovae, which 
are thought to be one of the major energy sources for the ISM. If there is one 
supernova per 30 years in the Milky Way, then their average power input to 
the ISM is ~ 5 × 10³⁴ W. Thus the diffuse Galactic Hα emission is associated 
with a major item in the energy budget of the ISM. 
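
The arithmetic behind this comparison can be checked with a short Python sketch. It takes the power requirement as 10⁻⁷ W per square metre of disk, the disk radius as 12 kpc, and the supernova power as 5 × 10³⁴ W at one event per 30 years; the unit conversions are standard values, and the energy per supernova is derived here purely as a consistency check rather than quoted from the text.

```python
import math

KPC_IN_M = 3.0857e19     # metres per kiloparsec
YEAR_IN_S = 3.156e7      # seconds per year

# Power needed to sustain the H-alpha emission over a disk of radius 12 kpc.
disk_area = math.pi * (12.0 * KPC_IN_M) ** 2          # m^2
halpha_power = 1e-7 * disk_area                        # W

# Energy per supernova implied by 5e34 W at one supernova per 30 years.
sn_energy = 5e34 * 30.0 * YEAR_IN_S                    # J

print(f"{halpha_power:.2e} W, {sn_energy:.2e} J per supernova")
```

The first number comes out near 4 × 10³⁴ W, and the second near 5 × 10⁴³ J, so sustaining the Hα emission would absorb most of the power that supernovae are assumed to deliver.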

The observed Hα emission is not associated with detectable emission in 
the 630 nm and 520 nm lines of neutral oxygen and nitrogen, respectively. 
This absence of N and O lines suggests that the diffuse Hα emission comes 
from gas that is nearly fully ionized. Consequently, in this gas the 
free-electron density is a reliable guide to the total mass density. We saw in 
§8.1.5 that the column density of free electrons along the line of sight to 
a pulsar can be determined from the pulsar’s dispersion measure. Column 
densities determined in this way for lines of sight that reach far out of the 
plane are N ~ 7 × 10²³ csc|b| m⁻². When the contribution to the mass 
density of helium atoms is taken into account, one concludes that there is 
1.6 M⊙ pc⁻² of ionized gas in the disk near the Sun. 

If we assume that the particle-density n of the ionized gas varies with 
height z above the plane as 


$$
n(z) = n(0)\,e^{-|z|/z_0}, \qquad (9.22)
$$


then we can estimate the scale-height z0 of the gas from the value of n(0) 
that is obtained by observing nearby pulsars, and the total column density 
N along latitude b = 90°. Reynolds (1993) concludes that n(0) ~ 2.5 × 
10⁴ m⁻³, and thus that z0 ~ 1 kpc. 
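
For an exponential layer the column density toward the pole is the integral of n(z) from 0 to infinity, which equals n(0) z0, so z0 = N/n(0). The Python sketch below evaluates this with N ≈ 7 × 10²³ m⁻² and n(0) ≈ 2.5 × 10⁴ m⁻³, and also estimates the surface density of ionized gas through both sides of the plane. The helium correction factor of 1.4 is an assumed, conventional value, not one given in the text.

```python
KPC_IN_M = 3.0857e19     # metres per kiloparsec
PC_IN_M = 3.0857e16      # metres per parsec
M_SUN_KG = 1.989e30      # solar mass in kg
M_H_KG = 1.6735e-27      # mass of a hydrogen atom in kg

N_pole = 7e23            # electron column density toward b = 90 deg, m^-2
n0 = 2.5e4               # mid-plane density n(0), m^-3

# Scale height of the exponential layer: N = n(0) * z0.
z0_kpc = N_pole / n0 / KPC_IN_M

# Surface density through both sides of the plane, with an ASSUMED factor
# of 1.4 to allow for helium.
helium_factor = 1.4
sigma = 2.0 * N_pole * M_H_KG * helium_factor * PC_IN_M**2 / M_SUN_KG

print(round(z0_kpc, 2), round(sigma, 2))  # 0.91 1.57
```

Both results agree with the rounded values of about 1 kpc and 1.6 M⊙ pc⁻² quoted in the text.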

From the irregularity of the observed Hα flux on the sky it follows that 
the ionized gas from which it comes is inhomogeneously distributed. It is 
likely that it fills only a small fraction ≲ 10% of space near the plane, but 
≳ 50% of space at z ~ 1 kpc. 


9.3.4 Diffuse synchrotron and γ-radiation 


Until recently, the angular resolution with which γ-rays have been detected 
was extremely poor, and even now it remains poor by comparison with the 
angular resolutions routinely employed in other branches of astronomy. A 
consequence of this lack of resolution is that it is not clear how much of the 
observed γ-radiation is contributed by point sources. That point sources do 

Figure 9.32 The dependence upon longitude l of the 70–150 MeV 
γ-ray flux measured by the COS-B satellite in the northern and 
southern Galactic hemispheres. [After Bloemen (1993) from data 
kindly provided by H. Bloemen] 


contribute significantly follows from the fact that observed γ-ray fluxes are 
time-variable (Skibo, Ramaty & Leventhal 1992). 

Figure 9.31 shows the spectrum of the Milky Way’s diffuse γ-radiation. 
It extends from 0.1 to 10⁴ MeV and can be crudely characterized as a power 
law qγ(E) ∝ E⁻². The shape of this spectrum is reasonably well understood 
in terms of the processes discussed in §8.1.4. 

Figure 9.32 shows the flux of γ-rays as a function of longitude l for both 
northern and southern Galactic hemispheres. The flux is weakly peaked 
around l = 0. Theoretical models of the γ-ray intensity tend to predict a 
more centrally peaked γ-ray flux than that shown in Figure 9.32; as we saw 
in §8.1.4, the γ-ray emissivity should be proportional to the product of 
the cosmic-ray flux and the density n of the ISM, and both of these are 
expected to peak near the Galactic center. 

Cosmic rays give rise not only to γ-rays, but also to synchrotron 
radiation. Consequently, by observing diffuse synchrotron radiation, we can 
obtain additional information regarding the Galactic distribution of cosmic 
rays. In §8.2.1 we saw that in disk galaxies the 1.4 GHz radio-continuum 
brightness temperature T1.4, which measures the strength of synchrotron 
radiation, is accurately proportional to the intensity ICO of 2.6 mm radiation 
by ¹²CO molecules. Allen (1992) shows that the same proportionality holds 
within the disk of the Milky Way, point by point, for Galactocentric radii in 
the range 2–12 kpc. This result is striking because over this radius range 
T1.4 and ICO vary by two orders of magnitude. The most natural 
interpretation of the correlation is that the cosmic rays responsible for the synchrotron 
emission also heat the molecular clouds, increasing the intensity of their CO 
emission (Suchkov, Allen & Heckman 1993). Thus, the value of ICO from any 
point depends on the cosmic-ray environment as well as the total amount of 
CO at that point. The common use of ICO as a direct measure of the 
molecular content of the Milky Way and other galaxies (§§9.6.1 and 8.1.4) should 
therefore be viewed with some caution. 


9.3.5 Diffuse X-rays 


The distribution of diffuse X-radiation in a band centred on 0.2 keV is 
strongly concentrated towards the Galactic poles. More generally, in this 
band the X-ray flux in each direction is strongly anti-correlated with the 
strength of the 21-cm line in that direction. A natural interpretation of this 
finding is that the X-rays come from outside the relatively thin layer to which 
neutral hydrogen is confined, and at low latitudes are absorbed by cool gas 
on their way to us. If this interpretation were correct, X-rays of lower 
energy, which are more strongly absorbed by cool gas, would be even more 
strongly concentrated to high latitudes. Actually the distribution of 
lower-energy X-rays is remarkably similar to that observed for the band centred 
on 0.2 keV. 

These findings led Snowden et al. (1990) to hypothesize that the Sun sits 
in a ‘local bubble’ of hot X-ray emitting gas that is embedded in the layer 
of cool gas. In this model, the X-ray and 21-cm fluxes are anticorrelated 
because the X-ray flux is small and the 21-cm flux is large in directions in 
which we are near the bubble’s edge, while in directions in which we are far 
from the bubble’s edge the X-ray flux is large and the 21-cm flux is small. 
Subsequently, the ROSAT satellite demonstrated that at least part of the 
anticorrelation between X-ray and 21-cm fluxes is generated by absorption, so 
the situation is now rather confused. 

A major problem that dogs attempts to study the ISM with X-rays is 
that of determining how much of the observed diffuse flux is of extragalactic 
origin. It is known that a large part of the high-latitude soft X-ray flux 
derives from unresolved quasars and Seyfert galaxies (Hasinger et al. 1993). 


9.4 The central disk 


We now turn to the study of the ISM inside R ~ 3 kpc, which corresponds 
to |l| ≲ 20°. The structure of this region proves to be highly complex, 
and after forty years of study there is barely a consensus as to even its 
basic form. Here we shall argue the case for what seems to us the most 
plausible interpretation of the observations, while trying to deal fairly with 
some alternative interpretations. Moreover, “most plausible” should not be 
read as “plausible.” If the picture we shall present is at all close to the truth, 
the center of our galaxy is a very strange place indeed. 

Before we plunge into the detail of the observations, it is worth taking 
an inventory of the relevant data. The most useful data are from line obser- 
vations of various species, especially HI, CO, CS, OH, and H2CO (formalde- 
hyde), which we now discuss. 


9.4.1 21-cm observations 


The 21-cm line of HI is important because it can be detected over almost 
the entire region (|l| < 20°, |b| < 10°). However, the interpretation of the 
21-cm spectra for some crucial lines of sight is delicate because the observed 
spectra are significantly affected by absorption. Two phenomena contribute 
to absorption in the observed spectra. First and most obviously, some of 
the 21-cm line radiation emitted in the center region is absorbed in the 
great column of gas that lies in the middle disk between us and the center. 
Since this material is moving on nearly circular orbits, it contributes strong 
absorption only at the velocities that lie in the interval (0, 300 sin l) km s⁻¹ 
characteristic of circular orbits in the middle disk (see Problem 9.10). 

A second contribution to absorption in spectra observed at small |l| 
and |b| arises from diffuse 21-cm continuum emission from the region (|l| ≲ 
1.2°, |b| ≲ 0.2°) (Reich et al. 1984). As we saw in §8.2, when a background 
continuum source is observed through a screen of HI atoms, it contributes 
to the spectrum more strongly far from line-center, where there is 
negligible absorption, than it does at line-center. Consequently, the line appears 
weaker in emission than it would if the continuum source were not present. 
If the brightness temperature of the weakened line is blindly inserted into 
equation (8.22), a spuriously low value of the column density NH will be 
derived. 

There are two possible strategies for dealing with this problem of 
pervasive absorption. One is to calibrate the absorption with the aid of lines of 
molecules such as OH and H2CO that absorb radiation without themselves 

Figure 9.33 The brightness temperature in the 21-cm line 
averaged over |b| < 0.5° in the central region of the Galaxy. [After Liszt 
& Burton (1980) and Burton & Liszt (1983) from data kindly 
provided by H. Liszt] 


emitting significantly. The other strategy is to abandon the 21-cm data and 
observe the region of interest in lines of molecules such as CO that are less 
affected by absorption; at the high frequencies of these lines, the brightness 
temperature of continuum emission is negligible. A major problem with both 
strategies is that tracers other than HI have very low densities in many ar- 
eas of interest, so some dynamical information can only be gleaned from the 
21-cm line, with all its attendant problems. 

In Figure 9.13 we saw that the envelope of the 21-cm emission at (l > 
0, v > 0) has a pronounced horn at (l = 15°, v = 140 km s⁻¹) and falls 
steeply from there towards the center, and we interpreted this phenomenon as 
evidence for a hole of radius ~ 3 kpc in the centre of the HI disk. This hole is 
not entirely devoid of HI, however. Figure 9.33 is an (l, v) plot for the region 
(|l| < 12°, |b| < 0.5°), which is said to be occupied by the central disk. (The 
grayscale of this plot corresponds to much lower brightness temperatures 
than the faintest levels shown in Figure 9.13 with the result that matching 
Figures 9.13 and 9.33 is difficult.) At (l > 0, v > 0) the envelope of the 
emission in Figure 9.33 sweeps up from v ~ 180 km s⁻¹ at l = 12° to reach 
v ~ 270 km s⁻¹ at l ~ 3°. The envelope then rounds a horn and plunges 
steeply but crosses l = 0 at v ~ 200 km s⁻¹. Recall from §9.1.1 that when 
l is small and negative, positive velocities are strictly forbidden if the disk 
is in circular rotation. In reality, the upper envelope of the emission in 
Figure 9.33 reaches l ~ −5° before dropping below v = 100 km s⁻¹. This 
constitutes unmistakable evidence that within the central disk the orbits of 
HI clouds are not even approximately circular. 

Since so many external disk galaxies have bars, the natural reaction to 
this conclusion is to ask whether the central region of our own galaxy does 
not contain a bar, and to interpret Figure 9.33 in the light of our discussion 
of the (l, v) traces of oval orbits in §9.1.1. In particular, we have seen that the 
(l, v) traces of oval orbits cut the line l = 0 at non-zero v (see Figure 9.8), so 
an ensemble of oval orbits may be able to account for the fact that the upper 
envelope in Figure 9.33 cuts l = 0 at v ~ 200 km s⁻¹. Recall from Figure 9.8 
that the (l, v) trace of an oval orbit is roughly symmetrical under inversion 
through the origin. From this it follows that the envelope in the (l, v) plot 
of emission from a disk made up of oval orbits should also be symmetrical 
on inversion through the origin. 

Now consider the lower envelope in Figure 9.33. At l = −11° this 
lies at v ~ −200 km s⁻¹ and drops under a marked bump to a horn at 
(l ~ −4°, v ~ −270 km s⁻¹), very approximately as if it were the inversion through the 
origin of the envelope at (l > 0, v > 0). But from (l, v) = (0, −200) the lower 
contour moves almost vertically towards the origin exactly as expected if the 
central disk were in circular rotation (cf. Figure 9.3). Moreover, Figure 9.33 
is significantly unsymmetrical on inversion through the origin in that there is 
no counterpart at (l > 0, v < 0) of the forbidden emission at (l ~ −5°, v > 
100 km s⁻¹). In fact, we are confronted with a paradox: the upper envelope 
in Figure 9.33 suggests that the HI moves on oval orbits, and the lower 
envelope seems to imply circular orbits. 

This paradox has perplexed astronomers since it first emerged when the 
northern-hemisphere data taken in the Netherlands were compared with the 
southern-hemisphere data taken in Australia. Burton & Liszt (1978) and 
Burton (1992) argue powerfully that the paradox can be solved by taking 
into account two factors: (i) 21-cm emission from the part of the central disk 
that lies in front of the Galactic center is strongly masked by absorption of 
continuum radiation, and (ii) the central disk, amazingly, does not lie in the 
plane b = 0. 

Since the HI of the middle disk contributes significantly to absorption 
only at velocities in the interval (0, 300 sin l) km s⁻¹, the effects of absorption 
by such gas can be largely eliminated by focusing on emission at 
velocities outside this interval. Figure 9.34 shows a map of the 21-cm 
brightness temperature integrated over −300 km s⁻¹ < v < −100 km s⁻¹ and 
100 km s⁻¹ < v < 300 km s⁻¹. Absorption against the Galactic-centre 
continuum emission is clearly apparent around the origin of the panel for v < 0, 
while the other panel shows no trace of such absorption.⁷ Evidently the 
near side of the central disk is approaching us at (l, b) = (0, 0). At l > 0 


⁷ (l, v) plots of absorption by OH and H2CO molecules (Cohen & Few 1976, Cohen & 
Few 1981) confirm that in front of the nucleus there is more material at v < −100 km s⁻¹ 
than at v > 100 km s⁻¹. 

Figure 9.34 The 21-cm brightness temperature integrated over −300 km s⁻¹ < v < 
−100 km s⁻¹ (right) and 100 km s⁻¹ < v < 300 km s⁻¹ (left) [After Burton (1992) from 
data kindly supplied by H. Liszt] 


Figure 9.35 The spider diagram of a nearly edge-on elliptical disk. 


the ridge-line of emission at allowed velocities (v > 0) runs almost straight 
along b = 0, while at l < 0 the ridge-line of emission at allowed velocities 
has a slight tendency to run 0.5° above b = 0. Almost all the material with 
forbidden velocities at l > 0 lies at b < 0, while all the material at l < 0 with 
forbidden velocities lies at b > 0. 

This last antisymmetry between b > 0 and b < 0 suggests that the
central disk may not lie exactly in the plane b = 0. In §8.2.4 we investigated
the projected velocity fields or spider diagrams of inclined disks. What would
the spider diagram of the central disk look like if the disk were arbitrarily
thin and could be studied at arbitrary angular resolution? Figure 9.35 shows
the spider diagram of an elliptical disk that is viewed at inclination i = 82°.
The long axis of the disk makes an angle of 20° with the projection of the
line of sight into the disk’s plane, and the orbits of clouds in the disk have
axis ratio 0.74. The systemic-velocity contour is the straight contour that
slopes up and to the right. If the disk were circular, this contour would be
perpendicular to the (horizontal) line of nodes. Forbidden velocities occur in
the wedge between the systemic-velocity contour and a vertical line through
the center. Thus if the central disk were elliptical and oriented like the
disk that underlies Figure 9.35, the forbidden velocities would be confined to
(l > 0, b < 0) and (l < 0, b > 0), just as observed. In particular, the nearer
end of the disk’s long axis lies at l > 0, b < 0.

Three angles are required to specify the orientation of an elliptical disk:
two angles specify the orientation of the normal to the disk, and the third
specifies the direction of the disk’s long axis within its plane. Two
observations indicate likely values of this last angle:
• The values of |v| at which the envelope of emission in Figure 9.33 crosses
  l = 0 are large, ~150 km s⁻¹. To achieve such substantial radial
  velocities v_r anywhere around an elliptical orbit, the latter has to be
  pretty elongated (see Problem 9.11). In view of this, it is likely that the
  orbits happen to be orientated such that the line of sight to the center
  cuts the orbits near where v_r peaks. Given that the orbits are quite
  elongated, it follows that the angle between the line of sight and their
  long axes is ~20° (Binney et al. 1991).
• The conclusion we have just reached, that the line of sight lies near
  to the long axis of the bar, is confirmed by the shape of the
  terminal-velocity curves at 2° < |l| < 12°. In fact, it naturally
  reconciles the strong rise in the magnitude of the terminal velocity as |l|
  decreases in this interval with the outwards-increasing circular speed
  predicted by the near-IR brightness profile of the Milky Way (see §10.1).

What about the orientation of the normal to the central disk? Let
(θ_n, φ_n) be the usual polar angles of the normal in a system of spherical
polar coordinates oriented such that b = 0 and θ = π/2 are the same plane
and the Sun-center line coincides with (θ = π/2, φ = 0); see Figure 9.36.
Then it is clear that θ_n is small because we see the disk nearly edge-on. Now
consider two contrasting cases as regards φ_n. If |φ_n| ≈ π/2, we would see the
central disk nearly edge-on, and emission at the largest values of |l| and large
allowed velocities would come from well above or well below b = 0, unless θ_n
were negligible. Since Figure 9.34 shows that emission at allowed velocities
peaks near b = 0 we can rule out this case. If φ_n ≈ 0, the further end of
the disk will be at b > 0 and the nearer at b < 0, just as we deduced above.
Moreover, in this case, emission at large |l| and large allowed velocities will
come from points near b = 0, as observed. Finally, in order to ensure that
forbidden velocities occur at b > 0 for l < 0 we must have φ_n < 0. Thus
both θ_n and |φ_n| are small, and φ_n is negative.
Figure 9.37 A slice of the 21-cm (l, b, v) data cube parallel to the
plane b = −l tan 22°. Data for the pixel around l = b = 0 are taken
to be the average of the data for l = 0, b = ±0.5° because at l = b = 0
the spectrum is strongly affected by absorption. Notice that this
slice, in contrast to that parallel to b = 0 shown in Figure 9.33, is
nearly symmetric under inversion through the origin. [After Burton
& Liszt (1978) from data kindly supplied by H. Liszt]


To quantify θ_n we argue from Figure 9.35 that the extent b_max in b of
the forbidden emission is ~ a_x sin θ_n / R₀, where a_x is the length of the
disk’s semi-major axis and we have approximated by unity the cosine of the
angle between the Sun-center line and the bar’s long axis. In this
approximation, the extent l_max in |l| of the emission at permitted velocities
is a_y / R₀, where a_y is the length of the disk’s semi-minor axis. Figure 9.34
suggests b_max ≈ 3° and l_max ≈ 12°. Thus

$$\sin\theta_n \simeq \frac{b_{\max}}{l_{\max}}\,\frac{a_y}{a_x} \simeq 0.25\,\frac{a_y}{a_x}. \qquad (9.23)$$

From Figure 9.43 below we conclude that a_x/a_y ≈ 2 and hence that θ_n ≲ 7°.
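
As a quick numerical check of this estimate, the two relations quoted above
(b_max ≈ a_x sin θ_n / R₀ and l_max ≈ a_y / R₀) can be combined to eliminate
R₀. The following is a minimal Python sketch rather than part of the original
analysis; the function and argument names are illustrative choices.

```python
import math

def disk_tilt_deg(b_max_deg, l_max_deg, axis_ratio):
    """Polar angle theta_n of the disk normal, in degrees.

    Combines b_max ~ a_x sin(theta_n) / R_0 with l_max ~ a_y / R_0, so that
    sin(theta_n) ~ (b_max / l_max) * (a_y / a_x).
    axis_ratio is a_x / a_y (semi-major over semi-minor axis).
    """
    sin_theta = (b_max_deg / l_max_deg) / axis_ratio
    return math.degrees(math.asin(sin_theta))

# Values read from the text: b_max ~ 3 deg, l_max ~ 12 deg, a_x / a_y ~ 2.
theta_n = disk_tilt_deg(b_max_deg=3.0, l_max_deg=12.0, axis_ratio=2.0)
print(f"theta_n ~ {theta_n:.1f} deg")
```

With these inputs the tilt comes out at about 7°, in line with the bound
quoted in the text.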
This analysis is not the only possible interpretation of the data. In a
series of papers summarized in Burton (1992), Burton & Liszt have argued
for a slightly different orientation of the central disk than that derived above:
they favor (θ_n = 24°, φ_n = −33°) and envisage a rather large angle (51°)
between the bar’s major axis and the Sun-center line (see Problem 9.12).

Conventional (l, v) plots display slices through the (l, b, v) data cube
that are parallel to a plane b = constant, but we can make an (l, v) plot by
slicing through the data cube in almost any direction. Figure 9.37 shows the
result of slicing the data cube parallel to the plane b = −0.4l, close to the
line of nodes inferred by Burton & Liszt (1978). The resulting plot shows a
high degree of symmetry on inversion through the origin, just as we would
expect if the central disk were made up of oval orbits. Switching from the
plane b = 0 to the plane b = −0.4l modifies the (l, v) plot in two ways. First,
it diminishes the importance of absorption by concentrating on lines of sight
that lie further from the plane, and therefore pass through less foreground
HI. Second, it enhances the strength of the forbidden emission because, as
we have seen, this is concentrated below and above b = 0 at l > 0 and l < 0,
respectively.

Figure 9.38 (l, v) plot of ¹³CO emission at |l| < 5° averaged over
|b| < 0.6°. [After Bally et al. (1988) from data kindly supplied by
A. Stark]
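
The idea of cutting an (l, v) plot out of the data cube along a tilted plane
can be illustrated with a short NumPy sketch. This is only a schematic under
stated assumptions: the cube is taken to be a regular array indexed as
(l, b, v), the latitude is sampled at the nearest pixel rather than
interpolated, and the synthetic cube, grid spacings and the function name
`oblique_lv_slice` are all hypothetical.

```python
import numpy as np

def oblique_lv_slice(cube, l_deg, b_deg, slope):
    """Extract an (l, v) slice of a data cube along the plane b = slope * l.

    cube  : array of shape (n_l, n_b, n_v), e.g. brightness temperature
    l_deg : 1-D array of longitudes for axis 0
    b_deg : 1-D array of latitudes for axis 1
    Returns an array of shape (n_l, n_v), using the nearest latitude pixel.
    """
    target_b = slope * l_deg
    # For each longitude, index of the latitude pixel closest to the plane.
    b_idx = np.abs(b_deg[None, :] - target_b[:, None]).argmin(axis=1)
    return cube[np.arange(l_deg.size), b_idx, :]

# Synthetic test cube (hypothetical): emission concentrated along b = -0.4 l
# and independent of velocity.
l_deg = np.linspace(-10.0, 10.0, 41)
b_deg = np.linspace(-5.0, 5.0, 21)
v_kms = np.linspace(-300.0, 300.0, 61)
offset = b_deg[None, :, None] + 0.4 * l_deg[:, None, None]
cube = np.exp(-(offset / 0.5) ** 2) * np.ones((1, 1, v_kms.size))

flat = oblique_lv_slice(cube, l_deg, b_deg, slope=0.0)     # conventional b = 0
tilted = oblique_lv_slice(cube, l_deg, b_deg, slope=-0.4)  # b = -0.4 l
print(flat.sum(), tilted.sum())
```

In this toy cube the tilted slice collects far more emission than the slice at
b = 0, which is the qualitative effect described above. Note that
tan 22° ≈ 0.40, so the slope used here matches the plane quoted in the caption
of Figure 9.37.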


9.4.2 Observations in lines of CO and CS 


CO emission from the central disk is clearly visible in Figure 9.15 in the form
of great spikes at |l| < 4° that reach to |v| > 180 km s⁻¹. At v < 0 these
spikes are confined to smaller values of |l| than at v > 0. Figure 9.38 shows
at higher angular resolution the ¹³CO emission in the region |l| < 5°, which
corresponds to R ≲ 700 pc at the Galactic center. A band of strong emission
runs horizontally through the figure. This emission is associated with gas
that lies well outside the inner 700 pc and serves only to confuse study of
the central region. Emission from central material can be most easily traced
at |v| ≳ 30 km s⁻¹. On the extreme left in Figure 9.38 a feature centered
on (l, v) = (3°, 100 km s⁻¹) is conspicuous. This tall thin feature consists
of emission from gas that is confined to ~0.6° (80 pc) in l yet distributed
over ~100 km s⁻¹ in velocity; this gas is called Bania’s Clump 2. Several
similar clumps of gas are seen in the inner few degrees of the Galaxy, while
nothing of comparable scale is seen further out.

Figure 9.39 (l, v) plot for the J = 2 → 1 line of CS (top) and the J = 1 → 0
line of ¹²CO (bottom). In both panels emission has been averaged over
|b| < 0.1°. No CS spectra are available at l < −1°. Contours are spaced by
0.15 K and 1 K in the upper and lower panels, respectively. [After Binney et
al. (1991)]

The intensity of the ¹³CO emission in Figure 9.38 also peaks strongly at
(l, v) = (0.66°, 55 km s⁻¹). This peak is due to the Sgr B2 complex. It has
an angular diameter of ~0.25° (35 pc) and several lines of argument suggest
that its mass lies near 3 × 10⁶ M⊙ (Bally et al. 1988).
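
The conversions between angular and linear sizes quoted in this section follow
from the small-angle relation, size ≈ distance × angle (in radians). The sketch
below assumes a Sun-center distance of 8 kpc, the viewing distance used for
Figure 9.40; that value and the function name are assumptions made for
illustration.

```python
import math

R0_PC = 8000.0  # assumed distance to the Galactic center, in parsecs

def projected_size_pc(angle_deg, distance_pc=R0_PC):
    """Linear size (pc) subtended by a small angle at the given distance."""
    return distance_pc * math.radians(angle_deg)

# Angles quoted in the text: the |l| < 5 deg region, the width of Bania's
# Clump 2 (0.6 deg) and the diameter of Sgr B2 (0.25 deg).
for angle in (5.0, 0.6, 0.25):
    print(f"{angle:5.2f} deg -> {projected_size_pc(angle):6.1f} pc")
```

The results are close to the rounded figures of 700 pc, 80 pc and 35 pc given
in the text.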

Sgr B2 lies along a ridge of intense molecular emission that is most
clearly seen in the upper panel of Figure 9.39. This diagram shows emission
in the J = 2 → 1 line of CS. Since CS is a highly dipolar molecule, its emission
traces the densest gas (§8.1.4): whereas CO molecules are collisionally
de-excited at densities n_H ≳ 3 × 10⁹ m⁻³, CS molecules are only collisionally
de-excited for n_H ≳ 10¹² m⁻³. Hence Figure 9.39 traces the distribution of
the densest gas and shows that this gas is confined in the (l, v) plane to a
ridge, which slopes gently downward through the location of Sgr B2 towards a
hot spot at (l, v) ≈ (−0.1°, 55 km s⁻¹). This hot spot marks the location of the
Sgr A molecular complex. In Figure 9.39 data are not available for l < −1°,
but there is a striking absence of significant emission at −1° < l < −0.2°.
Other (l, v) plots confirm that three-quarters of the molecular gas at the
Galactic center lies at positive longitudes. We do not know why.

Much of the emission in Figure 9.38 falls within a parallelogram that
extends from (l, v) ≈ (1.8°, 200 km s⁻¹) at top left to ≈ (−1.4°, −200 km s⁻¹)
at bottom right. The lower panel in Figure 9.39 shows this parallelogram
more clearly by plotting the intensity of ¹²CO emission averaged over
|b| < 0.1°. By emphasizing emission from more tenuous gas this plot reveals
that the parallelogram extends as far as l ≈ 2° at positive longitudes and to
l ≈ −1.4° at negative longitudes, and to more than ±200 km s⁻¹ in v.

Bally et al. (1988) find that the line on the sky of peak molecular
emission is aligned with the line b = 0 to within ~1.5°, but lies about 0.03°
below b = 0. In §9.5 below we shall see that Sgr A*, which is thought to
mark the dynamical center of the Galaxy, lies 0.046° below b = 0. The width
of the emitting layer varies with l, but between l = 0.2° and l = 0.6° is
typically 0.13° (18 pc at the Galactic center). If this width reflects the random
vertical velocities of emitting gas clouds, the velocity dispersion of the clouds
would have to be of order σ_z = 54 km s⁻¹. This dispersion is considerably
larger than that of giant molecular clouds near R₀, but comparable to the
line-of-sight velocity dispersion within features such as Bania’s Clump 2.


9.4.3 A dynamical model of the central disk 


We have seen that 21-cm data suggest that gas observed at longitudes of a few
degrees is moving on highly eccentric orbits. Binney et al. (1991) proposed
a dynamical model of the inner few kiloparsecs of the Galaxy that connects
this phenomenon both with the morphology of molecular emission at |l| < 2°
and with the near-infrared photometry of the Galaxy that is the subject of
§10.1. Figure 9.40 illustrates the idea behind this model by showing a series
of closed orbits in a simple rotating barred potential. Two distinct types
of orbit are evident: (i) at the largest radii shown the orbits are distinctly
elliptical and become progressively more elongated as one goes further in,
until eventually an orbit is reached that has sharp cusps at its apocenters;
(ii) inside this cusped orbit the orbits are again elongated, but form ovals
that are oriented perpendicular to the cusped orbit. The family of orbits that
culminates in the cusped orbit is called the family of x₁ orbits, while the
family of orbits that lie inside the cusped orbit is called the family of x₂
orbits. The transition from the x₁ family to the x₂ family is associated
with the inner Lindblad resonance of the given bar potential (see §3.3 of
BT). The x₂ orbits are typically less elongated than the x₁ orbits, and their
elongation increases rather than decreases with increasing radius.

If cold gas is injected into a barred potential, it will settle quickly to
approximately closed orbits and then drift slowly inwards by moving
continuously through a sequence of ever smaller closed orbits. This pattern of
gradual orbital evolution breaks down when the cusped orbit is reached,
because smaller orbits of the x₁ sequence have small loops at their apocenters
(see Figure 3-16 of BT). Hence they are self-intersecting orbits, and loss
of kinetic energy as parcels of gas at different points around such an orbit
crash into each other will quickly drive the gas to substantially smaller radii.
Thus once gas reaches the cusped orbit, it quickly transfers to much smaller
orbits of the x₂ family, and resumes its slow inward drift. Once gas is on x₂
orbits, its inward drift slows because energy is lost more slowly on rounder
orbits, and the x₂ orbits are not only rounder than the majority of x₁ orbits,
but become even rounder as one goes to smaller radii. If the disk is in an
approximately steady state, the surface density of gas will be highest where
gas drifts inwards most slowly, and there should be a significant build-up of
gas on x₂ orbits.

Figure 9.40 Closed orbits in a simple rotating bar potential (upper panel) and
their traces in the (l, v) plane (lower panel) when viewed from a point that
lies 8 kpc down the dotted line in the upper panel. [After Blitz et al. (1993)]

Figure 9.41 The smallest of the orbits of Figure 9.40 shown on an enlarged
scale. The biggest orbit shown is the cusped orbit and the orbits inside it
belong to the x₂ orbit family.

In the model of Binney et al., the Sun-center line makes an angle of
about 20° with the long axis of the bar, which coincides with the long axis
of the x₁ orbits. Most 21-cm emission is interpreted as arising from gas on
x₁ orbits, while the dense molecular gas that is traced by CS emission is
on x₂ orbits. Consequently, the distribution of emission in Figure 9.37 is
interpreted in terms of the (l, v) traces of x₁ orbits, while that in the upper
panel of Figure 9.39 is interpreted in terms of the (l, v) traces of x₂ orbits.
These traces are shown in the lower panel of Figure 9.40 and the right-hand
panel of Figure 9.41.

As we expect from the discussion of §9.1, the (l, v) traces of a series
of x₁ orbits of decreasing size, and therefore increasing elongation, become
steadily more open, and the (l, v) trace of the cusped orbit approximates a
parallelogram. Binney et al. suggested that the parallelogram apparent in the
lower panel of Figure 9.39 is a modified form of the parallelogram associated
with the cusped orbit; the observational and theoretical parallelograms differ
significantly because gas-dynamical effects such as shocks cause gas to move
on significantly non-closed orbits near the transition from x₁ to x₂ orbits.
Since the parallelogram in Figure 9.39 extends to l ≈ 2°, the potential is
probably such that the (l, v) trace of the cusped orbit extends to l ≈ 2°.
This model does not explain why the CO emission is so very asymmetrical
in longitude.

If the 21-cm emission does come from material on x₁ orbits, the upper-left
and lower-right envelopes of the emission in Figure 9.37 should correspond
to envelopes of the (l, v) traces of x₁ orbits in the lower panel of
Figure 9.40. A good family likeness between these envelopes is apparent
(see Figure 9.42). In particular, the downward sweep away from l ≈ 2° of the
upper-left envelope in the observational figure is reproduced by the
theoretical figure. This downward sweep does not reflect a strong decrease with
R in the strength of the inward gravitational pull of the Galaxy, but arises
because as l increases we are seeing less and less elongated orbits rather
nearly end-on; at l ≈ 2° emission near the envelope in the observational (l, v)
plot comes from material that is zipping past the center on its way from the
near cusp to the far one, while further out emission near the envelope comes
from material on more nearly circular orbits.

The emission from dense gas in Figure 9.39 nearly all arises at velocities
v ≲ 100 km s⁻¹ that are markedly smaller than those, v ≳ 200 km s⁻¹,
associated with the corresponding envelope for HI in Figure 9.37. Figure 9.42
shows that the Binney et al. model provides a natural explanation for this
contrast: on account of the orientation of the x₂ orbits, the highest-velocity
CS emission is dominated by material that is near apocenter and therefore
going most slowly around its orbit, rather than being near pericenter as is
the HI that is seen on x₁ orbits further out. Moreover, this consequence
of the different orientations of the two families of orbits is amplified by
another circumstance. The x₂ orbits that are associated with CS emission are
physically much smaller than the x₁ orbits that are associated with 21-cm
emission, even though the largest x₂ orbit does not differ greatly in
projected size from the smallest x₁ orbit. Consequently, the CS velocities
probe the Galactic force-field at substantially smaller radii than do the 21-cm
velocities. In the Binney et al. model, the circular speed v_c associated with
the axisymmetric component of the Galaxy’s potential increases slowly with R
(as a small positive power of R), so the small x₂ orbits have smaller
characteristic velocities than do the much bigger x₁ orbits.

Figure 9.42 Top panel: the (l, v) traces of the x₁ orbits of Figure 9.40
superimposed on the HI distribution of Figure 9.33. Lower panel: the (l, v)
traces of the x₂ orbits of Figure 9.41 superimposed on the CS distribution of
Figure 9.39.

If the Galaxy were axisymmetric, the downward sweep of the envelope
of 21-cm emission at l > 2° in Figure 9.37 would imply that the circular
speed was a decreasing function of R at 0.2 kpc ≲ R ≲ 1 kpc. Older mass
models of the Galaxy usually include a compact central mass component
to ensure that the circular speed behaves in this way. In the Binney et al.
model, this downward sweep is entirely due to the decreasing elongation of
closed x₁ orbits as R increases, and occurs despite v_c being an increasing
function of R.

As we saw in §9.4.1, the concentration of forbidden velocities at b > 0
for l < 0 and at b < 0 for l > 0 arises naturally if the plane of the x₁ orbits
is tipped by a few degrees about a line that lies nearly in the plane of the
sky so that the far ends of these orbits lie at b > 0. It is not clear whether
the plane of the x₂ orbits is similarly tipped, but the relatively small extent
of these orbits along the line of sight ensures that a small tip of their plane
would not be conspicuous.

As we noted above, pressure forces will ensure that gas clouds do not
move on exactly closed orbits, especially near the transition from the x₁
to x₂ orbits. Therefore it is important to test a simple picture such as
that of Binney et al. (1991), which is based on the characteristics of closed
orbits, with hydrodynamical simulations of gas flow in barred potentials.
Many such simulations have been performed, using a variety of numerical
techniques to simulate the hydrodynamics of the ISM (which is in detail
an extremely complex fluid). By and large, there is reasonable agreement
between the results of these simulations.⁸ Figure 9.43 shows a simulation of
gas flow in a model of the Galactic potential that is based on near-infrared
photometry of the Milky Way (see §10.1). Comparison of the (l, v) plot
of this simulation, which is shown in the lower panel of Figure 9.43, with
the observational (l, v) plots of Figures 9.13 and 9.15 shows that the most
prominent features in the observational plots have theoretical counterparts.
Unfortunately, the observational features tend to occur at larger longitudes
and velocities than their theoretical counterparts. Hence, while it is clear
that the general picture of a rotating bar at the Galactic center is correct,
we do not yet have a satisfactory understanding of its shape and kinematics.


8 See Englmaier & Gerhard (1997) for a discussion of critical parameters of gas- 
dynamical simulations. 
Figure 9.43 A hydrodynamical simulation of gas flow in a model of the Galactic potential
that is based on near-infrared photometry of the Milky Way (§10.1). Upper panel: the
density of the ISM in real space. The bar’s long axis lies along the line x = y and the
location of the Sun is marked along the line that is inclined by 20° to the bar’s axis. The
straight lines from the position of the Sun indicate the approximate directions of tangents
to spiral arms in the Galaxy (Vallée 1995); that immediately to the right of the center is
tangent to the 3 kpc arm. Lower panel: the (l, v) plot obtained when the model is viewed
from the Sun’s location. The Sun’s rotation speed is v_c = 208 km s⁻¹ and the bar’s
pattern speed is 57.3 km s⁻¹ kpc⁻¹, which places corotation at 3.4 kpc. [After Englmaier
& Gerhard (1997) from data kindly supplied by O. Gerhard]
Figure 9.44 Two views of the ISM in the vicinity of the Galactic nucleus. Left panel:
6-cm radio-continuum emission from Sgr A. The mapped region, 10 pc on a side, shows
Sgr A East as an extended region of non-thermal emission centered on the middle of the
frame and the characteristic spiral pattern of Sgr A West to the right of the frame. Right
panel: An expanded view of Sgr A West. A radio-continuum contour map is superposed
on contours of the intensity of emission from HCN molecules. [Left panel: VLA data of
Killeen & Lo courtesy of K.Y. Lo; right panel after Güsten et al. (1987)]


9.5 The nucleus 


Probably the most important gap in our understanding of the Universe re- 
lates to what goes on inside galactic nuclei. Observations of quasars and 
other active galactic nuclei (AGN; §4.6) indicate that more energy has been 
released by AGN since the Big Bang than by all the stars that have ever 
lived (see Problem 9.13), yet we have only vague theories of how this energy 
is released and know little of how it affects the surrounding Universe. Two 
major reasons for this sad state of ignorance are that whatever machinery 
powers an AGN is very small, and that only one galactic nucleus lies within 
the Galaxy. The study of this one readily accessible nucleus is clearly of the 
utmost importance. Our discussion closely follows that of Blitz et al. (1993), 
who give many references to the literature. 

The Galactic nucleus is far from active. In fact, understanding how it 
contrives to be so quiescent is a major puzzle. However, we have reason 
to believe that all nuclear activity in galaxies is episodic, so understanding 
the low luminosity of our own nucleus may enable us to understand the 
hyperactivity of other, more distant, nuclei. 

Figures 9.30 and 9.44 show that interesting things go on in the inner
~30 pc of the Galaxy. Figure 9.30, which covers a region 85 pc on a side,
gives an overview of radio-continuum emission from this region. The Galactic
plane runs from top left to bottom right of the panel, whose coordinates
are right ascension and declination. As we saw in §9.3.2, the filaments of
synchrotron emission seen up and to the left of the center of this figure are
unique in the strength of the magnetic field in them and their orientation
perpendicular to the plane.

At the lower right of Figure 9.30 we see the radio source Sgr A, which 
surrounds the nucleus of the Galaxy. The bright dot at its center is the 
unusual non-thermal radio source Sgr A*, which is thought to mark the 
very center of the Galaxy. This dot lies at (l, b) = (−0.054°, −0.046°) to
the right (West) of a crescent of enhanced emission. The left-hand panel 
of Figure 9.44 reveals the structure of this crescent, which extends about 
8 pc from top to bottom. Sgr A* lies near the center of a spiral of emission 
that is evident below and to the right of the picture — we refer to this as the 
mini-spiral. This emission is predominantly Bremsstrahlung radiation from 
ionized gas and its source is called Sgr A West to distinguish it from the 
more extensive region of non-thermal emission that dominates the rest of the 
picture; this non-thermal source is called Sgr A East. It is probably a shell 
of shocked gas, which is blasting outwards through the ISM that surrounds 
the nucleus. This blast wave may have been powered by supernovae and fast 
stellar winds, or by something more exotic. 

The right-hand panel of Figure 9.44 reveals some of the structure of
Sgr A West by superposing a contour map of 5 GHz continuum emission on
a gray-scale representation of the intensity of emission by HCN molecules.
One can trace the mini-spiral in the 5 GHz emission. It just about fits inside
an elliptical ring of HCN emission, whose semi-major axis is ~2 pc long.
From observations in other molecular lines we know that the HCN emission
comes from the inner edge of a ring of molecular material that extends to
~7 pc from Sgr A*; this is called the nuclear molecular ring.⁹ This ring
clearly does not lie within the plane, for we do not see it edge-on. Its velocity
field has been successfully modeled by circular rotation at v_c = 110 km s⁻¹ in
a plane of inclination i = 65° and position angle ~20°. Since v_c is inferred
to be approximately independent of radius, in the range 2 pc < r < 7 pc the
mass of the Galaxy must increase as M(r) ≈ 2.8 × 10⁶ (r/pc) M⊙.
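
The linear growth of M(r) follows from balancing gravity against centripetal
acceleration on a circular orbit, M(r) = v_c² r / G, if the mass distribution
is treated as spherical. The sketch below checks the coefficient; the spherical
approximation, the value of G in these units and the function name are
assumptions introduced for illustration.

```python
# Gravitational constant in units of pc (km/s)^2 per solar mass.
G_PC_KMS2_PER_MSUN = 4.301e-3

def enclosed_mass_msun(v_c_kms, r_pc):
    """Mass (solar masses) inside radius r for circular speed v_c,
    assuming a spherically symmetric mass distribution."""
    return v_c_kms ** 2 * r_pc / G_PC_KMS2_PER_MSUN

# Circular speed of the nuclear molecular ring quoted in the text.
for r in (1.0, 2.0, 7.0):
    print(f"r = {r:.0f} pc: M = {enclosed_mass_msun(110.0, r):.2e} Msun")
```

At r = 1 pc this gives roughly 2.8 × 10⁶ M⊙, matching the coefficient in the
text, and the mass scales in proportion to r while v_c stays constant.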

Within the hole at the center of the nuclear ring, filaments of ionized gas 
form the mini-spiral. Observations of infrared and radio-frequency lines show 
that velocity varies systematically with position along the spiral arms. Lacy, 
Achtermann & Serabyn (1991) found that they could obtain an acceptable 
fit to this velocity field if each parcel of gas was assumed to move within 
the plane of the nuclear ring on a circular orbit about Sgr A*: in this model 
gas does not flow along the spiral arms; rather the arms are filaments that 
are being wound into ever tighter spirals by differential rotation. The model 
requires a central mass 2–3 × 10⁶ M⊙ with up to 3 × 10⁶ M⊙ of further
material distributed through the inner 2 pc. 

9 See Lacy (1994) for a review and references.

Gas within the nuclear ring is ionized by radiation from a remarkable
star cluster whose center lies within 0.04 pc of Sgr A* and whose half-mass
radius is ~0.5 pc. This cluster can only be studied at infrared wavelengths,
in which it appears as a tight clustering of hot-spots. About a dozen of
these hot-spots have been shown to be sources of He I emission at 2.06 μm,
which implies that the responsible stars are hot (T_eff ≈ 20 000 K). If these
stars are supergiants or Wolf-Rayet stars, they are young, which implies that
about 10⁷ yr ago there was a burst of massive star formation at the center.
Alternatively, they may be stars that have been profoundly modified by mass 
exchanges in close collisions with other stars. Indeed, the density towards 
the center of the nuclear cluster is estimated to be ≳ 10⁶ pc⁻³, which may be
compared with < 0.1 pc⁻³ in the solar neighborhood. In these circumstances
serious traffic accidents are only to be expected. Allen (1994) discusses these 
possibilities and gives references to the literature. 

Sgr A* is a variable non-thermal radio source of size 0.3 ± 0.1 mas
(2.4 AU). Its luminosity is hard to determine accurately because it may be
dominated by optical and UV radiation that is completely obscured. Mezger 
(1994) places an upper limit on it, L S 7x 10° Lo. In principle, Sgr A* could 
be a neutron star, but it is more likely to be a massive black hole. First and 
foremost, observations of other galaxies make it likely that there is a massive 
black hole in the nucleus (§11.2.2). 

There are two ways to estimate the mass of this putative black hole.
One is to determine the velocities of stars and gas in its vicinity and to infer
from them the magnitude of the gravitational field that confines them. We
have seen above that gas velocities at r ~ 1 pc imply that about 10⁶ M⊙ lies
within the inner parsec. Measurements of stellar velocities enable us to probe
the Galactic potential much closer to Sgr A*. Genzel et al. (1997) find that
the proper motions of stars that are located within r ~ 0.016 pc of Sgr A*
have dispersion σ_* ≈ 1150 km s⁻¹. From such data one can estimate the
mass M(r) contained within a sphere of radius r about Sgr A*. Figure 9.45
shows that M seems to converge at small r to M(0) ≈ 2.6 × 10⁶ M⊙.


Does this mass reside in Sgr A* or simply around it? If Sgr A* is a
black hole of mass 2 × 10⁶ M⊙, its Schwarzschild radius¹⁰ would be a mere
2 × 10⁻⁷ pc, much smaller than the radius of the last point in Figure 9.45.
Hence, in principle the data plotted in Figure 9.45 are consistent with M(r)
turning down towards zero inside the last observed point, but the mass
density implied by such a turn-down would exceed ρ ≈ 2 × 10¹² M⊙ pc⁻³.
Moreover, a simple dynamical argument enables us to constrain directly the mass
of Sgr A* itself. Because the stellar density around the nucleus is very
high, the local dynamical relaxation time (§8.3.4 of BT) is short and Sgr A*
should be in dynamical equilibrium with the nuclear cluster. Consequently,
its random velocity should be drawn from a Maxwellian whose dispersion is
smaller than that of the nuclear cluster, σ_*, by a factor (M_*/M_A*)^(1/2),
where M_* ≈ 10 M⊙ is the mass of a typical star in the cluster (Genzel et al.
1997) and M_A* is the mass of Sgr A*. We cannot determine the radial velocity
of Sgr A* because its spectrum contains no lines, but we can measure its
proper motion (see §10.3.3 below). We may resolve this into components
parallel and perpendicular to the plane. Only the perpendicular component
is useful for our present purpose: it implies that the perpendicular velocity
of Sgr A* is v_⊥ = −11.6 ± 8.9 km s⁻¹. Thus the data are essentially
consistent with v_⊥ being vanishingly small, and M_A* being much larger than
M_*. Suppose on the other hand that further measurements demonstrated
that v_⊥ = 10 km s⁻¹, say. Then we could argue that the probability of v_⊥
being so low is

$$P(<v_\perp) = \mathrm{erf}\left[\left(\frac{M_{A*}}{M_*}\right)^{1/2}\frac{v_\perp}{\sigma_*}\right]. \qquad (9.24)$$

For σ_* = 1150 km s⁻¹ this yields probabilities P(< 10 km s⁻¹) = 0.003, 0.03
and 0.3 for M_A* = M⊙, 100 M⊙ and 10⁴ M⊙, respectively. Thus it is highly
implausible that Sgr A* is a neutron star, and extremely unlikely that it is
the remnant of a supergiant star. Moreover, the errors in the determination
of v_⊥ are rapidly decreasing, so these constraints on M_A* should become
even stronger within a few years.
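
The numbers in this argument are easy to reproduce. The sketch below evaluates
the Schwarzschild radius r_s = 2GM/c² for a mass of 2 × 10⁶ M⊙ and the
probabilities associated with equation (9.24). It assumes that the argument of
the error function is (M_A*/M_*)^(1/2) v_⊥/σ_*, a reading adopted here because
it reproduces the probabilities quoted in the text; the physical constants are
rounded standard values and the function names are illustrative.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8        # speed of light, m s^-1
M_SUN = 1.989e30   # solar mass, kg
PC = 3.086e16      # parsec, m

def schwarzschild_radius_pc(mass_msun):
    """Schwarzschild radius r_s = 2GM/c^2, returned in parsecs."""
    return 2.0 * G * mass_msun * M_SUN / C ** 2 / PC

def prob_slower_than(v_perp, m_bh, m_star=10.0, sigma_star=1150.0):
    """P(< v_perp) = erf[(M_A*/M_*)^(1/2) v_perp / sigma_*].

    Velocities are in km/s and masses in solar masses. The form of the erf
    argument is an assumed reading of equation (9.24).
    """
    return math.erf(math.sqrt(m_bh / m_star) * v_perp / sigma_star)

r_s = schwarzschild_radius_pc(2.0e6)
print(f"r_s = {r_s:.1e} pc")

probs = {m: prob_slower_than(10.0, m) for m in (1.0, 100.0, 1.0e4)}
for mass, p in probs.items():
    print(f"M_A* = {mass:g} Msun: P(< 10 km/s) = {p:.3f}")
```

The Schwarzschild radius comes out near 2 × 10⁻⁷ pc, and the three
probabilities are close to 0.003, 0.03 and 0.3, so a low measured v_⊥ strongly
disfavors a stellar-mass object.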


9.6 Small-scale structure of the ISM 


The nature of a galaxy’s ISM profoundly affects all the observable properties 
of the galaxy because it is from the ISM that stars form. In this section we 
examine the small-scale structure of the ISM, and investigate the connection 
between clouds of dense, cool gas and the formation of stars. Most of our 
knowledge of the small-scale structure of the ISM comes from studies of the 
Milky Way’s ISM because we can study this at very much higher spatial 
resolution than the ISM of any other galaxy. 


10 The Schwarzschild radius of a body of mass M is defined to be r_s = 2GM/c².
Even photons do not move fast enough to escape a body’s gravitational field at radii
smaller than r_s.

The ISM is extremely inhomogeneous. Studies of interstellar absorption
along lines of sight to nearby stars indicate that the Sun is located in an 
irregular region of low density. In most of this region, which extends to 
over 100 pc from the Sun in some directions, but only ~ 10 pc in others, the 
number density of H nuclei is nq < 10° m3 and the ionization level is high 
(e.g. Pounds et al. 1993). Scattered throughout this low-density region there 
are many clouds of denser material. The clouds have radii of a few parsecs 
and column densities Ny in the range 1024 — 102° m~?. From these numbers 
one infers particle densities ny 2 10’m7* and masses M ~ 100 Mo. The 
Sun is located in one of these small clouds, which occupy only a small fraction 
(S 5%) of the volume of the Galactic disk, but contain perhaps 90% of the 
mass of the ISM near the Sun. In the clouds the temperature is ~ 100K, 
while the temperature in the low-density material is ~ 10000 K. 

At distances of order 100 pc from the Sun there are large volumes of
much hotter plasma. This plasma must have a temperature $\sim 10^6$ K because
it is rich in highly ionized species such as O$^{5+}$, and emits soft X-rays. The
plasma must be located more than 100 pc from the Sun because in X-ray
maps made with the ROSAT observatory, nearby, cool interstellar clouds
show up as dark filaments on a bright background; these clouds lie in front
of the X-ray emitting plasma.

Regions filled by million-degree plasma are thought to form when gas 
that has been accelerated to velocities in excess of $1000\,\mathrm{km\,s^{-1}}$ by supernovae
and fast winds from hot, young stars, crashes into essentially stationary gas. 
This collision gives rise to a shock front that thermalizes the kinetic energy 
of the high-velocity gas. Outflows that are driven by many young stars and 
supernovae often give rise in this way to long thin regions of million-degree 
plasma, which are called worms. The largest worms extend from the plane 
right up into the Galactic halo, and are conspicuous features of 21-cm maps 
of the nearby Galaxy (Heiles 1984). 


9.6.1 Molecular gas in the Galaxy 


Stars do not form in clouds of ionized gas, or even in clouds of HI,
but in molecular clouds. Hence we now examine these enigmatic objects in
some detail. Solomon et al. (1987) and Solomon & Rivolo (1989) studied
the properties of 439 molecular clouds that they identified in a survey of the
region ($8^\circ < l < 90^\circ$, $|b| < 1^\circ$) in the 2.6 mm line of $^{12}$CO. Their beam had
a half-power full width of 47 arcsec and their spectra were taken 3 arcmin
apart, so their survey was slightly undersampled, but could resolve a cloud
bigger than 20 pc in diameter anywhere inside the solar circle. Clouds were
identified by searching for closed surfaces of constant antenna temperature
$T_A$ in three-dimensional $(l, b, v)$-space. Such a closed surface was catalogued
as a cloud provided that: (i) it contained at least 25 points of the $(l, b, v)$ grid;
and (ii) the total intensity contained within it exceeded a given threshold.
For example, surfaces on which $T_A = 4$ K were required to enclose at least
$60\,\mathrm{K\,km\,s^{-1}}$ of emission. The value of $T_A$ at which closed surfaces were
identified varied with $(l, b)$ so as to ensure that it was everywhere above the
background intensity. Inevitably, this procedure enables smaller clouds to
be catalogued in regions of low background intensity (such as the anticenter
direction) than in regions of high background intensity.

Figure 9.46 Physically bigger

Each catalogued cloud was characterized by its dispersions, weighted
by $T_A$, in $l$, $b$ and $v$. A model circular-speed curve was used to associate
each cloud with a location $(R, \phi)$ in the Galaxy as described in §9.1.1. Once
this location had been determined, the distance $d$ to each cloud could be
calculated and the cloud’s physical size $S$ estimated as


$$S = d\,\tan\left(\sqrt{\sigma_l \sigma_b}\right), \tag{9.25}$$


where $\sigma_l$ and $\sigma_b$ are the cloud’s dispersions in longitude and latitude.
Figure 9.46 shows that a cloud’s dispersion in $v$, $\sigma_v$, is quite tightly correlated
with $S$; when a straight line is fitted to the points in Figure 9.46, one obtains
the mean relation


$$\frac{\sigma_v}{\mathrm{km\,s^{-1}}} = (1.0 \pm 0.1)\left(\frac{S}{\mathrm{pc}}\right)^{0.5 \pm 0.05}. \tag{9.26}$$


The median value of $\sigma_v$ for the clouds is $\sim 3\,\mathrm{km\,s^{-1}}$, which is much larger
than the sound speed in gas as cold ($T < 30$ K) as that of the clouds.
Consequently, $\sigma_v$ must be dominated by turbulent velocities within the clouds.
The turbulent pressure within a cloud is thus $P \sim n_{\rm H} m_p \sigma_v^2$. In molecular
clouds $n_{\rm H} > 10^8\,\mathrm{m^{-3}}$ and this turbulent pressure exceeds by more than an
order of magnitude the pressure in the ISM between molecular clouds, which
is of order $n_{\rm H} m_p c_s^2$, with $n_{\rm H} \sim 10^5\,\mathrm{m^{-3}}$ and $c_s \sim 10\,\mathrm{km\,s^{-1}}$. If molecular
clouds are not confined by pressure from the surrounding ISM, they must
be either freely expanding or confined by gravity. The second possibility
seems the more plausible. By the virial theorem, it implies that the mass of
a cloud of size $S$ and velocity dispersion $\sigma_v$ is of order


$$M = K\,\frac{\sigma_v^2 S}{G}, \tag{9.27}$$


where $K$ is a dimensionless numerical factor that depends on the geometry
of the clouds. Solomon et al. (1987) suggest that $K \simeq 8.7$, and we adopt
this value in the following.
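
The statement that the observed line widths far exceed the sound speed is easy to verify. The following minimal sketch evaluates the isothermal sound speed $c_s = (kT/\mu m_{\rm H})^{1/2}$ at $T = 30$ K; the mean molecular weight $\mu = 2.33$ (molecular hydrogen plus helium) is our assumption, as are the function and variable names.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K
M_H = 1.6735e-27     # mass of a hydrogen atom, kg
MU = 2.33            # ASSUMED mean molecular weight of molecular gas with helium


def sound_speed_kms(temperature_k, mu=MU):
    """Isothermal sound speed in km/s for gas at the given temperature."""
    return math.sqrt(K_B * temperature_k / (mu * M_H)) / 1.0e3


c_s = sound_speed_kms(30.0)   # upper limit on the cloud temperature from the text
sigma_v = 3.0                 # median cloud velocity dispersion from the text, km/s
print(f"c_s = {c_s:.2f} km/s, sigma_v / c_s = {sigma_v / c_s:.1f}")
```

Under these assumptions the sound speed is a few tenths of a kilometer per second, so the internal motions are highly supersonic and cannot be thermal.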

Figure 9.47 shows the resulting distribution over mass of the clouds in
the Solomon et al. sample. At the high-mass end, this distribution
is consistent with a power-law relation


$$\frac{dN}{dM} \propto M^{-\alpha} \quad\mathrm{with}\quad \alpha = 1.5. \tag{9.28}$$

At low masses the distribution falls well below this relationship because
low-mass clouds cannot be detected throughout the inner Galaxy and no attempt
has been made to correct the data for this bias. In a study of clouds in
the nearby Perseus arm (see below) Digel et al. (1996) found that the best
power-law fit to the mass spectrum in the range $2000 < M/M_\odot < 40\,000$
has exponent $\alpha = 1.7$, which is close to the value of equation (9.28). If the
mass distribution of the clouds follows equation (9.28) at high masses, the
mass contained in clouds less massive than $M$ grows as $M^{1/2}$. This result
indicates that most of the Galaxy’s molecular gas is concentrated in the most
massive clouds.
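
To see how strongly a spectrum $dN/dM \propto M^{-\alpha}$ with $\alpha = 1.5$ favors massive clouds, one can integrate $M\,dN/dM$ over two mass ranges. The sketch below does this analytically; the limits $10^3\,M_\odot$ and $10^6\,M_\odot$ are illustrative assumptions, not values taken from the survey.

```python
def mass_in_range(m_lo, m_hi, alpha=1.5):
    """Integral of M * dN/dM from m_lo to m_hi for dN/dM = M**(-alpha).

    The normalization is arbitrary, so only ratios are meaningful.
    Valid for alpha != 2.
    """
    p = 2.0 - alpha
    return (m_hi ** p - m_lo ** p) / p


M_MIN, M_MAX = 1.0e3, 1.0e6   # ASSUMED lower and upper mass limits, in Msun

frac_top_decade = mass_in_range(1.0e5, M_MAX) / mass_in_range(M_MIN, M_MAX)
print(f"fraction of mass in clouds above 1e5 Msun: {frac_top_decade:.2f}")
```

With these limits roughly 70 per cent of the mass lies in the top decade of cloud mass, even though such clouds are a small minority by number.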

Equation (9.27), together with the result $\sigma_v \propto S^{0.5}$, implies that all
clouds have the same mean surface density, $\sim 170\,M_\odot\,\mathrm{pc^{-2}}$, and that the
characteristic density of a cloud decreases with increasing mass.
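
The scaling behind this statement can be sketched numerically. The code below combines the central values of the size-linewidth relation, $\sigma_v = 1.0\,(S/\mathrm{pc})^{0.5}\,\mathrm{km\,s^{-1}}$, with the virial estimate $M = K\sigma_v^2 S/G$ for $K = 8.7$. The function names are ours, and the value of $G$ is expressed in units of pc (km/s)$^2$ per solar mass.

```python
G = 4.30091e-3   # gravitational constant in pc (km/s)^2 / Msun
K_GEOM = 8.7     # geometric factor adopted in the text


def sigma_v(size_pc):
    """Velocity dispersion in km/s from the central size-linewidth relation."""
    return 1.0 * size_pc ** 0.5


def virial_mass(size_pc):
    """Virial mass in Msun of a cloud of size S (in pc)."""
    return K_GEOM * sigma_v(size_pc) ** 2 * size_pc / G


for size in (5.0, 10.0, 20.0, 40.0):
    mass = virial_mass(size)
    print(f"S = {size:4.0f} pc  M = {mass:9.3e} Msun  "
          f"M/S^2 = {mass / size**2:7.1f}  M/S^3 = {mass / size**3:7.2f}")
```

The ratio $M/S^2$ comes out the same for every cloud, while $M/S^3$ falls with size and hence with mass. Matching the quoted $170\,M_\odot\,\mathrm{pc^{-2}}$ would require assigning each cloud a projected area of roughly $12S^2$, a geometric factor that this passage does not state.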

We conclude this chapter by returning to the crucial task of understanding
the connection between the amount of matter in a given cloud and the
measured intensity of the CO line radiation from it. The latter is
conventionally measured in terms of the quantity $I_{\rm CO}$ that is defined by equation
(8.25), and its relation to the total column density of H is expressed in terms
of the quantity $X$ that is defined by equation (8.28). $X$ has been determined
in three ways. We discuss each in turn.


$X$ from virial masses. Solomon et al. (1987) show that the mass of a
cloud is tightly correlated with the integral, $L_{\rm CO}$, of $I_{\rm CO}$ over its surface.
The mean relation is


$$\frac{M}{M_\odot} = 39\left(\frac{L_{\rm CO}}{\mathrm{K\,km\,s^{-1}\,pc^2}}\right)^{0.81 \pm 0.03} \quad\mathrm{where}\quad L_{\rm CO} = \int d^2x\, I_{\rm CO}. \tag{9.29}$$

In an individual cloud, both $N({\rm H_2})$ and $I_{\rm CO}$ will decline from maxima near
the center to small values at the edge. If $X$ were constant, $L_{\rm CO}$, which is
the area-integral of $I_{\rm CO}$, would be proportional to $M$, which is proportional
to the area integral of $N({\rm H_2})$. Since the exponent in equation (9.29) is not
unity, $I_{\rm CO}$ cannot be precisely proportional to $N({\rm H_2})$. However, since 0.8
is close to unity and, as we have seen, most molecular gas is in the largest
clouds, we overlook this problem and evaluate $X$ as the ratio of the mean
value of $N({\rm H_2})$ to the mean value of $I_{\rm CO}$ in a typical large cloud of mass
$M = 10^6\,M_\odot$. We then find $X = 2.5 \times 10^{24}$.
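
An order-of-magnitude check of this value is sketched below. It reads the mean relation as $M/M_\odot = 39\,(L_{\rm CO}/\mathrm{K\,km\,s^{-1}\,pc^2})^{0.81}$ and assumes that all of the virial mass is molecular hydrogen, so that the ratio of mean column density to mean intensity is $M/(2m_{\rm H}L_{\rm CO})$. The coefficient, the units and the neglect of helium are assumptions of the sketch, so only rough agreement with the quoted value should be expected.

```python
M_SUN = 1.98847e30   # kg
PC = 3.0857e16       # m
M_H = 1.6735e-27     # mass of a hydrogen atom, kg


def l_co_from_mass(mass_msun, coeff=39.0, slope=0.81):
    """Invert M = coeff * L_CO**slope; L_CO in K km/s pc^2 (ASSUMED form)."""
    return (mass_msun / coeff) ** (1.0 / slope)


def x_from_virial_mass(mass_msun):
    """Mean N(H2) / mean I_CO in m^-2 (K km/s)^-1, treating all mass as H2."""
    l_co = l_co_from_mass(mass_msun)
    # One Msun per pc^2, expressed as H2 molecules per m^2.
    h2_per_m2 = M_SUN / PC**2 / (2.0 * M_H)
    return (mass_msun / l_co) * h2_per_m2


x_virial = x_from_virial_mass(1.0e6)
print(f"X ~ {x_virial:.2e} m^-2 (K km/s)^-1")
```

The cloud's area cancels in the ratio, which is why only the mass and the CO luminosity enter.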


$X$ from $\gamma$-rays. In §8.1.5 we saw that $\gamma$-rays are produced when cosmic
rays collide with protons and other atomic nuclei. Hence, at each point in
the Galaxy, the $\gamma$-ray emissivity, $j_\gamma$, should be proportional to the product
$q_\gamma n_{\rm H}$, where $q_\gamma$ is the cosmic-ray flux. Cosmic rays are thought to diffuse
quite rapidly through space, so $q_\gamma$ should vary much less rapidly with position
than $n_{\rm H}$. If $q_\gamma$ were constant throughout the disk, $j_\gamma$ would be proportional
to $n_{\rm H}$, and the observed $\gamma$-ray intensity down any line of sight, $I_\gamma$, would
be proportional to $N({\rm H_{tot}})$. In reality, $q_\gamma$ does vary through the disk, but,
to a reasonable approximation, it can be assumed to be a function only of
Galactocentric radius. With this assumption, Strong et al. (1988) estimated
$X$ by decomposing $I_\gamma(l,b)$ for $l > 12^\circ$ into a sum of contributions $I_{\gamma i}(l,b)$
from each of a series of annuli in the disk, and then writing


$$I_{\gamma i}(l,b) = q_{\gamma i}\left[N_{{\rm HI}\,i}(l,b) + 2X_i I_{{\rm CO}\,i}(l,b)\right] + I_{{\rm p}i}. \tag{9.30}$$


Here $N_{{\rm HI}\,i}(l,b)$ is the column density of HI through the annulus in the
direction $(l,b)$, $X_i$ is the value of $X$ for the annulus, $I_{{\rm CO}\,i}$ is the 2.6 mm brightness
temperature at $(l,b)$ integrated over the velocities of the annulus, and $I_{{\rm p}i}$ is
the (small) contribution to $I_{\gamma i}$ from inverse-Compton scattering and point
$\gamma$-ray sources such as pulsars. Finally, Strong et al. optimized the fit between
the observed and predicted values of $I_\gamma(l,b)$ by varying the parameters $q_{\gamma i}$
and $X_i$. This procedure was carried out for each range of $\gamma$-ray energies
for which data were available from the COS-B satellite.

Strong et al. first assumed that $X$ was the same for every annulus and
showed that the best fit to the data is obtained when the $q_{\gamma i}$ are taken to
be independent of energy, while $X$ decreases slightly with increasing energy.
Then they asked whether significantly better fits to the data could be
obtained by allowing $X$ to be smaller by a factor $a$ at $R < 0.8R_0$ than at
$R > 0.8R_0$. They concluded that the low-energy data favor large values of
$a$, while the high-energy data, which they consider most reliable, are
consistent with $a = 1$. They also concluded that $q_\gamma$ decreases by about a factor 2
between $0.2R_0$ and $1.5R_0$ and that the best value of $X$ is $(2.3 \pm 0.3) \times 10^{24}$,
in remarkably good agreement with the result of Solomon et al. (1987).
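
The logic of such a fit can be illustrated with a deliberately simplified sketch. For a single annulus, and neglecting the small inverse-Compton and point-source term, the model $I_\gamma = q_\gamma[N_{\rm HI} + 2XI_{\rm CO}]$ is linear in $a = q_\gamma$ and $b = 2q_\gamma X$, so both follow from ordinary least squares. The code below is not the procedure of Strong et al.; the data are synthetic and noise-free, and all names and numbers are made up for illustration.

```python
def fit_annulus(n_hi, i_co, i_gamma):
    """Least-squares fit of i_gamma = a * n_hi + b * i_co.

    Returns (q, X), where a = q and b = 2 * q * X.
    """
    s_nn = sum(n * n for n in n_hi)
    s_nw = sum(n * w for n, w in zip(n_hi, i_co))
    s_ww = sum(w * w for w in i_co)
    s_ni = sum(n * i for n, i in zip(n_hi, i_gamma))
    s_wi = sum(w * i for w, i in zip(i_co, i_gamma))
    det = s_nn * s_ww - s_nw ** 2          # determinant of the normal equations
    a = (s_ni * s_ww - s_wi * s_nw) / det
    b = (s_nn * s_wi - s_nw * s_ni) / det
    return a, b / (2.0 * a)


# Synthetic lines of sight (hypothetical values).
Q_TRUE = 1.5    # emissivity per H nucleus, arbitrary units
X_TRUE = 0.23   # in units of 1e25 m^-2 (K km/s)^-1
n_hi = [1.0, 2.0, 3.0, 4.0, 5.0]      # HI column densities, units of 1e25 m^-2
i_co = [10.0, 2.0, 25.0, 5.0, 40.0]   # CO intensities, K km/s
i_gamma = [Q_TRUE * (n + 2.0 * X_TRUE * w) for n, w in zip(n_hi, i_co)]

q_fit, x_fit = fit_annulus(n_hi, i_co, i_gamma)
print(f"q = {q_fit:.3f}, X = {x_fit:.3f} x 1e25 m^-2 (K km/s)^-1")
```

The fit can separate $q_\gamma$ from $X$ only if the HI and CO maps are not proportional to one another; if they were, the determinant `det` would vanish.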


$X$ from $A_V$. In §8.1.4 we saw that the 2.6 mm line of $^{12}$CO is optically
thick in many directions, with the result that $I_{\rm CO}$ is not proportional to
$N({\rm H_2})$ when data of high spatial resolution are examined in detail, line of
sight by line of sight. Notwithstanding this conclusion, we argued that when
$I_{\rm CO}$ is averaged over many lines of sight, the average value of $I_{\rm CO}$ may be
proportional to the average value of $N({\rm H_2})$. In the last two subsections we
have seen how this proportionality between averaged quantities has been
established and calibrated. Now we establish a relationship for individual
lines of sight between the intensities of CO lines and values of $N({\rm H_{tot}})$ that
are estimated from the extinction, $A_V$, and equation (8.50).

Since a spectral line that has completely saturated conveys no
information about the emitting column density, we focus on the intensity of the
2.6 mm line from the relatively rare species $^{13}$CO. In Figure 9.48 the
velocity-integrated antenna temperature of this line, $I_{13}$, is plotted against the visual
extinction, $A_V$, for lines of sight through the Taurus molecular cloud. The
two variables are evidently fairly tightly correlated, with a mean relation


$$I_{13} = 1.6\,A_V^{0.48}\,\mathrm{K\,km\,s^{-1}}. \tag{9.31}$$


Hence, the optical thickness of the spectral line causes the line’s intensity to
increase roughly as the square root of $A_V$, and thus also as the square root
of $N({\rm H_{tot}})$.¹¹ If we could identify a particular point on the mean relation
(9.31) as the one responsible for the bulk of the CO emission in the Galaxy, we
could derive the most appropriate value of $X$. Sanders, Solomon & Scoville
(1984) argue that the relevant point is at $I_{13} = 7\,\mathrm{K\,km\,s^{-1}}$ because this value
of $I_{13}$ is characteristic of giant molecular clouds in large-scale surveys of the
Galaxy. For this value of $I_{13}$, equation (9.31) yields $A_V = 21.6$, and, when we
insert this extinction into equation (8.50), we obtain $N({\rm H_2}) \simeq \frac12 N({\rm H_{tot}}) =
2.05 \times 10^{26}\,\mathrm{m^{-2}}$. Hence, we tentatively write


$$\frac{N({\rm H_2})}{\mathrm{m^{-2}}} \simeq 2.0 \times 10^{26}\,\frac{I_{13}}{7\,\mathrm{K\,km\,s^{-1}}}.$$


When giant molecular clouds are identified in large-scale surveys, the
velocity-integrated antenna temperatures of the 2.6 mm lines of $^{12}$CO and $^{13}$CO are
typically in the ratio 5.5:1. Hence, in this relation we replace $I_{13}$ by
$I_{\rm CO}/5.5$ and conclude that $X = 5.2 \times 10^{24}\,(\mathrm{K\,km\,s^{-1}})^{-1}\,\mathrm{m^{-2}}$. This value is twice
as large as our other estimates. Clearly, a lower value would have been
obtained if we had started from a lower point on the line in Figure 9.48 than
that at $I_{13} = 7\,\mathrm{K\,km\,s^{-1}}$. This method of estimating $X$ has the virtue of forcing us
to recognize that no value of $X$ can yield more than a crude approximation
to the truth.
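
The chain of arithmetic in this estimate can be retraced as follows. The sketch takes the mean relation as $I_{13} = 1.6\,A_V^{0.48}\,\mathrm{K\,km\,s^{-1}}$ and assumes a gas-to-extinction ratio $N({\rm H_{tot}})/A_V = 1.9 \times 10^{25}\,\mathrm{m^{-2}}$ per magnitude in place of equation (8.50), which is not reproduced here. Both values are assumptions chosen to be consistent with the numbers quoted above, and the variable names are ours.

```python
N_H_PER_AV = 1.9e25   # ASSUMED N(H_tot) / A_V in m^-2 per magnitude
RATIO_12_13 = 5.5     # typical I_CO / I_13 quoted in the text
I_13 = 7.0            # K km/s, point adopted for giant molecular clouds


def a_v_from_i13(i_13, coeff=1.6, slope=0.48):
    """Invert I_13 = coeff * A_V**slope (ASSUMED form of the mean relation)."""
    return (i_13 / coeff) ** (1.0 / slope)


a_v = a_v_from_i13(I_13)
n_h2 = 0.5 * N_H_PER_AV * a_v            # half of the H nuclei, counted as molecules
x_av = n_h2 / (RATIO_12_13 * I_13)       # N(H2) / I_CO

print(f"A_V = {a_v:.1f}, N(H2) = {n_h2:.2e} m^-2, X = {x_av:.2e}")
```

The sketch gives a slightly larger $X$ than the value in the text because the text rounds $N({\rm H_2})$ down to $2.0 \times 10^{26}\,\mathrm{m^{-2}}$ before dividing. Repeating the calculation for a smaller `I_13` shows directly how the inferred $X$ falls as one moves down the mean relation.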


Problems 


9.1 The velocity at a point in an expanding ring is
$\mathbf{\Omega} \times \mathbf{R} + (v_e/R)\mathbf{R}$, where
$\mathbf{R}$ is the Galactocentric position vector of the point. For such a ring, show
that equation (9.3) may be written


$$v_{\rm los}(l) = \left[\Omega(R) - \Omega(R_0)\right] R_0 \sin l + v_e\,\frac{d^2 + R^2 - R_0^2}{2Rd}, \tag{9.32}$$


where $d$ is the heliocentric distance to the point in the ring that lies along
the line of sight at Galactic longitude $l$. Show that, in the limit $R \to \infty$, the
last term tends to $v_e$.


9.2 Estimate the path-length one must go at (a) $l = 30^\circ$ and (b) $l = 90^\circ$ for
the spread in the radial velocities of hydrogen atoms due to Galactic rotation
to exceed the $\sim 9\,\mathrm{km\,s^{-1}}$ that arises from interstellar turbulence. [Assume
that the rotation curve is perfectly flat and of amplitude $v_c = 220\,\mathrm{km\,s^{-1}}$.]


9.3 Determine which way around the aerofoil-shaped curve of Figure 9.8 a 
cloud moves as it orbits in Figure 9.7 clockwise around the large full curve. 


9.4 If the circular speed of the Milky Way is constant at $v_c = 220\,\mathrm{km\,s^{-1}}$
from $R_0$ outwards, and no 21-cm emission is seen beyond $v_{\rm los} = 130\,\mathrm{km\,s^{-1}}$
at $l = 225^\circ$ (Knapp, Tremaine & Gunn 1978), derive an upper limit to the
radius of the HI disk under the assumption that the disk is perfectly flat.


11 There is an interesting parallel between this result and the result of Problem 8.2.

9.5 Estimate the telescope time that would be required for a fully sampled
survey of the Milky Way at $|b| < 5^\circ$ in the 2.6 mm CO line with a 10 m
single-dish telescope. Assume that each pointing takes one minute.


9.6 The HI disk is seriously warped at large R. In which direction has 
one the best chance of detecting 21-cm emission from the largest possible 
Galactocentric radii? 


9.7 We will see in §10.4.3 that the thin component of the stellar disk in
the solar neighborhood has a characteristic thickness of $z_{1/2} \sim 250$ pc, and
a velocity dispersion perpendicular to the Galactic plane of $\sigma_z \sim 18\,\mathrm{km\,s^{-1}}$.
Use these values, together with the hydrostatic equilibrium relation of
equation (11.15), to predict what the local thickness of the HI disk should be
if it were also in hydrostatic equilibrium. Hence, show that the observed
thickness of the HI disk is approximately a factor of three too large, and
that additional forces must be responsible for its support.


9.8 Express Galactic longitude, $l$, as a function of the Galactocentric polar
coordinates, $(R, \phi)$.


9.9 Assuming that one third of the vertical support of the HI layer comes 
from the random motions of clouds, and neglecting the complication intro- 
duced by the Milky Way’s warp, use the data in Figure 9.25 to estimate how 
the surface density of material near the Galactic plane, $\Sigma(R)$, varies with
radius. 


9.10 Let $R_{\rm in}$ be the inner radius of the middle disk and assume that the
circular speed is constant at $v_c$ at radii in the interval $R_{\rm in} < R < R_0$. Show
that the maximum line-of-sight velocity at which 21-cm emission from the
middle disk will be observed at longitude $l$ is


$$v_{\max} = \left(\frac{R_0}{R_{\rm in}} - 1\right) v_c \sin l. \tag{9.33}$$


9.11 Assume that the Galactocentric radius, $R$, of an HI cloud is given as a
function of time by $R = R_g + X\cos\kappa t$, where $R_g$, $X$ and $\kappa$ are constants (cf.
BT §3.2.3). Assume further that $\kappa \simeq \sqrt2\,\Omega$, where $\Omega$ is the circular frequency
at $R_g$. Show that


$$\frac{X}{R_g} = \frac{v_r(\max)}{\sqrt2\,v_c}, \tag{9.34}$$


where $v_r(\max)$ is the largest line-of-sight velocity at which the cloud could
possibly be detected against the Galactic center and $v_c = R_g\Omega$. Hence,
estimate the elongation of the orbits that would be required if clouds are to
be observed at $150\,\mathrm{km\,s^{-1}}$ along $l = 0$.




9.12 Burton & Liszt (1978) argue that the Milky Way’s nuclear disk is tilted
by $22^\circ$ with respect to the plane $b = 0$ and $78^\circ$ with respect to the plane of
the sky. Using the notation of §9.4, show that this orientation corresponds
to $(\theta_t = 22^\circ, \phi_t = 56^\circ)$. Show further that the line of nodes is the line
$\tan b = -0.335\tan l$.


9.13 The observed luminosity density of quasars as a function of redshift
implies that AGN have radiated $E_{\rm AGN} \gtrsim 2 \times 10^5\,M_\odot c^2\,\mathrm{Mpc^{-3}}$ (Chokshi
& Turner 1992). Compare this energy density with that yielded by the
nucleosynthesis of the metals required to raise all luminous galaxies to solar
metallicity. Assume that the galaxy luminosity function is given by equation
(4.3), that galaxies have mass-to-light ratio $\Upsilon_B = 3$, and that conversion of
a mass $M$ of primordial gas to solar-metallicity gas releases energy $E_{\rm ns} =
1.4 \times 10^{-4}\,Mc^2$.


10 
Components of the Milky Way 


We are able to study the workings of one particular galaxy, the Milky Way, in 
extraordinary detail. In this chapter we shall examine observations of stars 
within the Milky Way with two goals in mind. First, we wish to determine 
what kind of galaxy the Milky Way is. Where does it lie on the Hubble 
sequence? How large is it? Second, we want to learn as much as we can about 
galaxies in general by closely examining our particular specimen. What kinds 
of stars are disks made of? Are thick disks made of the same kinds of stars 
as thin disks? How are the different morphologies of thin disks, thick disks 
and bulges reflected in the patterns of motion of their constituent stars? Do 
the sort of high-resolution measurements that we can make of the Milky Way 
provide any clues as to how galaxies formed and evolve? 

Ideally, the existence of the Milky Way’s components would emerge nat- 
urally from the observational data rather than being assumed at the outset. 
It may be that the data are now so extensive that the components can be 
made to emerge in this way, but to approach the data in a totally unbiased 
way would be a large, and, we fear, confusing enterprise. It is didactically 
much easier to say ‘here we have stars belonging to the thick disk, over there 
you see characteristic bulge objects,’ and so forth. At the end of the day 
the reader must judge whether the picture that has been painted is a co- 
herent and convincing one, or a modernist daub that only a pseud or an art 
gallery director would buy. In particular, controversy rages as to whether the 
components into which we shall divide the Galaxy are really distinct objects
or just the beginning, the middle and the end of an inherently continuous
structure. By constantly referring to these components, we do not wish to 
prejudice the outcome of this controversy. 

What makes the task in hand exceptionally confusing is the great variety 
of the available data. The crudest data are surface-brightness measurements 
analogous to those we have of external galaxies. Although these data are 
crude in the sense that they distinguish very little between different kinds 
of stars, they are hard to gather: the problem of sky subtraction, which is 
difficult in the case of external galaxies (see §4.2), is horrendous in the case 
of the Milky Way, which covers the whole sky rather than just a fraction of 
a square degree. Fortunately, in the last few years advances in the surface 
photometry of the Milky Way have been made which have allowed us to 
probe for the first time the surface brightnesses of the disk and bulge. The 
key has been to observe in the infrared from space: by working in the infrared 
one can largely overcome the problem posed by the huge quantities of dust 
that lie between us and the inner Galaxy, and by working from space one 
can make the problem of sky-subtraction manageable. 

The earliest work on Galactic structure was exclusively concerned with 
star counts; that is, the numbers of stars with given colors per unit area of 
sky per magnitude interval in each of several directions on the sky. In the 
1980s this field was revived, after decades of torpor, by two advances: (i) 
the advent of automatic plate-measuring machines and friendly computers 
to analyze the data they produce; (ii) models of the Milky Way that are 
inspired by observations of external galaxies, and allow one to predict the 
star counts for any field given assumed disk and bulge parameters. 

The spectra of stars provide crucial insights into Galactic structure, 
because they betray not only the star’s radial velocity, but also its chemi- 
cal composition. Unfortunately, spectroscopy is time consuming even with 
modern techniques. Therefore one has to be selective in deciding which stars 
should be observed. Two very different strategies are commonly employed. 
The selected stars are those in a given field and magnitude range that either 
have specified colors, or have specified proper motions. These selection cri- 
teria are invaluable because they allow one to focus on one component of the 
Milky Way in preference to the others. However, they greatly complicate the 
statistical interpretation of spectroscopic surveys because one knows exactly 
which portion of the Milky Way a given survey has brought into focus only 
when one has a complete understanding of the structure of the Milky Way as 
a whole; and, of course, it is precisely because one lacks that knowledge that
the survey was undertaken in the first place! In these circumstances, models
play an essential rôle in the interpretation of data; the data are interpreted
in terms of the model that was used to design the selection criteria, and any 
discrepancies lead to a revision of the model and perhaps the definition of 
a new survey. This process is, unfortunately, both laborious and potentially 
confusing.

The confusion that is introduced by the interpretation of data in terms
of inaccurate models is exacerbated by a fundamental weakness in the avail- 
able models, namely, their near total disregard for Newton’s laws of motion. 
In reality the positions and velocities of stars are tightly connected by dy- 
namics, but the techniques of dynamical modeling have developed so slowly 
that most current models reflect this coupling only to an extremely limited 
degree. Not only does this make the models internally inconsistent, but it 
endows them with more free parameters than are strictly necessary, and this 
circumstance obviously makes it harder to search for a model that can ac- 
count satisfactorily for any given body of data. Fortunately, the stage does 
now seem set for the advent of dynamically consistent models — see Dehnen 
& Binney (1996) for details. 

The layout of this chapter is the following. We start by examining the 
Milky Way using the kinds of data that are available for external galaxies 
— surface photometry and line-of-sight velocity distributions. Specifically in 
§10.1 we examine the structure of the Milky Way from surface photometry, 
and in §10.2 we use velocity data to examine the bulge. Then we turn to an 
analysis of the solar neighborhood, which we can observe by means of tools 
that will probably never be available for the examination of most external 
galaxies. In §10.3 we analyze the velocities of common stars near the Sun. 
These measurements enable us to determine the peculiar velocity of the Sun 
itself, and to probe the differential rotation of the Galactic disk. In §10.4 
we describe correlations that exist between the ages, chemical compositions 
and kinematics of stars that are confined to the disk. Combining insights ob- 
tained in this way with results obtained from star-counts, we shall conclude 
that the disk is a composite object, and determine its surface density. Next, 
in §10.5, we examine the halo. This component can be traced to large dis- 
tances through globular clusters, field HB stars and metal-weak giants, and 
can also be studied through its rare representatives in the immediate solar 
neighborhood. §10.6 describes computer models of the Milky Way. These 
models combine all the Galaxy’s components in such a way that the results 
of various types of surveys can be predicted before observing begins, and 
interpreted physically after the observations are complete. Thus, in a sense, 
they summarize our understanding of the Milky Way. Finally, in §10.7, we 
discuss the implications of all that we have learned for theories of how the 
Galaxy came to be structured the way it is. This is a question on which there 
is still no consensus within the scientific community, so this section strays 
further from the bedrock of secure knowledge than does most of the rest of 
this book.


10.1 Gross Structure from Surface Photometry


Most of the Milky Way is heavily obscured by dust — its center is obscured 
by ~ 30 magnitudes in V. Hence, surface photometry that contains useful 
information on the large-scale structure of the Galaxy can only be obtained 
at infrared wavelengths, at which dust is much less opaque than in the vis- 
ible. While obscuration by dust decreases monotonically with increasing 
wavelength, the optimum waveband for observations of the Galaxy lies in 
the near infrared. Typical stars radiate a substantial fraction of their lumi- 
nosity in the near infrared, while contributing very little luminosity to the 
mid and far infrared. Interstellar dust, by contrast, emits most strongly in 
the mid and far infrared (§8.1.6). Hence the stellar system stands out most 
prominently against the background from dust at 2–4 $\mu$m.

The Galaxy has been observed in the near-infrared with an angular 
resolution of order a degree by several balloon-borne telescopes, from the 
Space Shuttle and by the COBE satellite — for references see Kent, Dame 
& Fazio (1991) and Freudenreich (1998). Structure on scales smaller than a 
degree, which is important close to the center (where one degree corresponds 
to 140 pc), has only been studied from the ground (Becklin & Neugebauer 
1968, Allen et al. 1983, Eckart et al. 1994). 

Even at 2–4 $\mu$m there is non-negligible extinction over much of the
bulge, and this absorption must be taken into account when interpreting 
observations. This has been done in three entirely different ways. 

In the approach of Kent, Dame & Fazio (1991), one starts by deriving 
a model of the distribution of interstellar gas in the Galaxy from radio-line
observations of HI and CO. The optical depth $\tau(R,\phi,z)$ to absorption
between the point $(R,\phi,z)$ in the Galaxy and the Sun then follows from
the assumption that the standard gas-to-dust ratio, equation (3.65), holds
everywhere. Once $\tau$ has been determined, the surface brightness predicted
at $(l,b)$ is simply


$$I(l,b) = \int_0^\infty ds\, e^{-\tau} j_*(R,\phi,z), \tag{10.1}$$


where $(R,\phi,z)$ is the point at a distance $s$ from the Sun in the direction $(l,b)$,
and $j_*$ is the stellar luminosity density. Kent et al. found that they could
accurately reproduce small-scale structure in the 2.4 $\mu$m brightness
distribution that was observed from the Space Shuttle with a smooth distribution of
luminosity density. That is, small-scale irregularities in the observed 2.4 $\mu$m
surface brightness arise naturally from patchiness in the predicted
obscuration rather than from irregularities in the stellar distribution.
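
A line-of-sight integral of this kind is straightforward to evaluate numerically. The following minimal sketch integrates $e^{-\tau}j$ outward from the Sun with the midpoint rule, accumulating the optical depth along the way. It is not the code of Kent et al.: the coordinate convention (Sun at $R_0 = 8$ kpc, $\phi = 0$ along the Sun–center line), the toy emissivity and opacity laws, and all function names are assumptions made purely for illustration.

```python
import math

R0 = 8.0  # ASSUMED distance of the Sun from the Galactic center, kpc


def galactic_to_cyl(l_deg, b_deg, s):
    """Galactocentric (R, phi, z) of the point at distance s (kpc) toward (l, b)."""
    l, b = math.radians(l_deg), math.radians(b_deg)
    x = R0 - s * math.cos(b) * math.cos(l)   # x axis points from the center to the Sun
    y = s * math.cos(b) * math.sin(l)
    z = s * math.sin(b)
    return math.hypot(x, y), math.atan2(y, x), z


def surface_brightness(l_deg, b_deg, emissivity, opacity, s_max=30.0, n=3000):
    """Midpoint-rule estimate of the integral of exp(-tau) * j along the line of sight.

    emissivity(R, phi, z) is the luminosity density; opacity(R, phi, z) is the
    absorption coefficient per kpc, so tau is its integral from the Sun.
    """
    ds = s_max / n
    tau = 0.0
    total = 0.0
    for i in range(n):
        s = (i + 0.5) * ds
        R, phi, z = galactic_to_cyl(l_deg, b_deg, s)
        kappa = opacity(R, phi, z)
        tau_mid = tau + 0.5 * kappa * ds     # optical depth to the cell center
        total += math.exp(-tau_mid) * emissivity(R, phi, z) * ds
        tau += kappa * ds
    return total


def toy_emissivity(R, phi, z):
    """Hypothetical smooth double-exponential disk, arbitrary units."""
    return math.exp(-R / 3.0 - abs(z) / 0.2)


def toy_opacity(R, phi, z):
    """Hypothetical thin dust layer, per kpc."""
    return 0.5 * math.exp(-R / 3.0 - abs(z) / 0.1)


i_clear = surface_brightness(20.0, 0.0, toy_emissivity, lambda R, phi, z: 0.0)
i_dusty = surface_brightness(20.0, 0.0, toy_emissivity, toy_opacity)
print(f"dust-free: {i_clear:.3f}, with dust: {i_dusty:.3f}")
```

Replacing `toy_opacity` by a patchy function of position while keeping `toy_emissivity` smooth is the numerical analogue of the argument above: structure in the predicted brightness then comes entirely from the obscuration.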

An objection to the analysis of Kent, Dame & Fazio (1991) is that it
employs a model of the interstellar hydrogen density that is uncertain for
two reasons: first, it depends upon the assumption of circular rotation,
which we have seen is significantly in error for $R \lesssim 4$ kpc (§9.4); second,
it involves the conversion of CO linewidths to hydrogen column densities
$N_{\rm H}$ and, as we saw in §9.6.1, this conversion is problematical, especially
near the Galactic center, where obscuration is heaviest. Spergel, Malhotra
& Blitz (1996) estimated the obscuration by a different technique that is
not subject to this objection. They first modeled the emission observed by
COBE at 240 $\mu$m as thermal radiation from dust (see §9.3.1). As a first
approximation, the dust was taken to be distributed in an axisymmetric
double-exponential disk, equation (4.45), in which the scale height
$z_0$ of the dust disk increased linearly with radius. The free parameters in
this model were adjusted to optimize the fit between the observed 240 $\mu$m
emission and that predicted by the model. Let $\rho_{\rm axi}(R,z)$ be the best-fit model’s
dust density and $I_{\rm axi}(l,b)$ its 240 $\mu$m surface-brightness distribution. Then
a non-axisymmetric model of the dust density that exactly reproduces the
observed brightness distribution $I_{\rm obs}(l,b)$ is


$$\rho_{\rm dust}(R,\phi,z) = \rho_{\rm axi}(R,z)\,\frac{I_{\rm obs}(l,b)}{I_{\rm axi}(l,b)}, \tag{10.2}$$


where $(l,b)$ is the direction from the Sun to the point $(R,\phi,z)$ in the
Galaxy. Physically, this equation distributes any additional dust that may be
required along a given line of sight in proportion to the dust density of the
axisymmetric model. Although this assignment must be wrong in detail, it
represents as reasonable a guess as can be made without using velocity
information of the type employed by Kent et al.¹ Spergel et al. used this simple
dust model to calculate for each COBE waveband optical depths, $\tau(R,\phi,z)$,
to absorption from each point in the Galaxy. A valuable check on the
accuracy of the model is available, since it predicts, from the 240 $\mu$m data alone,
the amount by which light will be reddened in passing from any point in the
Galaxy to the Sun. This prediction may be compared with the reddening
inferred from the near-infrared data, which are available in the J, K, L and
M bands.

Whereas Kent et al. and Spergel et al. modeled the distribution of
obscuring dust before fitting a model to the stellar luminosity density,
Freudenreich (1998) modeled the dust and stellar distributions simultaneously.
However, Freudenreich’s modeling procedure employs neither radio-frequency line
data nor far-IR data, but depends only on the near-IR data. The procedure’s
key assumption is that neither the bulge nor the disk has an intrinsic near-IR
color gradient.

Freudenreich started by defining parameterized models of the dust
density and the luminosity densities of the bulge and disk. From these densities
and equation (10.1), he then predicted for each COBE waveband $\lambda$ the
observed intensity $I_\lambda(l,b)$ for many directions $(l,b)$ on the sky. Finally, the
parameters of all three density distributions were simultaneously adjusted to
optimize the fit for each waveband between $I_\lambda$ and the intensity measured
by COBE.


1 If a good model of the Galaxy’s non-axisymmetric velocity field were available, a
better dust model could be obtained by distributing excess dust at $(l,b)$ along the line of
sight so as to optimize the fit between the predicted HI and CO line profiles at $(l,b)$ and
the observed ones.


Table 10.1 Parameters of photometric models of the Milky Way

Kent, Dame & Fazio (1991)                           | Freudenreich (1998)
$L_d$   $4.9 \times 10^{10}\,L_\odot$                | $L_d$                          $2.2 \times 10^{10}\,L_\odot$
$I_d$   $(1000 \pm 200)\,L_\odot\,\mathrm{pc^{-2}}$  | $I_d$                          $867\,L_\odot\,\mathrm{pc^{-2}}$
$R_d$   $(3 \pm 0.5)$ kpc                            | $R_d$                          2.5 kpc
$z_0$   $[0.165 + 0.21(R/R_0 - 5/8)]$ kpc            | $z_0$                          0.167 kpc
                                                    | $R_h$                          2.94 kpc
                                                    | $(q_h, \beta)$                 (0.87, 1.72)
$L_b$   $1.1 \times 10^{10}\,L_\odot$                | $L_b$                          $0.62 \times 10^{10}\,L_\odot$
$I_b$   $(7400 \pm 1500)\,L_\odot\,\mathrm{pc^{-2}}$ | $I_b$                          $1300\,L_\odot\,\mathrm{pc^{-2}}$
$s_b$   $(0.7 \pm 0.2)$ kpc                          | $(a_x, a_y, a_z)$              (1.69, 0.63, 0.42) kpc
$q_b$   $0.61 \pm 0.18$                              | $(c_\perp, c_\parallel)$       (1.55, 3.43)
                                                    | $(R_{\rm bar}, r_{\rm bar})$   (2.97, 0.44) kpc
                                                    | $\phi_0$                       $14^\circ$

NOTES: Both models assume $R_0 = 8$ kpc. The Kent et al. model is for the
K band while Freudenreich’s is for the D band.

Kent, Dame & Fazio (1991) assumed that both bulge and disk are ax- 
isymmetric bodies, with luminosity densities of the form 


$$j_d(R, z) = \frac{I_d}{2z_0}\, e^{-R/R_d}\, e^{-|z|/z_0},$$

$$j_b(R, z) = \frac{I_b}{\pi s_b}\, K_0(s/s_b) \quad\hbox{where}\quad s^4 = R^4 + (z/q_b)^4. \qquad (10.3)$$


Here $I_d$ is the central face-on surface brightness of the disk and $I_b$ is the
central surface brightness of the bulge. $K_0$ is a modified Bessel function,
which ensures that the bulge has an exponential surface-brightness profile
(see Problem 10.1). The variable $s$ is defined so that the bulge has boxy
isodensity surfaces of axis ratio $q_b$. The left-hand column of Table 10.1 gives
the best-fitting values of the various parameters. 

The parametric model of Freudenreich (1998) differs from that of Kent 
et al. in several important respects. First, the disk is assumed to have an 
elliptical hole at its center,² and to have a vertical density profile that follows
the sech² law [equation (4.44)] rather than a simple exponential. Second, in
view of the kinematic evidence, reviewed in §9.4, that the inner Galaxy is
triaxial, Freudenreich assumed that the bulge is triaxial: the Sun–center
line is assumed to make an angle $\phi_0 \simeq 15^\circ$ with the bulge’s longest


2 If the disk really has a central hole, an external observer would consider the Milky 
Way to be a type II galaxy (§4.4.3). 
Figure 10.1 Freudenreich’s model of the inner Galaxy when viewed
along the shortest axis of the bar (top) and along the bar’s intermediate
axis (bottom). There are ten contours per decade in
surface brightness in the upper panel and three per decade in the
lower panel. The axes are marked in kpc.


principal axis. In detail, for $R < R_0$ Freudenreich’s luminosity densities are
given by 


$$j_d(x, y, z) = \frac{I_d}{2z_0}\, {\rm sech}^2(z/2z_0)\, e^{-R/R_d} \left(1 - e^{-m_h^\beta}\right),$$

$$j_b(x, y, z) = \frac{I_b}{2a_z}\, {\rm sech}^2(m_b)\, T(R), \qquad (10.4)$$


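where $I_b$ is the central surface brightness of the bulge when it is viewed down
its shortest axis, $I_d$ is the corresponding surface brightness for the disk in
the absence of the central hole, and

$$m_h = \sqrt{(x^2 + q_h^2 y^2)/R_h^2},$$

$$m_b = \left\{ \left[ \left|\frac{x}{a_x}\right|^{c_\perp} + \left|\frac{y}{a_y}\right|^{c_\perp} \right]^{c_\parallel/c_\perp} + \left|\frac{z}{a_z}\right|^{c_\parallel} \right\}^{1/c_\parallel},$$

$$T(R) = \begin{cases} 1 & \hbox{for } R < R_{\rm bar}, \\ e^{-(R - R_{\rm bar})^2/h_{\rm bar}^2} & \hbox{otherwise.} \end{cases}$$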


The hole in the disk, which has radius $R \simeq R_h$, is created by the factor
$(1 - e^{-m_h^\beta})$ in the first of equations (10.4). The long axis of this hole lies
along the $y$ axis for $q_h < 1$. At Galactocentric radii $R < R_{\rm bar}$, the luminosity
density of the bulge is constant on surfaces of constant $m_b$, which have
principal axes with lengths in the ratios $a_x : a_y : a_z$. The shape of these surfaces
is controlled by the parameters $c_\perp$ and $c_\parallel$. For the best-fitting values of these
parameters ($c_\perp < 2$ and $c_\parallel > 2$; see Table 10.1), the bulge appears boxy
when viewed perpendicular to its shortest ($z$) axis, and pointy when viewed
from along this axis. The Sun–center line makes an angle of $\sim 14^\circ$ with the
long axis of the bar, which is taken to be the $x$ axis. The function $T(R)$
sharply truncates the bulge’s density near $R_{\rm bar}$. Figure 10.1 illustrates these
points by showing face-on and edge-on views of Freudenreich’s model. 
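The bulge part of this model is easy to evaluate numerically. The following is only a minimal sketch, assuming the bulge density has the form $j_b = (I_b/2a_z)\,{\rm sech}^2(m_b)\,T(R)$ with a Gaussian truncation beyond $R_{\rm bar}$; the normalization and the exact truncation law are assumptions made for illustration, and the function names are hypothetical. Parameter values are those listed for Freudenreich (1998) in Table 10.1.

```python
import math

# Bulge parameters from the Freudenreich (1998) column of Table 10.1.
A_X, A_Y, A_Z = 1.69, 0.63, 0.42   # principal-axis scales, kpc
C_PERP, C_PAR = 1.55, 3.43         # shape exponents
R_BAR, H_BAR = 2.97, 0.44          # truncation radius and width, kpc
I_B = 1300.0                       # central surface brightness, L_sun pc^-2


def m_b(x, y, z):
    """Generalized ellipsoidal radius; the bulge density is constant on m_b = const."""
    in_plane = abs(x / A_X) ** C_PERP + abs(y / A_Y) ** C_PERP
    return (in_plane ** (C_PAR / C_PERP) + abs(z / A_Z) ** C_PAR) ** (1.0 / C_PAR)


def truncation(R):
    """Cut-off T(R); the Gaussian form used here is an assumption."""
    if R < R_BAR:
        return 1.0
    return math.exp(-((R - R_BAR) / H_BAR) ** 2)


def j_bulge(x, y, z):
    """Bulge luminosity density in L_sun pc^-3, for positions in kpc."""
    R = math.hypot(x, y)
    sech2 = 1.0 / math.cosh(m_b(x, y, z)) ** 2
    # Assumed normalization I_b / (2 a_z), with a_z converted from kpc to pc.
    return I_B / (2.0 * A_Z * 1000.0) * sech2 * truncation(R)
```

With these definitions `m_b` equals 1 at the points $(a_x, 0, 0)$, $(0, a_y, 0)$ and $(0, 0, a_z)$, which is a quick way to check that the surfaces of constant density have axis ratios $a_x : a_y : a_z$.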

A significantly better fit of the model to the data can be obtained if 
the Sun is assumed to lie ~ 16 pc above the Galactic plane. At large values 
of |l| the fit to the data can be further improved if the model described by 
equations (10.4) is modified in two ways: (i) the exponential disk is truncated 
near $R = 1.5R_0$; (ii) the disk is warped at $R \gtrsim R_0$. The sense of the warp
in the stellar disk is the same as that of the warp in the HI disk that was 
discussed in §9.2.6. In particular, the line of nodes nearly coincides with the 
Sun–center line, and the disk bends towards the north at $l > 0$. Since the
stellar disk does not extend as far as the HI disk, the maximum deviation of 
the stellar disk from the plane of the inner Galaxy is not large (~ 300 pc). 

It is instructive to compare the parameters listed in Table 10.1 for the 
models of Freudenreich (1998) and Kent, Dame & Fazio (1991). In part the 
differences reflect the fact that the Kent et al. model is of the Galaxy’s K-band
luminosity distribution, while Freudenreich’s model is for the L band.
In view of this difference, there is reasonable agreement between the values
given for the disk’s extrapolated central surface brightness $I_d$, scale length
$R_d$, and scale height $z_0$. Comparison of the parameters for the bulge is not
straightforward because the functional forms used in the two studies are very 
different. It is noteworthy, however, that Freudenreich’s bulge has a central, 
face-on surface brightness that is smaller than that of the bulge of Kent et 
al. by a factor ~ 5.7. This difference reflects the fact that Freudenreich’s 
bulge/bar is long and diffuse. It is disconcerting that the numbers given in 
Table 10.1 for the total luminosity of the disk differ by a factor two. Actually, 
the numbers quoted in the published papers differ by an even larger factor; 
our values are based on the values of the K- and L-band luminosities of the
Sun that are given in Table 2.1. Several factors account for the lower lumi- 
nosity of Freudenreich’s disk: its scale length is shorter; its central surface 
brightness is lower; and it has a hole of radius $\sim 3$ kpc at its center.
Freudenreich’s bulge is about 50% less luminous than that of Kent et al. because its
greater radial extent does not fully compensate for its lower characteristic 
surface brightness. 

Given that it is so hard to determine the luminosity of the Galaxy 
photometrically, it is interesting to estimate it from the Tully—Fisher relation 
(§7.3.4) and the local circular speed (§10.6). Equations (7.26) and (10.71) 
with $W^i_R = 2v_c$ together imply that the H-band absolute magnitude of the
Galaxy is $M_H = -23.0 \pm 0.2$. Adopting the value of the H-band absolute
magnitude of the Sun given in Table 2.1, this absolute magnitude corresponds
to a luminosity of $(3.3 \pm 0.6) \times 10^{10}\,L_\odot$. This value lies between the values given
in Table 10.1 and is probably our best estimate of the Galaxy’s luminosity.
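The conversion from absolute magnitude to luminosity can be sketched as follows. The solar H-band absolute magnitude used here ($M_{H,\odot} = 3.3$) is an assumed illustrative value, since Table 2.1 is not reproduced in this section; the function name is likewise hypothetical.

```python
def luminosity_in_solar_units(abs_mag, abs_mag_sun):
    """Luminosity in solar units from L/L_sun = 10**(0.4 * (M_sun - M))."""
    return 10.0 ** (0.4 * (abs_mag_sun - abs_mag))


M_H_GALAXY = -23.0   # from the Tully-Fisher estimate quoted in the text
M_H_ERROR = 0.2      # quoted uncertainty in magnitudes
M_H_SUN = 3.3        # ASSUMED solar H-band absolute magnitude (not from the text)

L_H = luminosity_in_solar_units(M_H_GALAXY, M_H_SUN)

# A magnitude error dm corresponds to a fractional luminosity error of
# roughly 10**(0.4 * dm) - 1.
fractional_error = 10.0 ** (0.4 * M_H_ERROR) - 1.0

print(f"L_H ~ {L_H:.2e} L_sun, fractional error ~ {fractional_error:.2f}")
```

Under this assumption the result is close to the $3.3 \times 10^{10}\,L_\odot$ quoted above, and a 0.2 mag uncertainty maps to roughly a 20% uncertainty in luminosity.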


10.1.1 The Galaxy at optical wavelengths 


To compare the Milky Way with external galaxies, it is desirable to know 
what the Galaxy would look like when viewed from outside at optical wave- 
lengths. From our location it is extremely hard to construct such a model 
directly. The best approach is to assume that the B — K and V — K colors 
of the Milky Way are similar to those of galaxies of similar Hubble type, 
and to use this assumption to derive the B- and V-band luminosity densi- 
ties from the near-IR models discussed above. From a study of 86 nearly 
face-on spiral galaxies, de Jong (1996) concludes that late-type galaxies like 
the Milky Way have disk scale lengths that are larger in the B band than in 
the K band by a factor $(1.25 \pm 0.25)$. He finds, moreover, that the disks of
these galaxies have extrapolated central surface brightnesses that are fainter
in the B band than in the K band by $(3.6 \pm 0.6)$ mag, while the bulges have
$B - K \simeq 4.5$. From these numbers and those presented in Table 10.1 one
can infer the B-band luminosity density at any point in the Milky Way. The 
appearance of the Galaxy to an external observer could then be calculated 
by integrating equation (10.1), which includes the effects of obscuration by 
dust, along the observer’s lines of sight. 

There is one place in which we can directly measure the $B - K$ color of
the Galaxy, namely the solar neighborhood. From the IR modeling described
above, we know that the face-on integrated K-band surface brightness of
the Galaxy at $R_0$ is $\mu_K = 20.62$ (see Problem 10.3). To calculate the
corresponding B-band surface brightness, we proceed as follows. We count
stars in the direction of a Galactic pole down to some limiting apparent
magnitude $B_{\rm lim}$ and sum their fluxes. Figure 10.2 is the resulting plot of
integrated surface brightness, $\mu_B$, versus $B_{\rm lim}$. It shows that $\mu_B$ converges
to a value $\sim 24.7$ by $B_{\rm lim} \sim 16$. De Vaucouleurs & Pence (1978) argue that,
Figure 10.3 A reconstruction of how the Milky Way would appear in the absence of
dust. The L-band COBE data have been corrected for absorption using equation (10.6)
and the dust and stellar luminosity models of Spergel, Malhotra & Blitz (1996). Contours
are spaced by 1 mag. [After Binney, Gerhard & Spergel (1997)]


in the absence of extinction, the surface brightness would be brighter by
$\sim 0.2$ mag. If the local disk were viewed from outside, its surface brightness,
after correction for extinction, would be brighter by $2.5 \log 2$ mag because
one would be looking through all the disk rather than half of it. Hence, the
face-on, extinction-corrected surface brightness of the disk is $\mu_B = 23.75$. It
follows that the solar neighborhood has color $B - K = 3.13$. This value is in
excellent agreement with the B — K colors that de Jong (1996) measures in 
galaxies similar to the Milky Way at radii comparable to that of the Sun. 
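The arithmetic behind these two numbers can be laid out explicitly. This is a short sketch using only the values quoted in the text; the variable names are illustrative.

```python
import math

mu_B_star_counts = 24.7     # converged B-band surface brightness from star counts
extinction_correction = 0.2 # mag brighter in the absence of extinction
# Seeing the whole disk rather than half of it doubles the flux,
# which brightens the surface brightness by 2.5 log10(2) mag.
full_disk_correction = 2.5 * math.log10(2.0)

mu_B_face_on = mu_B_star_counts - extinction_correction - full_disk_correction

mu_K_face_on = 20.62        # K-band value from the IR modeling
color_B_minus_K = mu_B_face_on - mu_K_face_on

print(f"mu_B = {mu_B_face_on:.2f}, B - K = {color_B_minus_K:.2f}")
```

Brighter surface brightness means a numerically smaller magnitude, which is why both corrections are subtracted.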
10.2 The bulge


10.2.1 Integrated surface photometry 


Freudenreich’s model of the Galaxy includes a triaxial bulge because the 
kinematic evidence reviewed in §9.4 indicates that we inhabit a barred gal- 
axy. In the last section we saw, however, that the axisymmetric model 
of Kent, Dame & Fazio (1991) also provides a comparable fit to the near- 
infrared photometry of the Galaxy. This situation prompts us to ask whether 
the near-IR photometry, taken alone, provides compelling evidence that the 
Galaxy is barred. In this enquiry it is important not to restrict oneself to a 
particular parametric form of the Galaxy’s luminosity density. Indeed, unless 
the Galaxy precisely follows the model’s particular functional form, which is 
entirely improbable, the fit to high-quality data provided by the model will 
inevitably be formally poor. This fact deprives us of an objective criterion 
for distinguishing between the fits furnished by an axisymmetric model and 
a barred model. Hence, we now examine the COBE photometry for direct 
indications that the Galaxy is barred. 

The possession of models for the optical depth to absorption $\tau(R, \phi, z)$
and the emissivity $j_\nu(R, \phi, z)$ associated with each point in the Galaxy enables
us to infer what the Milky Way would look like in the absence of dust.
Specifically we write 


$$I_{\rm dustfree}(l, b) = \frac{I_{\rm dustless\ model}(l, b)}{I_{\rm dusty\ model}(l, b)}\, I_{\rm obs}(l, b). \qquad (10.6)$$

Unless our models of $\tau$ and $j_\nu$ are seriously in error, $I_{\rm dustfree}$ should yield
quite an accurate estimate of what a dust-free Milky Way would look like. 
Figure 10.3 shows such a dust-free view of the Milky Way. 
Although the brightness distribution shown in Figure 10.3 is fairly symmetrical
in $b$, significant asymmetries are apparent in $l$. The upper panel of
Figure 10.4 quantifies this phenomenon by plotting on a magnitude scale the
ratio $I(l, b)/I(-l, b)$ of the brightnesses at $\pm l$ in Figure 10.3. In an elongated
region around the center that extends to $|l| \sim 10^\circ$, both sides have about the
same brightness.³ At larger $|l|$ or $|b|$, the side at $l > 0$ is brighter than the
other side. Beyond $|l| \sim 20^\circ$, this trend reverses, and it is the side at $l < 0$
that is brighter. Figure 10.5 shows schematically why the nearer end of the
bar is brighter than the further end: at a given value of $|l|$ the left-hand line
of sight crosses the long axis of the bar closer to the center than does the 
right-hand line of sight. 

The lower panel in Figure 10.4 shows the degree to which Freudenreich’s 
triaxial bulge is able to reproduce the observed asymmetry distribution that 
is shown in the top panel. The model correctly predicts that the brighter 
side is at $l > 0$ for $|l| < 20^\circ$, but it incorrectly predicts that the Galaxy is
symmetrical further out. This failure of the model reflects the fact that at
$|l| \gtrsim 20^\circ$ lines of sight do not intersect the model bulge and the model disk
is very nearly axisymmetric. 

This discussion makes it clear that the triaxiality of any Galaxy model 
derives entirely from the differences between the Galactic luminosity density
at $l > 0$ and at $l < 0$ that are contoured in Figure 10.4. Unfortunately,
these differences are much smaller than the corrections made to the observed 
surface-brightness distribution to allow for obscuration by dust. Hence, a 
subtle change in the dust model is liable to change significantly the assumed 
left-right asymmetry in appearance of the Galaxy from which the bar’s shape 
is derived. We can estimate the importance of this sensitivity by comparing 
Freudenreich’s model with a triaxial luminosity distribution that has been 
derived using a different model for the dust. The data shown in Figure 10.3 
constitute an appropriate data set, and the luminosity distribution can be 
constructed by applying a Richardson—Lucy deconvolution algorithm (Ap- 
pendix C) to these data. In order to focus on the differences implied by the 
alternative dust model, we use Freudenreich’s model as the starting point of 
the Richardson—Lucy iterations. 


3 As Blitz & Spergel (1991b) pointed out, if the Galaxy had an approximately ho- 
mogeneous core, its right-hand side would be brighter than its left close to the Center, 
because lines of sight on the right make smaller angles with the bar’s long axis, with the 
result that they remain within a region of high luminosity density for longer than do lines 
of sight on the left. 
Figure 10.6 The dotted contours show the central part of Figure 10.1.
The full contours show the model that is produced from
this model and the data of Figure 10.3 by seven iterations of the
Richardson–Lucy algorithm of Binney & Gerhard (1997).


Comparison of the full and dotted contours in Figure 10.6 shows how
seven iterations of the deconvolution technique modify the model of Fig- 
ure 10.1 to produce a better fit to the data of Figure 10.3. Each set of 
contours shows the surface brightness of a model when the latter is viewed 
perpendicular to the disk; the dotted contours are for Freudenreich’s model, 
while the full contours are for the iterated model, which fits the data signif- 
icantly better. One striking difference between the two models is that, at 
small radii, the full contours are boxy while the dotted contours are pointy. 

Another important, but less obvious, difference between the models 
shown in Figure 10.6 is that the full contours are more crowded towards 
the center than the dotted contours. This phenomenon arises because the 
iterations have modified the bulge from an exponential profile (which yields 
uniformly spaced logarithmic contours) to a profile that is better represented 
by a power law. In §4.3 we saw that the luminosity densities of faint elliptical
galaxies behave as power laws, $j_\nu \sim r^{-1.9}$, at very small radii (see
Figure 4.32). Interestingly, ground-based near-infrared observations, which
have much higher spatial resolution than the space-based data, show that 
the luminosity density of the Milky Way conforms to this pattern in as much 
as, in the angular range $2'' < R < 4^\circ$, the azimuthally-averaged K-band
surface brightness fits a power law $I(R) \sim R^{-0.8}$ (Eckart et al. 1994), which
naively implies $j_\nu \propto r^{-1.8}$ (§4.2.3). For $R_0 = 8$ kpc, $2'' < R < 4^\circ$ corresponds
to $0.8\,{\rm pc} \lesssim r \lesssim 560\,{\rm pc}$. Beyond a few degrees, the surface brightness falls
much more steeply: in the range $-12^\circ < b < -4^\circ$, Frogel et al. (1990) find
that power laws $I \propto |b|^{-\alpha}$ with index $\alpha$ in the range $2.5 < \alpha < 3.4$ fit the
projected density of various tracers, with larger values of $\alpha$ associated with
more metal-rich tracers. Thus it seems that, to within the substantial errors, 
the luminosity density of the inner Galaxy can be fitted by equation (4.15) 
with $\gamma \simeq 1.8$ and $a \simeq 0.56$ kpc. The dependence of $\alpha$ upon metallicity probably
arises because at $4^\circ < |b| < 12^\circ$ both the bulge and the halo contribute
significantly to the observed tracers, and the surface density of the halo is 
falling less rapidly with |b| than is the surface density of the bulge. As we 
shall see below, the bulge has significantly higher metallicity than the halo, 
so metal-rich tracers should provide the truest picture of the surface density 
of the bulge. 
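The step from a surface-brightness power law to a luminosity-density power law can be made explicit. The following is a short worked projection, assuming a spherical power-law density $j(r) = j_0 (r/r_0)^{-\gamma}$ with $\gamma > 1$; the symbols $j_0$, $r_0$ and $\gamma$ are introduced here purely for illustration.

```latex
\begin{aligned}
I(R) &= 2\int_R^\infty \frac{j(r)\, r\, {\rm d}r}{\sqrt{r^2 - R^2}}
      = 2 j_0 r_0^{\gamma} \int_R^\infty \frac{r^{1-\gamma}\, {\rm d}r}{\sqrt{r^2 - R^2}} \\
     &= 2 j_0 r_0^{\gamma}\, R^{1-\gamma} \int_0^{\pi/2} \cos^{\gamma-2}\theta \, {\rm d}\theta
      \qquad (r = R\sec\theta) \\
     &\propto R^{1-\gamma}.
\end{aligned}
```

The angular integral is a pure number that converges for $\gamma > 1$, so a projected profile $I \propto R^{-0.8}$ corresponds to $\gamma = 1.8$, in line with the "naive" inference quoted above.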


10.2.2 Evidence for a bar from individual stars 


Further evidence that the Galaxy is barred comes from studies of individual 
stars rather than integrated photometry. Unfortunately, most of the rel- 
evant observations have been made at optical wavelengths. Consequently, 
observations of bulge stars in fields that lie sufficiently far from the plane 
to be observable at optical wavelengths are important. Since the distribu- 
tion of dust is highly non-uniform, we can see much further in some di- 
rections that lie near the center than in others. One such direction, at 
$(l, b) = (1^\circ, -3.9^\circ)$, is especially important: it is known as Baade’s Window
in honor of W. Baade, who discovered it. Along lines of sight that lie in
Baade’s Window, the extinction to the point of closest approach to the center
varies in the range $1.26 < A_V < 2.79$ (Stanek 1996). Several important
observational programs have involved studies of stars in Baade’s Window 
and similar windows on to the bulge. 

One class of studies of stars in Baade’s Window involves monitoring the
brightnesses of several million stars in order to catch the rare occasions when 
a star is gravitationally lensed by a foreground star — see §2.4. Such searches 
for microlensing events provide an important constraint on several characteristics
of bulge stars. First and foremost, the optical depth to microlensing, $\tau$,
that a model predicts for a given line of sight increases with both the surface
density, $\Sigma_b$, of bulge stars and the effective depth, $d_b$, of the bulge along that
line of sight. It is clear that $\tau$ must increase with the surface density since
lensing occurs when one star passes behind another star. The sensitivity of
$\tau$ to the depth of the bulge arises because the cross-section for lensing, $\pi r_{\rm E}^2$,
contributed by any given foreground star is greatest when it lies half way
between us and the lensed star (see §2.4). Quantitatively, $\tau$ is approximately
proportional to $\Sigma_b d_b$.

Now suppose that we looked exactly down the long axis of a prolate 
spheroidal bulge and consider the variation in 7 as we elongate the bulge 
whilst leaving its mass and mean density unchanged. Let $q < 1$ be the
bulge’s axis ratio. Mass conservation implies that $\Sigma_b q^2 d_b^2 =$ constant while
volume conservation implies $q^2 d_b^3 =$ constant. Combining these relations, we
find that $\tau \propto d_b^2 \propto q^{-4/3}$. Figure 10.6 suggests that, if the angle $\phi_0$ between
the Sun–center line and the long axis of the bar is $15^\circ$, then $q \simeq 0.3$–$0.4$.
The calculation above suggests that in this case $\tau$ will be $\gtrsim 3.5$ times larger
than it would be if the bulge were spherical and equally massive. Moreover,
the smaller $\phi_0$ is, the smaller the COBE photometry requires $q$ to be and
therefore the larger $\tau$ will be. In Baade’s Window Udalski et al. (1994) find
$\tau = (3.3 \pm 1.2) \times 10^{-6}$ for $8.6\,{\rm d} < t < 62\,{\rm d}$ while at $(l, b) = (2.3^\circ, -2.65^\circ)$
Alcock et al. (1995b) find $\tau \simeq 4 \times 10^{-6}$ for a similar range of values of
$t$. Axisymmetric models of the Milky Way predict optical depths that are
smaller by roughly a factor of 3 (Paczynski et al. 1994). Thus the fact that
the measured value of $\tau$ is of order 3 times larger than axisymmetric models
predict supports the proposition that $\phi_0 \simeq 15^\circ$. Zhao, Spergel & Rich
(1995) find that a barred bulge of the type that is suggested by the COBE
photometry yields $\tau \simeq 2 \times 10^{-6}$ in Baade’s Window, which is just compatible
with the data of Udalski et al.
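The scaling argument above can be checked numerically. This is a minimal sketch of the idealized case described in the text (a prolate bulge viewed exactly down its long axis, at fixed mass and mean density); the function name is hypothetical.

```python
def optical_depth_boost(q):
    """Ratio tau(q) / tau(q = 1) for a prolate bulge viewed down its long axis.

    With tau ~ Sigma_b * d_b, mass conservation (Sigma_b * q**2 * d_b**2 = const)
    and volume conservation (q**2 * d_b**3 = const) give
    Sigma_b ~ d_b and d_b ~ q**(-2/3), hence tau ~ d_b**2 ~ q**(-4/3).
    """
    if not 0.0 < q <= 1.0:
        raise ValueError("axis ratio q must satisfy 0 < q <= 1")
    return q ** (-4.0 / 3.0)


for q in (1.0, 0.4, 0.3):
    print(f"q = {q:.1f}: tau boosted by a factor {optical_depth_boost(q):.2f}")
```

For $q$ between 0.4 and 0.3 the boost runs from about 3.4 to about 5, consistent with the statement that $\tau$ is raised by a factor of roughly 3.5 or more.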

Photometry of the millions of stars in these studies that are not lensed 
yields an additional important constraint. When these stars are plotted in 
the $(V - I, V)$ plane, there is a pronounced clustering of stars at the location
expected for a red clump star (§5.1.2) that is $\sim 8$ kpc distant and obscured
by $A_V \sim 2$ mag of extinction. However, for fields at $l \simeq +5^\circ$ and $b \simeq -3.5^\circ$
the center of this clustering is brighter by 0.4 mag than for fields at the same
latitude but $l \simeq -5^\circ$ (Stanek et al. 1994). Bar models predict just such a
difference because they imply that stars observed at $l > 0$ are, on average,
closer than those observed at $l < 0$ (see Figure 10.5). Quantitatively, the
two models shown in Figure 10.6 predict differences between the red-clump 
magnitudes for Stanek’s fields that amount to 0.29 mag for Freudenreich’s 
model, and 0.21 mag for the model shown by full contours. Thus, the COBE- 
based bar models predict values for Stanek’s magnitude difference that are 
smaller than the observed value. This discrepancy will probably be resolved 
by improved estimates of the extinction towards the Galactic center.⁴
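A magnitude offset between two groups of standard candles translates directly into a distance ratio. The sketch below assumes identical intrinsic luminosities and equal extinction on the two sides, which is exactly the assumption the text flags as uncertain; the function name is illustrative.

```python
def distance_ratio(delta_mag):
    """Far-to-near distance ratio implied by a magnitude difference.

    From m_far - m_near = 5 log10(d_far / d_near), assuming equal
    absolute magnitude and equal extinction for both groups.
    """
    return 10.0 ** (delta_mag / 5.0)


for label, dm in (("observed (Stanek et al. 1994)", 0.40),
                  ("Freudenreich model", 0.29),
                  ("iterated model", 0.21)):
    print(f"{label}: dm = {dm:.2f} mag -> d_far / d_near = {distance_ratio(dm):.3f}")
```

The observed 0.4 mag offset corresponds to the far side being about 20% more distant than the near side, against roughly 10 to 14% for the two COBE-based models.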

Further evidence that objects in the bulge that are seen at $l > 0$ tend
to be closer than corresponding objects seen at $l < 0$ comes from studies
of several classes of objects of known absolute magnitude in addition to red 
clump stars. These objects include AGB stars (Nakada et al. 1991, Wein- 
berg 1992), OH/IR stars (Sevenster 1996), and Mira variables (Whitelock & 
Catchpole 1992).

As we saw in §2.4, microlensing events have a characteristic timescale t 
which depends upon the mass of the lensing star and the transverse velocities 
of the lensing star relative to the lensed star. Consequently, microlensing surveys


4 Wozniak & Stanek (1996) discuss the effects of a systematic variation in the extinc- 
tion across the Galactic center. 
provide information on both the kinematics and the mass function of bulge
stars. Unfortunately, the study of microlensing is still in its infancy and 
neither the models nor the data analysis has developed to the point at which 
the full potential of this technique can be exploited. 


10.2.3 Age and metallicity of the bulge 


Studies of the distribution in age and metallicity of bulge stars are funda- 
mental for our understanding of the structure and formation of all galaxies, 
including the Milky Way. The reason that such studies enjoy this fundamental
role is simply that the bulge is the only stellar system of its type that is
close enough to permit the detailed study of individual stars. 

Rich (1988) analyzed the spectra of 88 bulge K giants in Baade’s Window 
and compared them with similar spectra for K giants in the disk and globular 
clusters. Subsequently, McWilliam & Rich (1994) obtained spectra of twelve 
of these stars at 14-fold greater resolution and concluded that Rich (1988) 
had overestimated [Fe/H] by ~ 0.3 for the most metal-rich stars. Figure 10.7 
shows a histogram of [Fe/H] for these stars on the revised scale of McWilliam 
& Rich (1994). The distribution is wide, reaching from [Fe/H] = −1.25 to
[Fe/H] = 0.5 with mean value ⟨[Fe/H]⟩ = −0.25. We shall see below (§10.4)
that this mean value is similar to that of solar-neighborhood K giants, and 
is considerably higher than the mean values of [Fe/H] for either the halo or 
the disk globular clusters (§10.5.1). 

On account of the width of the bulge’s metallicity distribution, the CM 
diagram for the bulge differs from that of any globular cluster. In particular, 
although it contains RR Lyrae stars and K giants similar to those in globular 
clusters, it also contains M giants such as we find in the solar neighborhood. 
Tiede, Frogel & Terndrup (1995) show that the morphology of the bulge’s 


5 Medium resolution spectra of a large sample of K giants confirm the metallicity 
distribution of Figure 10.7 (Minniti et al. 1995).
CM diagram is consistent with the value of ⟨[Fe/H]⟩ that McWilliam & Rich
derived spectroscopically. 

The K giants analyzed by McWilliam & Rich (1994) have abundance 
ratios that differ from those in the Sun. The α element Mg and the iron-peak
element Ti are enhanced relative to Fe; [Mg/Fe] ≈ [Ti/Fe] ≈ 0.3 at all
values of [Fe/H]. On the other hand, [Si/Fe] ≈ [Ca/Fe] ≈ 0 even though
both Si and Ca are α elements. Thus the abundance ratios in bulge giants
are not naturally explained by the theory of §5.2. The reason for this failure 
is unknown. 


10.2.4 Bulge kinematics 


To probe the very center of the bulge one has to work at infrared or radio 
wavelengths because the inner few degrees are extremely highly obscured by 
dust at optical wavelengths. The advent of sensitive imaging infrared detec- 
tors has made infrared studies possible. A convenient spectral feature for 
velocity determinations is caused by absorption at 2.3 μm by CO molecules:
this absorption trough has a very sharp edge — the CO bandhead — at its 
blue end, which can be accurately located in typical spectra. Sellgren et al. 
(1990) used this feature to study the dynamics of stars in the innermost $1'$
(2.3 pc) of the Galaxy, which is dominated by the nuclear star cluster (§9.5).
They concluded that any systematic rotation within 2.3 pc has amplitude
$\lesssim 15$ km s$^{-1}$. The measured velocity dispersion rises from $\sigma_{\rm los} \simeq 70$ km s$^{-1}$
at $R = 1'$ to $\sigma_{\rm los} \simeq 125$ km s$^{-1}$ at $R < 16''$ (0.6 pc). They found $\sigma_{\rm los}$ to be
constant inside $R = 16''$, but concluded that this constancy probably reflects
an absence from the innermost 0.6 pc of stars with CO lines in their spec- 
tra: the strength of the CO bandhead decreases at R < 16” as if all the light 


6 Sadler, Rich & Terndrup (1996) derived a slightly smaller Mg enhancement in bulge 
giants from low-resolution spectra. 
emitted within 0.6 pc of Sgr A* were free of CO lines. This conclusion is consistent
with the idea that the central star cluster is dominated by early-type
stars — see §9.5. 

Blum et al. (1995) used the CO bandhead to determine $\sigma_{\rm los}$ for the
bulge in four fields, the most central of which lies at $(l, b) = (-0.59^\circ, 0.98^\circ)$,
which corresponds to a distance $r \simeq 160$ pc from the center. They found
evidence that $\sigma_{\rm los}$ rises as one approaches the center: in their innermost field,
$\sigma_{\rm los} = 153 \pm 17$ km s$^{-1}$. This value is twice as large as the dispersion reported
by Sellgren et al. (1990) at $r \simeq 2.3$ pc. Evidently Blum et al. measured the
velocity dispersion of the bulge, while even the outermost point of Sellgren et 
al. relates to the central star cluster, which is a dynamically distinct entity. 
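The physical scales quoted for these fields follow from simple geometry once $R_0$ is fixed. The sketch below, with $R_0 = 8$ kpc as adopted in the text, computes the distance of closest approach to the Galactic center of a line of sight at Galactic coordinates $(l, b)$; the function name is hypothetical.

```python
import math

R0_PC = 8000.0  # adopted distance to the Galactic center, in pc


def closest_approach_pc(l_deg, b_deg, r0_pc=R0_PC):
    """Minimum distance from the Galactic center to the line of sight at (l, b).

    If theta is the angle between the line of sight and the direction to the
    center, then cos(theta) = cos(l) cos(b), and the closest approach is
    R0 * sin(theta). sin(theta) is evaluated in a form that stays accurate
    for very small angles.
    """
    l = math.radians(l_deg)
    b = math.radians(b_deg)
    sin_theta = math.sqrt(math.sin(b) ** 2 + (math.cos(b) * math.sin(l)) ** 2)
    return r0_pc * sin_theta


print(closest_approach_pc(1.0 / 60.0, 0.0))     # 1 arcmin
print(closest_approach_pc(16.0 / 3600.0, 0.0))  # 16 arcsec
print(closest_approach_pc(-0.59, 0.98))         # innermost field of Blum et al.
```

These reproduce the 2.3 pc, 0.6 pc and roughly 160 pc figures quoted above to within rounding.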

Radial velocities are available for several hundred bulge stars that lie in 
Baade’s Window (Mould 1983, Rich 1988, Terndrup, Sadler & Rich 1995). If 
$\sigma_{\rm los}$ is to be measured accurately, care has to be taken to eliminate from the
sample foreground disk stars, which tend to bias the measurement towards
low values. When this correction is made, one finds for the bulge $\sigma_{\rm los} =
110 \pm 10$ km s$^{-1}$.

Minniti (1996) studied the kinematics of giants in a field at $(l, b) =
(8^\circ, 7^\circ)$, to which halo stars make a significant contribution. A sample of
194 K giants with [Fe/H] > −1 had $\langle v_{\rm los}\rangle = 66 \pm 5$ km s$^{-1}$ and $\sigma_{\rm los} =
72 \pm 4$ km s$^{-1}$, while the 31 most metal-weak K giants ([Fe/H] < −1.5) had
$\langle v_{\rm los}\rangle = -6 \pm 20$ km s$^{-1}$ and $\sigma_{\rm los} = 113 \pm 14$ km s$^{-1}$. If we identify the stars
with [Fe/H] > −1 as bulge stars and the metal-weak stars as halo objects
(see below), then these data indicate that: (i) the outward decrease in the
bulge’s velocity dispersion that we encountered above continues to at least
$R \simeq 1.5$ kpc, and (ii) the bulge has significant rotational velocity. Figure 10.8
summarizes measurements of the line-of-sight velocity within the bulge and 
the nuclear star cluster. 

OH/IR stars (see Table 3.5) provide another important probe of the 
bulge because: (i) their detectability at infrared and radio wavelengths allows 
them to be studied throughout the Galaxy; and (ii) their velocities can be 
measured accurately. te Lintel-Hekkert et al. (1991) and Nakada et al. (1993) 
searched for maser emission from selected IRAS sources, while Lindqvist,
Habing & Winnberg (1992) and Sevenster et al. (1997) made flux-limited
surveys for 18 cm maser emission. Unlike the radio-selected surveys, the
IRAS-based surveys cover the whole inner Galaxy but suffer from a bias
against sources near the plane. The velocity dispersions of the IRAS-based
samples both lie near $\sigma_{\rm los} = 110$ km s$^{-1}$. Both IRAS samples show clear signs
of Galactic rotation; when a straight line is fitted to the values of $v_{\rm los}$ versus
longitude $l$, one obtains $\langle v_{\rm los}\rangle \simeq 10\,l$ km s$^{-1}$, with $l$ measured in degrees. The
significance of this result is unclear, however, since the data are not well 
fitted by a straight line, and we do not expect the bulge to rotate like a solid 
body. 

The line profile of an OH/IR star is double peaked because the masing 
gas is contained in a shell that is expanding away from the central star. 
The expansion velocity of this shell, which can be measured from the line
profile, is thought to be a diagnostic of the age of the exciting star: initially
more massive and therefore shorter-lived stars give rise to larger expansion
velocities. When the radio-selected sample of Lindqvist, Habing & Winnberg
(1992) is divided into two according to whether the expansion velocity is
greater than or less than 18 km s$^{-1}$, the velocity dispersions of the subsamples
are $\sigma_{\rm los} = 82 \pm 7$ km s$^{-1}$ for the slowly expanding (old) stars and $\sigma_{\rm los} =
65 \pm 6$ km s$^{-1}$ for the rapidly expanding (young) stars. The rapidly expanding
stars are significantly more strongly concentrated towards the plane than
are the slowly expanding stars. Both samples show evidence for Galactic
rotation. In fact, the mean values of $\langle v_{\rm los} \times {\rm sign}(l)\rangle$ are very similar for this
sample and for the samples of IRAS-selected OH/IR stars studied by Nakada 
et al. and te Lintel-Hekkert et al., even though the Lindqvist et al. sample 
covers a much smaller range in |l| (|l| < 1°) than do the IRAS-selected 
surveys. 

If the transverse velocities of bulge stars exceed 100 km s$^{-1}$, their proper
motions should be measurable: 120 km s$^{-1}$ at a distance of $R_0 = 8$ kpc yields
3.1 mas yr$^{-1}$. Spaenhauer, Jones & Whitford (1992) have measured proper
motions for 429 stars in Baade’s Window. These stars were selected to have
$B - V > 1.4$, with the intention of including within the sample all K and
M giants in the field.⁷ The proper motions measured by Spaenhauer et al.
are relative, that is, the mean proper motion of the sample is unknown. The
dispersions of $\mu_l$ and $\mu_b$ within the sample are $(\sigma_l, \sigma_b) = (3.2 \pm 0.1, 2.8 \pm
0.1)$ mas yr$^{-1}$. The subsample formed by stars fainter than $B = 18$ has
only very slightly larger dispersions. Since any disk stars in the sample 
should be relatively nearby and therefore mostly brighter than B = 18, this 
result suggests that disk contamination is not a major problem. The rough 
agreement between the line-of-sight and transverse velocity dispersions shows 
that any velocity anisotropy within the bulge is minor. It does not follow that 
the velocity ellipsoids (see below) within the bulge are isotropic, however, 
because these ellipsoids will in most places not be aligned with the direction 
to Baade’s Window and the $l$ and $b$ directions.
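The comparison between proper-motion dispersions and line-of-sight dispersions rests on the standard conversion $v_t = 4.74\,\mu\,d$, with $v_t$ in km s$^{-1}$, $\mu$ in mas yr$^{-1}$ and $d$ in kpc. A short sketch, with illustrative function names and the text's $R_0 = 8$ kpc as the assumed distance:

```python
K_PM = 4.74047  # km/s corresponding to 1 mas/yr at a distance of 1 kpc


def proper_motion_mas_per_yr(v_t_kms, d_kpc):
    """Proper motion (mas/yr) of a star with transverse speed v_t at distance d."""
    return v_t_kms / (K_PM * d_kpc)


def transverse_velocity_kms(mu_mas_per_yr, d_kpc):
    """Transverse speed (km/s) implied by proper motion mu at distance d."""
    return K_PM * mu_mas_per_yr * d_kpc


R0_KPC = 8.0  # assumed distance of the bulge stars

print(proper_motion_mas_per_yr(120.0, R0_KPC))  # proper motion for 120 km/s
print(transverse_velocity_kms(3.2, R0_KPC))     # sigma_l converted to km/s
print(transverse_velocity_kms(2.8, R0_KPC))     # sigma_b converted to km/s
```

At 8 kpc the measured dispersions of 3.2 and 2.8 mas yr$^{-1}$ correspond to roughly 120 and 106 km s$^{-1}$, which is the sense in which they agree with the line-of-sight dispersion of about 110 km s$^{-1}$.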


⁷ The first-epoch plates were exposed by W. Baade in 1950.


10.3 Kinematics of stars near the Sun


10.3.1 The solar motion


We now fill a gap in our knowledge which we papered over in Chapters 2
and 9: determining the velocity of the Sun relative to the LSR. Our strategy
for this project is the following. We identify a series of types of spectroscopically
similar stars (gK stars, dM stars etc.) and then determine the velocity
of the Sun relative to the mean velocity of each type. Each such velocity is
called the solar motion relative to that type of stars. We shall find that
the solar motion varies systematically with the type studied, and that the
velocity of the LSR can be inferred from this systematic variation.

The solar motion can be determined from either radial velocities or
proper motions; see Jaschek & Valbousquet (1991) and Dehnen & Binney
(1998a) for recent determinations by each method. To see this, let $\mathbf{v}$ denote
the velocity of an object in the frame of rest of the given type; in this frame
the Sun has velocity $\mathbf{v}_\odot$ and the average of any component of velocity, $v_i$,
over stars of the type is zero. Now the heliocentric velocity of the $k$th star is
$\mathbf{u}_k = \mathbf{v}_k - \mathbf{v}_\odot$, so, by equation (2.23), its radial velocity is


    $v_{\mathrm{los},k} = \hat{\mathbf{x}}_k \cdot \mathbf{v}_k - v_\odot \cos\psi_k$,    (10.7)


where $\hat{\mathbf{x}}_k$ is the unit vector from the Sun to the star and $\psi_k$ is the angle
between this vector and $\mathbf{v}_\odot$. When equation (10.7) is averaged over a large
number of stars that are seen in almost the same direction $\hat{\mathbf{x}}$, we have that
$\hat{\mathbf{x}}_k \cdot \mathbf{v}_k$ approximately averages to zero because the mean of $\mathbf{v}_k$ is, by construction,
zero and $\hat{\mathbf{x}}_k \simeq \hat{\mathbf{x}}$ for all stars. Hence


    $\langle v_{\mathrm{los}} \rangle \simeq -v_\odot \cos\psi$.    (10.8)


This equation states that the mean radial velocity will be largest when $\hat{\mathbf{x}}$
points in the opposite direction to $\mathbf{v}_\odot$ ($\psi = 180^\circ$), and smallest when $\hat{\mathbf{x}}$
and $\mathbf{v}_\odot$ are parallel ($\psi = 0$); in this direction the Sun is on the average
approaching the stars, so they tend to have negative radial velocities. The
direction of $\mathbf{v}_\odot$ is called the apex of the solar motion, and the opposite
direction is called the antapex of the solar motion.

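Suppose now that we wish to determine $\mathbf{v}_\odot$ from proper motions. We
average equation (2.19) for a star’s proper motion over a large number of stars
of the type that are all seen near the direction of $\hat{\mathbf{x}}$ and all lie approximately
at distance $d$, so that $|\mathbf{x}_k| \simeq d$:


    $\begin{aligned}
    \langle \boldsymbol{\mu} \rangle
      &= \left\langle \frac{[(\mathbf{v}_k - \mathbf{v}_\odot) \times \hat{\mathbf{x}}_k] \times \hat{\mathbf{x}}_k}{|\mathbf{x}_k|} \right\rangle \\
      &\simeq \frac{(\langle \mathbf{v}_k - \mathbf{v}_\odot \rangle \times \hat{\mathbf{x}}) \times \hat{\mathbf{x}}}{d} \\
      &= -\frac{1}{d} (\mathbf{v}_\odot \times \hat{\mathbf{x}}) \times \hat{\mathbf{x}} \\
      &= \frac{1}{d} (\mathbf{v}_\odot - v_\odot \cos\psi\, \hat{\mathbf{x}}).
    \end{aligned}$    (10.9)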
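
The last step of equation (10.9) rests on the vector triple-product identity $(\mathbf{a} \times \mathbf{b}) \times \mathbf{c} = (\mathbf{a} \cdot \mathbf{c})\,\mathbf{b} - (\mathbf{b} \cdot \mathbf{c})\,\mathbf{a}$. As a short worked step, assuming only that $\hat{\mathbf{x}}$ is a unit vector and that $\psi$ is the angle between $\hat{\mathbf{x}}$ and $\mathbf{v}_\odot$:

```latex
\begin{aligned}
(\mathbf{v}_\odot \times \hat{\mathbf{x}}) \times \hat{\mathbf{x}}
  &= (\mathbf{v}_\odot \cdot \hat{\mathbf{x}})\,\hat{\mathbf{x}}
     - (\hat{\mathbf{x}} \cdot \hat{\mathbf{x}})\,\mathbf{v}_\odot \\
  &= v_\odot \cos\psi\,\hat{\mathbf{x}} - \mathbf{v}_\odot ,
\\[4pt]
\left| \mathbf{v}_\odot - v_\odot \cos\psi\,\hat{\mathbf{x}} \right|
  &= v_\odot \sin\psi .
\end{aligned}
```

The magnitude of the mean proper motion is therefore $v_\odot \sin\psi / d$, which makes explicit that it is zero for $\psi = 0$ or $180^\circ$ and greatest for $\psi = 90^\circ$.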


From this equation we may deduce the physically obvious fact that the mean
proper motion of members of the type vanishes in the direction of either the
apex or the antapex of the solar motion ($\psi = 0, 180^\circ$), and is largest in the
perpendicular directions ($\psi = 90^\circ$). Moreover, if we further average equation
(10.9) over stars that lie in the same direction but at different distances $d$, it
remains true that $\langle \boldsymbol{\mu} \rangle$ vanishes in the apex and antapex directions. Hence,
the direction of the solar motion can be deduced from proper motions even
if one does not know any stellar distances. To obtain the magnitude of $\mathbf{v}_\odot$,
however, it is essential to have a distance estimate for each star in one’s
sample.


Figure 10.9 Definition of a coordinate system centered on the
Sun.


Box 10.1: Determining $\mathbf{v}_\odot$ by Least-Squares


It is not immediately obvious from equations (10.8) and (10.9) how $\mathbf{v}_\odot$
should be determined from a given body of data. We address this problem
for the case of radial velocities; a closely analogous procedure works for
proper motion data (Dehnen & Binney 1998a). We start by rewriting
(10.7) in the form


    $\hat{\mathbf{x}}_k \cdot \mathbf{v}_k = \hat{\mathbf{x}}_k \cdot \mathbf{v}_\odot + v_{\mathrm{los},k} \qquad (k = 1, \ldots, N)$,


where we assume that $N$ stars have been observed. These $N$ equations
have infinitely many solutions, since if $\mathbf{v}_k$ and $\mathbf{v}_\odot$ solve them, so do
$\mathbf{v}'_k = \mathbf{v}_k + \mathbf{V}$ and $\mathbf{v}_\odot + \mathbf{V}$; this change corresponds to a simple boost
in our reference frame. To pick out the solution of interest, we have to
impose the condition that the $\mathbf{v}_k$ have zero mean. Suppose that this
condition is satisfied in the unprimed frame. Then it is easy to see that


    $S' \equiv \sum_k^N (\hat{\mathbf{x}}_k \cdot \mathbf{v}'_k)^2 = \sum_k^N (\hat{\mathbf{x}}_k \cdot \mathbf{v}_k)^2 + \sum_k^N (\hat{\mathbf{x}}_k \cdot \mathbf{V})^2 \geq \sum_k^N (\hat{\mathbf{x}}_k \cdot \mathbf{v}_k)^2$.


It follows that the desired frame is that which minimizes $S$. Therefore
we determine $\mathbf{v}_\odot$ by minimizing $S \equiv \sum_k (\hat{\mathbf{x}}_k \cdot \mathbf{v}_\odot + v_{\mathrm{los},k})^2$ with respect
to the three components of $\mathbf{v}_\odot$. That is, we solve the three simultaneous
linear equations


    $0 = \frac{\partial S}{\partial v_{\odot i}} = 2 \sum_k (\hat{\mathbf{x}}_k \cdot \mathbf{v}_\odot + v_{\mathrm{los},k})\, \hat{x}_{ki} \qquad (i = 1, 2, 3)$


for the three components $v_{\odot i}$.
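
The least-squares procedure of Box 10.1 can be sketched numerically as below. This is a minimal illustration rather than the method of any particular paper: the function name is hypothetical, the stellar sample is synthetic (isotropic directions, Gaussian random velocities with an assumed dispersion of 20 km/s), and the "true" solar motion fed in is simply the value of equation (10.11).

```python
import numpy as np


def solar_motion_from_vlos(xhat, vlos):
    """Minimise S = sum_k (xhat_k . v_sun + vlos_k)^2 over the 3 components of v_sun.

    Setting dS/dv_sun_i = 0 gives the normal equations
        sum_j [sum_k xhat_ki xhat_kj] v_sun_j = -sum_k vlos_k xhat_ki.
    """
    xhat = np.asarray(xhat, dtype=float)
    vlos = np.asarray(vlos, dtype=float)
    matrix = xhat.T @ xhat      # 3x3 matrix: sum_k xhat_ki xhat_kj
    rhs = -xhat.T @ vlos        # right-hand side: -sum_k vlos_k xhat_ki
    return np.linalg.solve(matrix, rhs)


# Synthetic test sample (all numbers below are illustrative assumptions).
rng = np.random.default_rng(0)
n_stars = 20000
xhat = rng.normal(size=(n_stars, 3))
xhat /= np.linalg.norm(xhat, axis=1, keepdims=True)   # isotropic unit vectors
v_sun_true = np.array([10.0, 5.2, 7.2])               # km/s
v_star = rng.normal(scale=20.0, size=(n_stars, 3))    # zero-mean random velocities

# Equation (10.7): radial velocity of each star as seen from the Sun.
vlos = np.einsum("ij,ij->i", xhat, v_star - v_sun_true)

v_sun_fit = solar_motion_from_vlos(xhat, vlos)
```

Because the synthetic velocities have zero mean only on average, the recovered vector differs from the input by an amount of order the dispersion divided by the square root of the number of stars per component.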

Box 10.1 and Problem 10.5 describe the mechanics of extracting the
solar motion from a large sample of stellar velocities. Figure 10.9 defines
the coordinate system that is generally used in kinematic studies of the
solar neighborhood. The $x$ direction points towards the Galactic center,
the direction of Galactic rotation is the $y$ direction, and the $z$ direction
points towards the NGP. It is conventional to use the letters $U$, $V$ and $W$ for
$v_x$, $v_y$ and $v_z$, respectively. Figure 10.10 plots the $U$, $V$ and $W$ components
of the solar motion relative to MS stars for which the Hipparcos satellite has
determined accurate parallaxes. Different colors yield fairly similar values
of $U_\odot$ and $W_\odot$, so we may estimate the radial and vertical components of
the solar motion with respect to the Galactic center by averaging individual
values of $U_\odot$ and $W_\odot$ in Figure 10.10. For reasons that will emerge below,
we exclude the bluest data-point from this average and find the radial and
vertical components of this motion to be


    $U_\odot \equiv v_{\odot x} = 10.0 \pm 0.4\,\mathrm{km\,s^{-1}}$,
    $W_\odot \equiv v_{\odot z} = 7.2 \pm 0.4\,\mathrm{km\,s^{-1}}$.    (10.10)


Figure 10.10 The three components of the solar motion as functions
of $B - V$. [After Dehnen & Binney (1998a)]


Figure 10.11 The $V$ component of the solar motion relative to
different stellar types is a linear function of the squared random velocity $S^2$
of each type. [After Dehnen & Binney (1998a)]


In Figure 10.10 it is striking that for $B - V < 0.61$, $V_\odot$ increases
steadily with increasing $B - V$, while for $B - V > 0.61$, $V_\odot$ is independent
of $B - V$. Figure 10.11 shows that these variations in $V_\odot$ are linearly related
to the squared random velocity $S^2$ for each stellar group; see Problem 10.5
for the definition of $S^2$. In fact, theory predicts this dependence of $V_\odot$ on
$S^2$; see equation (4-35) of BT. The straight line in Figure 10.11 shows a
suitable linear fit to the data, with the data point for the bluest stars again
excluded from the fit. From the $y$ intercept of this line we infer the value,
$5.2 \pm 0.6\,\mathrm{km\,s^{-1}}$, of $V_\odot$ for a hypothetical stellar type that had $S^2 = 0$. In
§9.1 we defined the local standard of rest (LSR) as the velocity of the closed
orbit within the plane that passes through the present location of the Sun.
Since a class of stars that moved on such closed orbits would have $S^2 = 0$, we
conclude that the solar motion relative to the LSR has $V_\odot = 5.2 \pm 0.6\,\mathrm{km\,s^{-1}}$.
In summary, the Sun’s motion relative to the LSR is


    $U_\odot = 10.0 \pm 0.4\,\mathrm{km\,s^{-1}}$,
    $V_\odot = 5.2 \pm 0.6\,\mathrm{km\,s^{-1}}$,
    $W_\odot = 7.2 \pm 0.4\,\mathrm{km\,s^{-1}}$,
    so that $|\mathbf{v}_\odot| = 13.4\,\mathrm{km\,s^{-1}}$.    (10.11)
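
The extrapolation to $S^2 = 0$ described above amounts to reading off the intercept of a straight-line fit. A minimal sketch follows; the data points are synthetic and made up for illustration (they are not the values plotted in Figure 10.11), and the slope used to generate them is an arbitrary assumption.

```python
import numpy as np

rng = np.random.default_rng(1)

# Synthetic stellar types: squared random velocity S^2 in (km/s)^2.
s2 = np.linspace(200.0, 2000.0, 9)

# Generate V components of the solar motion from an ASSUMED linear relation plus noise.
intercept_in = 5.2      # km/s, the intercept quoted in the text
slope_in = 0.01         # (km/s)^-1, purely illustrative
v_sun = intercept_in + slope_in * s2 + rng.normal(scale=0.3, size=s2.size)

# Straight-line fit; np.polyfit returns the highest-order coefficient first.
slope_fit, intercept_fit = np.polyfit(s2, v_sun, 1)

# The intercept estimates V of the Sun relative to a type with S^2 = 0, i.e. the LSR.
v_sun_lsr = intercept_fit

# Magnitude of the solar motion from the three components of equation (10.11).
speed = float(np.sqrt(10.0**2 + 5.2**2 + 7.2**2))
```

The last line reproduces the magnitude $|\mathbf{v}_\odot| \approx 13.4$ km/s quoted in equation (10.11).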


Thus, the Sun is moving in toward the Galactic center and up toward the 
north Galactic pole and away from the plane. It is also moving around 
the Galactic center faster than it would if it were on a circular orbit. It 
follows that the Sun is currently inside its guiding-center radius $R_g$ and is
approaching, but has not yet quite reached, the pericenter of its orbit — see 
equation (9.7) and §3.3.3 of BT for a discussion of orbits like the Sun’s. 
From this point on we will whenever possible refer the velocities of stars
to the velocity of the LSR rather than that of the Sun. In particular, we
shall assume that the means $\langle U \rangle$ and $\langle W \rangle$ vanish for stars near the Sun.
The systematic trend of $V_\odot$ with $S^2$ shown in Figure 10.11 is a reflection
of a phenomenon called asymmetric drift, which is the tendency of the
mean rotation velocity of a stellar population to lag behind that of the LSR
more and more with increasing random motion within the population; see
§10.3.2 below and §4.2.1 of BT. When the Sun’s velocity is referred to such
a lagging reference frame, it acquires a value of $V$ which grows as the lag
increases. It is easy to see that the magnitude of the asymmetric drift, $V_a$, of
any stellar type is given by the difference between that type’s $y$-coordinate in
Figure 10.11 and the $y$-intercept of the straight line through the points. We
shall see that the squared random velocity, $S^2$, is linearly related to $\langle U^2 \rangle$,
so a linear relation holds between $V_a$ and $\langle U^2 \rangle$. This relation takes the form


    $V_a \simeq \frac{\langle U^2 \rangle}{80 \pm 5\,\mathrm{km\,s^{-1}}}$    (10.12)


(Dehnen & Binney 1998a). The bottom curve in Figure 10.12 shows the
variation of $V_a$ with $B - V$ for MS stars.
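
As a rough numerical illustration of equation (10.12), the sketch below evaluates the asymmetric drift for one assumed radial dispersion. The function name and the input value of 38 km/s are illustrative assumptions; $\langle U^2 \rangle$ is taken equal to the squared dispersion, which holds when $\langle U \rangle = 0$ as assumed in the text.

```python
def asymmetric_drift(mean_u2, k=80.0):
    """Asymmetric drift V_a (km/s) from <U^2> in (km/s)^2, with k in km/s (eq. 10.12)."""
    return mean_u2 / k


sigma_u = 38.0                                   # km/s, illustrative input
va_mid = asymmetric_drift(sigma_u**2)            # central value, k = 80 km/s
va_low = asymmetric_drift(sigma_u**2, k=85.0)    # k at the upper end of 80 +/- 5
va_high = asymmetric_drift(sigma_u**2, k=75.0)   # k at the lower end of 80 +/- 5

# V component of the solar motion measured relative to such a population:
# the LSR value of 5.2 km/s plus the lag of the population.
v_relative_to_type = 5.2 + va_mid
```

For this input the drift is about 18 km/s, with the quoted uncertainty in the denominator moving it by roughly 1 km/s in either direction.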


10.3.2 Random velocities of stars 


As we have already seen, important dynamical information is contained in
the dispersion of velocities,


    $\sigma_i \equiv \langle (v_i - \langle v_i \rangle)^2 \rangle^{1/2}$,    (10.13)


of each stellar type about the mean velocity of that type. The upper three
curves in Figure 10.12 show, as a function of $B - V$, values for $\sigma_x$, $\sigma_y$, and
$\sigma_z$ for MS stars. For every color we have $\sigma_x > \sigma_y > \sigma_z$. Roughly speaking,
$\sigma_z \approx 0.5\sigma_x$, while the ratio $\sigma_y/\sigma_x$ lies in the range from 0.55 to 0.7. The fact
that $\sigma_z \neq \sigma_x$ has important dynamical implications; see Chapters 3 and
4 of BT. The value of the ratio $\sigma_y/\sigma_x$ has often been used to constrain the
shape of the Milky Way’s circular-speed curve; see below.

In Figure 10.12 all three velocity dispersions increase with $B - V$ color
up to $B - V \approx 0.6$ and are approximately constant redward of $B - V \approx 0.6$,
just as the asymmetric drift $V_a$ increases with $B - V$ to $B - V = 0.61$ and
then remains constant. From the theory of stellar evolution (§5.1), we know
that MS stars bluer than $B - V \approx 0.6$ are all younger than $\sim 10\,\mathrm{Gyr}$. In
contrast, redder stars are a mixture of a few young stars and mostly old
stars, and the fact that they have systematically larger velocity dispersions
suggests the operation of a mechanism that leads to a progressive increase
of the dispersion with time. We shall return to this idea in §10.7 below.


Figure 10.13 Orientation of the velocity ellipsoid.
The direction to the Galactic center is upwards and
the Sun lies at the center of the ellipse and moves
towards the left. A portion of its orbit is shown.

Vertex deviation  In addition to squares of velocity components such
as $v_x^2$, one can average products of velocity components such as $v_x(v_y -
\langle v_y \rangle)$. For all stellar types, the averages of the two products of this type that
involve $v_z$ are smaller than the errors. However, for many stellar types the
average of the third product, $v_x(v_y - \langle v_y \rangle)$, is significantly different from zero.
Evidently for stars of these types $v_x$ and $v_y$ are not statistically independent:
if $\langle v_x(v_y - \langle v_y \rangle) \rangle > 0$, then if $v_y - \langle v_y \rangle$ is measured for some star and found to
be positive, that star is more likely to have a positive than a negative value of $v_x$.
When confronted with such correlated observables, it can be helpful to find
linear combinations of the observables which are statistically independent.
Therefore we define


    $v_1 \equiv v_x \cos l_v - (v_y - \langle v_y \rangle) \sin l_v$,
    $v_2 \equiv v_x \sin l_v + (v_y - \langle v_y \rangle) \cos l_v$,    (10.14)


where the angle $l_v$ is called the vertex deviation. Now on multiplying these
equations together we find that


    $\langle v_1 v_2 \rangle = \tfrac{1}{2}(\sigma_x^2 - \sigma_y^2) \sin 2l_v + \langle v_x(v_y - \langle v_y \rangle) \rangle \cos 2l_v$.    (10.15)


Hence $v_1$ and $v_2$ will be statistically independent in the sense that $\langle v_1 v_2 \rangle = 0$
if we set


    $l_v = \tfrac{1}{2} \arctan\left( \frac{-2 \langle v_x(v_y - \langle v_y \rangle) \rangle}{\sigma_x^2 - \sigma_y^2} \right)$.    (10.16)

Box 10.2: The Velocity Ellipsoid


Suppose we wish to determine the mean-square speed in a direction $\hat{\mathbf{n}}$ that lies
in the plane but makes an angle $\psi$ with
the $v_1$-direction. (The direction to the
Galactic center is upwards.) Then since
the velocity in the direction of $\hat{\mathbf{n}}$ is $v_n =
v_1 \cos\psi + v_2 \sin\psi$ and $\langle v_1 v_2 \rangle = 0$, we have


    $\langle v_n^2 \rangle = \langle v_1^2 \rangle \cos^2\psi + \langle v_2^2 \rangle \sin^2\psi = \langle v_1^2 \rangle (\cos^2\psi + q^2 \sin^2\psi)$,


where $q$ is the axis ratio of the ellipse shown in the figure. It is easy to
see that the coordinates of the point marked with a blob are $(v_1, v_2) =
\langle v_1^2 \rangle^{1/2} \times (q \sin\psi, \cos\psi)$. Hence $\langle v_n^2 \rangle^{1/2}$ is the distance from the origin to
the blob.


The arctan function introduces some ambiguity in the definition of $l_v$. By
convention this ambiguity is resolved such that $\sigma_1 \equiv \langle v_1^2 \rangle^{1/2} > \sigma_2 \equiv \langle v_2^2 \rangle^{1/2}$.
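
The recipe for the vertex deviation can be sketched as follows. The sketch assumes the sign convention in which $v_1 = v_x \cos l_v - (v_y - \langle v_y \rangle) \sin l_v$ and $v_2 = v_x \sin l_v + (v_y - \langle v_y \rangle) \cos l_v$, so that $\langle v_1 v_2 \rangle = 0$ requires $\tan 2l_v = -2\langle v_x (v_y - \langle v_y \rangle)\rangle / (\sigma_x^2 - \sigma_y^2)$. The function name and the synthetic sample are illustrative assumptions, and the two-argument arctangent is used here as one convenient way of resolving the ambiguity so that $\sigma_1 \geq \sigma_2$.

```python
import numpy as np


def vertex_deviation(vx, vy):
    """Return (l_v in radians, v1, v2) for arrays of in-plane velocities."""
    wx = vx - np.mean(vx)           # the text takes <v_x> = 0; subtracting is harmless
    wy = vy - np.mean(vy)
    diff = np.mean(wx**2) - np.mean(wy**2)      # sigma_x^2 - sigma_y^2
    cov = np.mean(wx * wy)                      # <v_x (v_y - <v_y>)>
    l_v = 0.5 * np.arctan2(-2.0 * cov, diff)    # branch that makes sigma_1 >= sigma_2
    v1 = wx * np.cos(l_v) - wy * np.sin(l_v)
    v2 = wx * np.sin(l_v) + wy * np.cos(l_v)
    return l_v, v1, v2


# Synthetic sample: independent principal-axis velocities rotated by a known angle.
rng = np.random.default_rng(2)
n = 50000
l_true = np.radians(20.0)
v1_in = rng.normal(scale=30.0, size=n)
v2_in = rng.normal(scale=18.0, size=n)
# Inverse of the rotation defined above.
vx = v1_in * np.cos(l_true) + v2_in * np.sin(l_true)
vy = -v1_in * np.sin(l_true) + v2_in * np.cos(l_true)

l_v, v1, v2 = vertex_deviation(vx, vy)
sigma_1, sigma_2 = np.std(v1), np.std(v2)
```

By construction the sample average of $v_1 v_2$ vanishes to rounding error, and the recovered angle differs from the input only by sampling noise.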

The linear transformation (10.14) is a rotation of coordinates, so $v_1$
and $v_2$ are simply components of velocity in directions that are inclined
by $l_v$ to the center-anticenter direction. Figure 10.13 shows a geometrical
interpretation of what is going on. If we draw the ellipse marked with semi-axes
$\sigma_1$ and $\sigma_2$, then the mean-square of the component of velocity in any
direction can be obtained by the construction described in Box 10.2. In fact,
we can generalize this construction by replacing the circle in Box 10.2 with a
sphere and the ellipse with the ellipsoidal surface that has semi-axes $\sigma_1$, $\sigma_2$
and $\sigma_z$. This surface is called the velocity ellipsoid. For all types of stars in
the solar neighborhood, one of its principal planes coincides with the plane
of the Milky Way, but its longest axis deviates from the center-anticenter
direction by the vertex deviation $l_v$. Table 10.2 gives, as a function of $B - V$,
values of the $\sigma_i$ and values of $l_v$. The latter decreases with increasing $B - V$
from $\sim 30^\circ$ for the bluest stars to $\sim 10^\circ$ for stars redder than $B - V \approx 0.45$.
We discuss the causes of the vertex deviation below.

Our discussion of solar-neighborhood kinematics has focused on MS
stars because the largest body of accurate and homogeneous data is available
for these objects. Table 10.3 shows that other stellar groups display exactly
the same general kinematic trends as MS stars, however. In particular, older
stellar groups are characterized by larger velocity dispersions and asymmetric
drift velocities than are younger stellar groups, and vertex deviation tends
to decline with increasing age.


Table 10.2 Velocity dispersions and vertex deviations for MS
stars

(B−V)min   (B−V)max   σ1/km s⁻¹   σ2/σ1   σz/σ1   l_v/deg
 −0.238     0.139      14.35       0.65    …       …
  0.139     0.309      20.17       0.47    0.40    …
  0.309     0.412      22.39       …       …       …
  0.412     0.472      26.26       0.60    …       …
  0.472     0.525      30.37       …       …       …
  0.525     0.582      39.03       0.66    …       …
  0.582     0.641      37.64       0.62    …       10.0
  0.641     0.719      …           0.62    …       …
  0.719     1.543      37.20       …       …       …
  0.610     1.543      37.91       0.65    0.54    10.3

SOURCE: Data published in Dehnen & Binney (1998a)


The Schwarzschild distribution  Each component of the velocity distribution
of, say, a population of oxygen molecules at room temperature
has a Gaussian probability distribution. Schwarzschild (1907) pointed out
that a similar probability distribution can account for many aspects of the
probability distribution of stellar velocities. The main difference between
the case of molecules in air and stars is that in the former case the velocity
dispersion is independent of direction, whereas, as we have seen, in the
stellar case the dispersion of a component of velocity depends strongly on
direction. Schwarzschild postulated that the probability that the components
of velocity $(v_1, v_2, v_z)$ defined above lie in the element of velocity space
$d^3\mathbf{v} \equiv dv_1\,dv_2\,dv_z$ is


    $P(\mathbf{v})\,d^3\mathbf{v} = \frac{d^3\mathbf{v}}{(2\pi)^{3/2} \sigma_1 \sigma_2 \sigma_z} \exp\left[ -\left( \frac{v_1^2}{2\sigma_1^2} + \frac{v_2^2}{2\sigma_2^2} + \frac{v_z^2}{2\sigma_z^2} \right) \right]$.    (10.17)


This probability distribution is known as the Schwarzschild distribution.
In §7.5 of BT it is shown to have a natural dynamical interpretation. Notice
that $P$ is constant on ellipsoids in velocity space.
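
Equation (10.17) is a product of three independent Gaussians, which the following sketch evaluates and checks for normalization on a grid. The function name and the three dispersions are illustrative assumptions, not values taken from the tables.

```python
import numpy as np


def schwarzschild_pdf(v1, v2, vz, s1, s2, sz):
    """Probability density of equation (10.17) at velocity (v1, v2, vz)."""
    norm = (2.0 * np.pi) ** 1.5 * s1 * s2 * sz
    exponent = -0.5 * ((v1 / s1) ** 2 + (v2 / s2) ** 2 + (vz / sz) ** 2)
    return np.exp(exponent) / norm


# Illustrative dispersions in km/s (assumed values).
s1, s2, sz = 38.0, 24.0, 20.0

# Grid covering +/- 6 dispersions along each principal axis.
g1 = np.linspace(-6 * s1, 6 * s1, 61)
g2 = np.linspace(-6 * s2, 6 * s2, 61)
gz = np.linspace(-6 * sz, 6 * sz, 61)
V1, V2, VZ = np.meshgrid(g1, g2, gz, indexing="ij")

# Simple Riemann sum of P over the grid; it should be very close to 1.
cell_volume = (g1[1] - g1[0]) * (g2[1] - g2[0]) * (gz[1] - gz[0])
total_probability = schwarzschild_pdf(V1, V2, VZ, s1, s2, sz).sum() * cell_volume

# P takes the same value at any two points on the same ellipsoid, for example
# one dispersion out along the v1 axis and one dispersion out along the vz axis.
p_on_axis_1 = schwarzschild_pdf(s1, 0.0, 0.0, s1, s2, sz)
p_on_axis_z = schwarzschild_pdf(0.0, 0.0, sz, s1, s2, sz)
```

The equality of the last two values illustrates the remark that $P$ is constant on ellipsoids in velocity space.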

Figure 10.14 shows the distributions in $U$, $V$ and $W$ of two samples of
nearby stars for which accurate radial velocities and Hipparcos parallaxes
are available.⁸ The upper panels are for MS stars bluer than $B - V = 0.34$;
these are mostly A stars. The lower panels are for K and M dwarfs from
Vyssotsky (1963). From the shapes of the upper histograms it is plausible
that the Schwarzschild distribution gives a reasonable model of the velocity
distributions of the A stars. However, from the shape of the lower histogram
of $V$ components it is clear that Schwarzschild’s model cannot provide an
adequate representation of the data for stellar types that have higher velocity
dispersions, such as M dwarfs. Specifically, it is inherently unable to
reproduce the characteristic asymmetry of the measured distribution of $V$
components: whereas the distribution cuts off sharply at positive $V$, it has
a long tail towards negative $V$.


⁸ See Jahreiss & Wielen (1997) for details of these samples.


Table 10.3 Kinematics of non-MS stars

                          Asymm.         Dispersions       Vertex
Stellar type              drift, V_a     σ_x   σ_y   σ_z   dev., l_v (°)
Giants
  A                         6.4           22    13     9    27
  F                        13.3           28    15     9    14
  G                         5.9           26    18    15    12
  K0                       13.4           31    21    16    14
  K3                       11.5           31    21    17     4
  M                        13.1           31    23    16     7
Supergiants
  Classical Cepheids        6.8           13     9     5     -
  O-B5                      8.2           12    11     9    36
  F-M                       6.5           13     9     7    18
Other
  Carbon stars             27             48    23    16     -
  Subgiants                23             43    27    24     -
  Planetary Nebulae        24             45    35    20     -
  White Dwarfs             32             50    30    25     -
  Variables, P > 300 d     22             50    40    30     -
  Variables, P < 300 d     37             80    60    60     -
  RR Lyrae, P < 0.45 d     26             45    40    25     -
  RR Lyrae, P > 0.45 d    220            160   100   120     -
  Subdwarfs               145            100    75    50     -

SOURCE: From data published in Delhaye (1965)

The origin of this asymmetry in $V$ is easy to understand. Solar-neighborhood
stars with negative $V$ have less tangential motion than is required
to be on a circular orbit at $R_0$. Hence they are closer to apocenter than
pericenter, and we may think of them as being at home interior to the solar
circle. Moreover, the smaller a star’s value of $V$ is, the further inside $R_0$ its
home lies. Now from §4.4 we know that the surface densities of galactic disks
increase exponentially towards the center. Moreover, in §11.3.2 we shall find
that the velocity dispersions within a disk also increase exponentially as we
move inwards. For both these reasons we expect many more stars to visit us
in the solar neighborhood from small radii than from large radii: not only
does the density of tourists increase inwards, but so does the size of their
travel budgets and therefore the distance from which they can visit us. The
skewness of the $V$ distributions in Figure 10.14 is simply a reflection of these
basic facts. The asymmetric drift discussed above is another consequence
of these facts, for the skewness of the $V$ distribution drags the mean value of
$V$ for a stellar type to more and more negative values the larger the type’s
velocity dispersions and therefore the more skewed its $V$ distribution.


Figure 10.14 Upper panels: histograms of $U$, $V$ and $W$ for a sample of 323 nearby MS
stars of MK type F1 and earlier. Lower panels: similar data for 510 K and M dwarfs.
Velocities are with respect to the LSR that is defined by equations (10.11). [From data
kindly supplied by H. Jahreiss]


Star streams⁹  Figure 10.15 shows the densities of MS stars of various
colors in the $(U, V)$ plane. Each panel is for a different range in $B - V$, with
the bluest and youngest stars at top left and the reddest and oldest stars at
bottom right. We can immediately recognize in these plots phenomena with
which we are already familiar.
1. As one proceeds to redder groups, the distribution of stars spreads. The
increase in the $\sigma_i$ with $B - V$ quantifies this phenomenon.
2. Three of the maps show a clear tendency for the region of highest stellar
density to be elongated along a line that slopes from bottom left to top
right. The vertex deviation quantifies this effect.
3. The lower two distributions clearly peak at $V < 0$ and extend further to
negative $V$ than to positive $V$. The asymmetric drift and the skewness
of the $V$ distributions in Figure 10.14 reflect this asymmetry.


⁹ For mathematical models of the $V$ distributions, see Cuddeford & Binney (1994).


Figure 10.15 The density of stars near the Sun in velocity space. Each panel shows the
density of MS stars projected onto the $(U, V)$ plane for a different range of $B - V$ color,
with the bluest stars at top left and the reddest stars at bottom right. The Sun’s velocity
is at $U = V = 0$ and the velocity of the LSR that is defined by equations (10.11) is marked
by a triangle. [After Dehnen (1998) from data kindly supplied by W. Dehnen]


In addition to these familiar phenomena, the maps of Figure 10.15 reveal 
the presence of tight clumps of stars in velocity space over and above the 
large-scale structure that the Schwarzschild distribution reproduces. In this 
connection, it is important that in Figure 10.15 each map is constructed from 
a separate sample of stars. Consequently, structure that is present in more 
than one map is almost certainly real rather than an artifact introduced by 
sampling noise. 

The top right-hand panel of Figure 10.15 shows four strong peaks in the
stellar density. The peak at $(U, V) = (-10, -5)\,\mathrm{km\,s^{-1}}$ is associated with the
LSR, which is marked with a triangle. The peak at $(U, V) = (9, 3)\,\mathrm{km\,s^{-1}}$
corresponds to the Sirius stream, that at $(-12, -22)\,\mathrm{km\,s^{-1}}$ to the Pleiades
stream and that at $(-35, -18)\,\mathrm{km\,s^{-1}}$ corresponds to the Hyades stream.
Careful examination of the lower two panels of Figure 10.15 reveals that
several wiggles and lumps that might be dismissed as noise if they occurred
in only one plot are evident in both plots and therefore probably
reflect real concentrations of stars; examples are evident near $(U, V) =
(-40, -45)$, $(15, -60)$ and $(0, -100)\,\mathrm{km\,s^{-1}}$.

These streams, or moving groups as they are often called, are thought 
to be vestiges of the clusters and associations in which most stars form 
(§6.2.2). Long after their stars have drifted far apart and they are no longer 
easily distinguishable by their space concentration, the U and V velocities of 
their stars remain very similar and betray the stars’ common origin. Conclu- 
sive proof of this common origin comes from the fact that when all the stars 
of a moving group are plotted in a CM diagram, one sees the characteristic 
structure of the CM diagram of a coeval cluster (§6.2.2 and Eggen 1965). 

Given these facts, one would expect moving groups to be most important 
for young stars and therefore most apparent in the distribution of early-type 
stars. This expectation is borne out by Figure 10.15, the top left panel of 
which, for stars bluer than B — V = 0, is dominated by just two moving 
groups. Moreover, the brightest early-type stars in the night sky do not 
lie in the Galactic plane, but rather are found in Gould’s belt in a plane 
inclined by about 16°. A detailed kinematical analysis of these stars (Lesh 
1968) suggests that they form an expanding group. 

Interestingly, Dehnen (1998) shows that star streams do not show up 
clearly in plots that involve the vertical component of velocity, W. This fact 
presumably arises because the Galactic potential changes most rapidly in the 
vertical direction with the consequence that as the stars of an association 
drift apart, they experience significantly different vertical forces when they 
are still subject to very similar radial and azimuthal forces. Clearly, once two 
stars experience very different vertical forces, their vertical velocities rapidly 
move apart. 


Causes of vertex deviation If we imagine our Galaxy to be an axisym- 
metric system in a steady state in which stars are distributed at random in 
their orbits, then, from considerations of symmetry alone, we would expect 
to find one axis of the velocity ellipsoid of stars in the Galactic plane pointing 
exactly at the Galactic center. This expectation is borne out by dynamical 
theory (see BT §7.5), but, as was mentioned earlier, the vertex deviation, $l_v$,
often differs significantly from zero. 

The existence of moving groups is significant for our understanding of 
the vertex deviation for the following reason. If a significant fraction of the 
objects upon which the numbers in Table 10.2 are based are, in fact, mem- 
bers of a few distinct moving groups, then the number of truly independent 
velocities employed is strongly reduced, and the statistical errors in the re- 
sults will be seriously underestimated. In these circumstances, estimates 
of the orientation of the velocity ellipsoid axes and the velocity dispersion 
along these axes will be noisy. For example, if in Figure 10.15 the Pleiades
stream lay at $(+12, -20)\,\mathrm{km\,s^{-1}}$ rather than at $(-12, -20)\,\mathrm{km\,s^{-1}}$, the vertex
deviation would be drastically reduced, especially for stars bluer than
$B - V = 0.6$.

It is not clear, however, that moving groups are entirely responsible for
the non-zero values of $l_v$ in Table 10.2. A significant additional cause of
vertex deviation could be the non-axisymmetric component of the Galactic 
potential. In §4.4.6 we encountered evidence that spiral structure affects 
the distribution of old as well as young stars, and in §8.2.3 we saw how the 
resulting spiral perturbation in a galaxy’s gravitational potential imposes a 
pattern on the velocity field of the ISM. The velocities of stars must also be 
changed by the same spiral gravitational field. One can show that stellar 
types with small random velocities will suffer larger velocity changes than 
types with large velocity dispersions — see §6.2 of BT. One may also show that 
the spiral field will produce a vertex deviation (Kuijken & Tremaine 1994). 
Thus observations of spiral structure and dynamics together predict that all 
stellar types will show a vertex deviation, but that this will be largest for 
stellar types with low velocity dispersions, just as is observed. It is not yet 
clear whether spiral structure can account satisfactorily for all the observed 
vertex deviations, or whether moving groups also play an important role.


10.3.3 The Oort constants 


Before we go on to discuss correlations between the kinematic properties 
of stars and their ages and chemical compositions, we should plug a gap 
in our discussion. This gap concerns differential rotation within the solar 
neighborhood. We have proceeded above as though the solar neighborhood 
is of negligible size compared to the Milky Way, so that one standard of rest 
applies throughout. Actually, the ‘solar neighborhood’ must be considered 
to be a sphere large enough to contain an adequate sample of whatever stars 
are under investigation. If these stars are intrinsically rare, for example most 
kinds of giants, the sphere may have to have a non-negligible radius and the 
effects within it of differential rotation may be significant. 

We now show how these effects may be quantified and thus allowed for in 
kinematic studies. One valuable spin-off of the analysis will be evidence that 
the mean motion at each point of the disk is circular rather than significantly 
elliptical, for example. Another will be important information regarding the 
circular-speed curve of the Milky Way. 

By analogy with the definition of the LSR (see §9.1), we associate a
standard of rest (SR) with any point in the disk: $\mathbf{v}_{\mathrm{SR}}(\mathbf{x})$ is the velocity
at $\mathbf{x}$ of the closed orbit that passes through $\mathbf{x}$. With this definition,
$\mathbf{v}_{\mathrm{LSR}} = \mathbf{v}_{\mathrm{SR}}(\mathbf{x}_0)$. If the Milky Way were axisymmetric, $\mathbf{v}_{\mathrm{SR}}$ would be everywhere
perpendicular to the local direction to the Galactic center and equal
in magnitude to the local circular speed $v_c(R)$.

By a direct extension of the logic employed in §10.3.1, we argue that
at any point in the disk, $\mathbf{v}_{\mathrm{SR}}(\mathbf{x})$ should lie close to the mean velocity at $\mathbf{x}$
of any stellar type that has small random velocities. Hence key information
regarding $\mathbf{v}_{\mathrm{SR}}$ can be gleaned from studies of the proper motions and radial
velocities of stars near the Sun. With this idea in mind, we now ask how the
variation of $\mathbf{v}_{\mathrm{SR}}$ from point to point will be reflected in the pattern of radial
velocities of stars near the Sun.

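Let $\delta\mathbf{v}(\mathbf{x}) \equiv \mathbf{v}_{\mathrm{SR}}(\mathbf{x}) - \mathbf{v}_{\mathrm{LSR}}$ and let the $(x, y, z)$ coordinate system be
defined as in Figure 10.9. Each of the two components of $\delta\mathbf{v}$ is a function of
the two variables $(x, y)$ and may be Taylor expanded about the origin of the
$(x, y)$ system. Since $\delta\mathbf{v}$ vanishes at the origin by construction, we may write
this series in the form


    $\begin{aligned}
    \begin{pmatrix} \delta v_x \\ \delta v_y \end{pmatrix}
      &= \begin{pmatrix} \partial \delta v_x / \partial x & \partial \delta v_x / \partial y \\ \partial \delta v_y / \partial x & \partial \delta v_y / \partial y \end{pmatrix}
         \begin{pmatrix} x \\ y \end{pmatrix} + O(x^2 + y^2) \\
      &\equiv \begin{pmatrix} k + c & a - b \\ a + b & k - c \end{pmatrix}
         \begin{pmatrix} x \\ y \end{pmatrix} + O(x^2 + y^2).
    \end{aligned}$    (10.18)


Here the partial derivatives are evaluated at the origin and $a$, $b$, $c$, $k$ are linear
combinations of the values of these derivatives; the reason for writing the
partial-derivative matrix in this way will emerge shortly. We expect the
mean radial velocity of the stars at $\mathbf{x} = (x, y)$ to be the projection of $\delta\mathbf{v}$ on
to $\hat{\mathbf{x}}$:


    $v_{\mathrm{los}} = \frac{1}{d}\, \mathbf{x} \cdot \delta\mathbf{v} \simeq \frac{1}{d} \left[ (k + c) x^2 + (k - c) y^2 + 2 a x y \right]$,    (10.19)

where $d \equiv (x^2 + y^2)^{1/2}$ is heliocentric distance as usual. In terms of Galactic
longitude $l$, we have $x = d \cos l$ and $y = d \sin l$, so with the aid of two
trigonometric identities equation (10.19) becomes


    $v_{\mathrm{los}} = d\,(k + c \cos 2l + a \sin 2l) + O(d^2)$.    (10.20)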


This equation suggests that we could determine the constants a, c and k by 
measuring the radial velocities of stars of similar distances and plotting the 
results as a function of longitude $l$. The stars could be selected to be stars
of given spectral type and apparent magnitude, thus guaranteeing that they 
all have similar distances. 
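
The fit suggested by equation (10.20) is linear in $k$, $c$ and $a$, so it can be sketched as an ordinary least-squares problem. In the sketch below the function name, the synthetic sample and the input constants (in particular $a = 14$ km/s/kpc and a 10 km/s velocity scatter) are illustrative assumptions rather than measured values.

```python
import numpy as np


def fit_radial_velocity_constants(l_rad, d_kpc, v_los):
    """Least-squares estimate of (k, c, a) in v_los = d (k + c cos 2l + a sin 2l)."""
    design = d_kpc[:, None] * np.column_stack(
        [np.ones_like(l_rad), np.cos(2.0 * l_rad), np.sin(2.0 * l_rad)]
    )
    coeffs, *_ = np.linalg.lstsq(design, v_los, rcond=None)
    return coeffs


# Synthetic sample of stars at similar distances (all inputs are assumptions).
rng = np.random.default_rng(3)
n_stars = 2000
l_rad = rng.uniform(0.0, 2.0 * np.pi, size=n_stars)
d_kpc = rng.uniform(0.8, 1.2, size=n_stars)
k_in, c_in, a_in = 0.0, 0.0, 14.0                     # km/s/kpc
v_los = d_kpc * (k_in + c_in * np.cos(2 * l_rad) + a_in * np.sin(2 * l_rad))
v_los = v_los + rng.normal(scale=10.0, size=n_stars)  # random stellar motions

k_fit, c_fit, a_fit = fit_radial_velocity_constants(l_rad, d_kpc, v_los)
```

The same design-matrix approach would carry over to proper motions, with the columns rearranged to match the corresponding transverse-velocity relation.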

The constant $b$ can be determined from the proper motions of stars. To
see this, let $\delta v_t$ be the component of velocity perpendicular to the line of
sight. Then

    $\begin{aligned}
    \delta v_t &= \frac{1}{d} (\mathbf{x} \times \delta\mathbf{v})_z = \frac{1}{d} (x\, \delta v_y - y\, \delta v_x) \\
               &= d\,(b + a \cos 2l - c \sin 2l) + O(d^2).
    \end{aligned}$    (10.21)


Hence from a plot of the proper motion $\mu_l = \delta v_t / d$ as a function of $l$ we can
determine $a$, $b$ and $c$.

The relations (10.20) and (10.21) above are completely general; they
assume only that $\mathbf{v}_{\mathrm{SR}}$ varies smoothly in the solar neighborhood. What
do they reduce to if we assume that the disk is in circular rotation with
angular speed $\Omega(R)$? For this case, equation (9.3) gives an exact relation
for $v_{\mathrm{los}}(l, R)$. To obtain a relation equivalent to equation (10.20), we need
to Taylor expand the right-hand side of equation (9.3) in powers of $d$. On
Taylor expanding $\Omega(R)$ to first order in $R - R_0$, equation (9.3) yields


    $\begin{aligned}
    v_{\mathrm{los}}(l, R) &\simeq \left. \frac{d\Omega}{dR} \right|_{R_0} (R - R_0)\, R_0 \sin l \\
                          &= -2A\,(R - R_0) \sin l,
    \end{aligned}$    (10.22)


where Oort’s constant $A$ is defined by


    $A \equiv -\frac{1}{2} \left( R \frac{d\Omega}{dR} \right)_{R_0} = \frac{1}{2} \left( \frac{v_c}{R} - \frac{dv_c}{dR} \right)_{R_0}$.    (10.23)


On neglecting $d^2$ in equation (9.16), we have


    $(R - R_0)(R + R_0) = R^2 - R_0^2 \simeq -2 R_0 d \cos l$,    (10.24)

so approximating $(R + R_0)$ by $2R_0$, we may write to first order in $d$

    $(R - R_0) \simeq -d \cos l$.    (10.25)

Combining equations (10.22) and (10.25) we find

    $v_{\mathrm{los}} \simeq A d \sin 2l$.    (10.26)

This is equation (10.20) for the special case of circular rotation. Evidently
in this case $c = k = 0$, and $a = A$ is given by equation (10.23).
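
To see how good the first-order result (10.26) is, one can compare it with the exact circular-rotation expression. The sketch below assumes that the exact relation has the standard form $v_{\mathrm{los}} = [\Omega(R) - \Omega(R_0)]\,R_0 \sin l$, and it adopts a flat rotation curve with $v_c = 220$ km/s and $R_0 = 8$ kpc purely for illustration.

```python
import numpy as np

R0 = 8.0       # kpc, solar radius adopted for illustration
VC = 220.0     # km/s, ASSUMED flat circular-speed curve


def omega(R):
    """Angular speed (km/s/kpc) for a flat rotation curve."""
    return VC / R


def v_los_exact(l_rad, d):
    """Assumed exact circular-rotation relation: [Omega(R) - Omega(R0)] R0 sin l."""
    R = np.sqrt(R0**2 + d**2 - 2.0 * R0 * d * np.cos(l_rad))
    return (omega(R) - omega(R0)) * R0 * np.sin(l_rad)


# Oort's A from equation (10.23) with dv_c/dR = 0.
A = 0.5 * VC / R0


def v_los_first_order(l_rad, d):
    """First-order result, equation (10.26)."""
    return A * d * np.sin(2.0 * l_rad)


l_rad = np.radians(30.0)
exact_near, approx_near = v_los_exact(l_rad, 0.5), v_los_first_order(l_rad, 0.5)
exact_far, approx_far = v_los_exact(l_rad, 3.0), v_los_first_order(l_rad, 3.0)

rel_err_near = abs(exact_near - approx_near) / abs(exact_near)
rel_err_far = abs(exact_far - approx_far) / abs(exact_far)
```

For this toy rotation curve the first-order formula is accurate to a few per cent at 0.5 kpc but is off by a much larger fraction at 3 kpc, which illustrates why the expansion is restricted to the solar neighborhood.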


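To derive equation (10.21) for the case of circular rotation we have to
go back to equation (9.1). We write


    $\begin{aligned}
    v_t &= \frac{1}{d} \Big( (\mathbf{R} - \mathbf{R}_0) \times \big[ \boldsymbol{\Omega}(R) \times \mathbf{R} - \boldsymbol{\Omega}(R_0) \times \mathbf{R}_0 \big] \Big)_z \\
        &= \frac{1}{d} \Big( (\mathbf{R} - \mathbf{R}_0) \times \big\{ \boldsymbol{\Omega}(R) \times (\mathbf{R} - \mathbf{R}_0) + [\boldsymbol{\Omega}(R) - \boldsymbol{\Omega}(R_0)] \times \mathbf{R}_0 \big\} \Big)_z \\
        &\simeq \Omega_z(R_0)\, d + \frac{(\mathbf{R} - \mathbf{R}_0) \cdot \mathbf{R}_0}{d} \left. \frac{d\Omega_z}{dR} \right|_{R_0} (R - R_0).
    \end{aligned}$    (10.27)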

Box 10.3: The $AR_0$ Formula

We can derive a useful formula from equation (9.4) for the terminal
velocity (see §9.2.3) in the direction $l$ by writing


    $\Omega(R_0 \sin l) - \Omega(R_0) \simeq \left( \frac{d\Omega}{dR} \right)_{R_0} R_0 (\sin l - 1) \quad \text{for } l \approx 90^\circ$.    (1)


Substituting for $d\Omega/dR$ from (10.23), equation (9.4) becomes, to first
order in $(1 - \sin l)$,


    $v_{\mathrm{los}}^{(t)}(l) = 2 A R_0 (1 - \sin l) \quad \text{for } l \approx 90^\circ$.    (2)


This formula is useful because it links two quantities that are key but hard
to measure accurately, $A$ and $R_0$, to the terminal velocity curve, which
can be measured quite accurately (see §9.2.1). Kerr & Lynden-Bell (1986)
list several determinations of $AR_0$, and find that recent observations give
$AR_0 = 108 \pm 3\,\mathrm{km\,s^{-1}}$. The data plotted in Figure 9.16 yield a rather
larger value, $AR_0 \approx 115\,\mathrm{km\,s^{-1}}$, but with considerable uncertainty.


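By drawing the appropriate triangle it is easy to see that
$(\mathbf R - \mathbf R_0) \cdot \mathbf R_0 = -d R_0 \cos l$. When we substitute this relation
and equation (10.25) into equation (10.27), we find

$$
\begin{aligned}
v_l &\simeq \Omega_z(R_0)\, d + d R_0 \left.\frac{d\Omega_z}{dR}\right|_{R_0} \cos^2 l \\
    &= d \left( \Omega_z(R_0) + \tfrac{1}{2} R_0 \left.\frac{d\Omega_z}{dR}\right|_{R_0} (1 + \cos 2l) \right).
\end{aligned}
\tag{10.28}
$$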


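We now recall that, because the Milky Way rotates clockwise, $\Omega_z = -\Omega$, and
define Oort’s constant $B$ by

$$
B \equiv -\left( \Omega + \tfrac{1}{2} R \frac{d\Omega}{dR} \right)_{R_0}
  = -\frac{1}{2} \left( \frac{v_c}{R} + \frac{dv_c}{dR} \right)_{R_0}. \tag{10.29}
$$

Equation (10.28) then becomes

$$
\mu_l = \frac{v_l}{d} = B + A \cos 2l. \tag{10.30}
$$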


This is equation (10.21) for the case of circular rotation. Evidently in the 
case of circular rotation b = B is given by equation (10.29). 
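
Equation (10.30) can be verified directly from the geometry. The sketch below places the Galactic center at the origin and the Sun at $(-R_0, 0)$, so that $l = 0$ lies along $+x$ and $l$ increases counterclockwise. It assumes a flat rotation curve with illustrative values $v_c = 220\,{\rm km\,s^{-1}}$ and $R_0 = 8$ kpc, and compares the exact $\mu_l$ of a nearby star with $B + A\cos 2l$.

```python
import math

R0, V0 = 8.0, 220.0                      # kpc, km/s (assumed, flat curve)
A = 0.5 * V0 / R0                        # equation (10.23) with dv_c/dR = 0
B = -0.5 * V0 / R0                       # equation (10.29) with dv_c/dR = 0

def velocity(x, y):
    """Velocity (km/s) on a clockwise circular orbit, Omega_z = -V0/R."""
    w = V0 / math.hypot(x, y)
    return (w * y, -w * x)

def mu_l_exact(l_deg, d):
    """Proper motion in longitude (km/s/kpc) of a star at distance d (kpc)."""
    l = math.radians(l_deg)
    xs, ys = -R0, 0.0                    # position of the Sun
    x, y = xs + d * math.cos(l), ys + d * math.sin(l)
    vx, vy = velocity(x, y)
    sx, sy = velocity(xs, ys)
    # The unit vector toward increasing l is (-sin l, cos l).
    v_l = -(vx - sx) * math.sin(l) + (vy - sy) * math.cos(l)
    return v_l / d

def mu_l_oort(l_deg):
    return B + A * math.cos(2.0 * math.radians(l_deg))

for l in (0.0, 45.0, 90.0, 200.0):
    print(l, round(mu_l_exact(l, 0.01), 3), round(mu_l_oort(l), 3))
```

With the Sun's own velocity pointing toward $l = 90^\circ$, as it must for clockwise rotation, the two columns agree to within the expected first-order error in $d/R_0$.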
$A$ measures the shear in the disk at the Sun. It would be zero if the
disk rotated like a solid body, such as a compact disk, for then $\Omega$ would be
independent of radius and, by equation (10.23), $A$ would vanish. $B$ measures
the vorticity of the material of the disk, that is, its tendency to circulate
about any given point; see Problem 10.8. From equations (10.23) and
(10.29) it immediately follows that

$$
v_c = R_0 (A - B) \quad \text{and} \quad \left( \frac{dv_c}{dR} \right)_{R_0} = -(A + B). \tag{10.31}
$$

In §8.2.4 we saw that the circular-speed curves of spirals are flat or gently
rising at large radii. If the Milky Way’s circular speed curve followed this
pattern at the Sun, we would have $A + B \lesssim 0$.
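
The definitions (10.23) and (10.29) are easily evaluated for any model rotation curve. The following sketch uses a centered finite difference for $dv_c/dR$; the three rotation curves (flat, solid-body and Keplerian) and their normalizations are illustrative assumptions, not fits to data.

```python
def oort_constants(v_c, R0, h=1e-4):
    """Return (A, B) in km/s/kpc for a rotation curve v_c(R) [km/s, R in kpc]."""
    dv = (v_c(R0 + h) - v_c(R0 - h)) / (2.0 * h)   # numerical dv_c/dR at R0
    omega0 = v_c(R0) / R0
    A = 0.5 * (omega0 - dv)     # equation (10.23)
    B = -0.5 * (omega0 + dv)    # equation (10.29)
    return A, B

def flat(R):
    return 220.0

def solid_body(R):
    return 27.5 * R

def keplerian(R):
    return 220.0 * (8.0 / R) ** 0.5

for name, curve in (("flat", flat), ("solid", solid_body), ("kepler", keplerian)):
    A, B = oort_constants(curve, 8.0)
    print(name, round(A, 3), round(B, 3), round(A - B, 3), round(-(A + B), 3))
```

In each case $A - B$ returns $v_c/R_0$ and $-(A + B)$ returns the local slope of the curve, as equations (10.31) require; the solid-body model has $A = 0$ and the flat model has $A + B = 0$.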


Estimating the Oort constants Kuijken & Tremaine (1991) reviewed 
the observational constraints on c and k, and concluded that 


$$
c = 0.6 \pm 1.1\,{\rm km\,s^{-1}\,kpc^{-1}}, \qquad k = -0.35 \pm 0.5\,{\rm km\,s^{-1}\,kpc^{-1}}. \tag{10.32}
$$


Thus these two constants are zero to within the errors, just as they should 
be if the Milky Way is axisymmetric. In view of this result, we shall assume 
that a = A and b = B, and henceforth make no distinction between these 
logically distinct entities. 

$A$ can be determined either from radial velocities [equation (10.20)], or
from proper motions [equation (10.21)]. Since the determination of $v_c(R_0)$ is
one of the harder problems in Galactic structure, equation (10.23) is usually
used to determine $v_c(R_0)$ from $A$ rather than vice versa. Notice that the value
of $A$ derived from radial velocities is inversely proportional to the assumed
distances $d$ of the observed stars, and must be updated whenever there is an
improvement in the accuracy of the standard distance scale.
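
The dependence of $A$ on the distance scale can be made explicit with a least-squares fit of $v_{\rm los} = A d \sin 2l$. The sketch below uses noise-free synthetic stars generated with an arbitrary value $A = 15\,{\rm km\,s^{-1}\,kpc^{-1}}$; the longitudes, distances and the 10% rescaling are invented for illustration.

```python
import math

def fit_A(stars):
    """Least-squares A for v_los = A d sin 2l.

    stars is a list of (l_deg, d_kpc, v_los_kms) tuples.
    """
    num = den = 0.0
    for l_deg, d, v in stars:
        x = d * math.sin(2.0 * math.radians(l_deg))
        num += x * v
        den += x * x
    return num / den

A_TRUE = 15.0   # km/s/kpc, arbitrary value used to build the synthetic sample
sample = [(l, d, A_TRUE * d * math.sin(2.0 * math.radians(l)))
          for l in (20.0, 65.0, 110.0, 250.0, 320.0)
          for d in (0.5, 1.0, 1.5)]

# The same velocities, but with every distance overestimated by 10 per cent.
rescaled = [(l, 1.1 * d, v) for l, d, v in sample]

print(round(fit_A(sample), 3), round(fit_A(rescaled), 3))
```

Lengthening every distance by a factor 1.1 lowers the fitted $A$ by the same factor, which is the inverse proportionality noted above.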

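Fundamentally, $B$ can only be obtained from proper motions. Traditionally,
equation (10.21) has been used. The great drawback of this technique is
that any rotation of one’s astrometric system (§2.1.5) will directly contribute
to the measured value of $B$. To show this, we suppose that our astrometric
system rotates at angular velocity $\boldsymbol\omega$, and calculate the contribution $\delta\mu_l$ that
this rotation makes to the left-hand side of equation (10.21). The rotation of
the system adds $\hat{\mathbf x} \times \boldsymbol\omega$ to the measured proper motion of a star that lies in
the direction $\hat{\mathbf x}$. We wish to calculate the component of this proper motion
that is perpendicular to the normal to the Galactic plane, $\hat{\mathbf z}$. This component
is

$$
\hat{\mathbf z} \times [(\hat{\mathbf x} \times \boldsymbol\omega) \times \hat{\mathbf z}]
  = (\hat{\mathbf z} \cdot \hat{\mathbf x})\, \hat{\mathbf z} \times \boldsymbol\omega
  - (\hat{\mathbf z} \cdot \boldsymbol\omega)\, \hat{\mathbf z} \times \hat{\mathbf x}. \tag{10.33}
$$

For stars that lie within the plane, $\hat{\mathbf z} \cdot \hat{\mathbf x} \simeq 0$ and $\hat{\mathbf z} \times \hat{\mathbf x}$ is just the unit vector
in the direction of increasing longitude $l$. Hence

$$
\delta\mu_l = -\hat{\mathbf z} \cdot \boldsymbol\omega, \tag{10.34}
$$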
which is obviously independent of $l$. However, in equation (10.21), $B$ is
precisely the term that is independent of $l$, so rotation of one’s astrometric
system directly affects the measured value of $B$. Moreover $B$ is small;
expressed in sensible units (like $A$ and the Hubble constant it is a frequency)
its value is $\sim -4 \times 10^{-16}$ Hz. So one has to be very sure that one’s frame
is not rotating before reporting a measurement of $B$! It is a tribute to the
diligence of astronomers that the errors in astrometric systems such as the
FK5 that are based on the dynamics of the solar system (§2.1.5), are only of
this order. Even so, values of $B$ that are based upon an extragalactic system
such as the ICRS (§2.1.5) are clearly to be preferred.
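
The size of $B$ in other units follows from two standard conversion factors, neither of which is quoted in this section: 1 kpc $\simeq 3.0857 \times 10^{16}$ km, and a proper motion of $1\,{\rm mas\,yr^{-1}}$ at 1 kpc corresponds to $4.74047\,{\rm km\,s^{-1}}$. The sketch below applies them to a representative value $B \simeq -12\,{\rm km\,s^{-1}\,kpc^{-1}}$.

```python
KM_PER_KPC = 3.0857e16   # kilometres in one kiloparsec
K_PM = 4.74047           # km/s per (mas/yr x kpc)

def kms_kpc_to_hz(x):
    """Convert km/s/kpc to a frequency in s^-1."""
    return x / KM_PER_KPC

def kms_kpc_to_mas_yr(x):
    """Convert km/s/kpc to an angular rate in mas/yr."""
    return x / K_PM

B = -12.0   # km/s/kpc, representative value
print(kms_kpc_to_hz(B))       # frequency in Hz
print(kms_kpc_to_mas_yr(B))   # equivalent rotation rate in mas/yr
```

On these numbers $B$ corresponds to a rotation of only about $2.5\,{\rm mas\,yr^{-1}}$, so a spurious frame rotation of $1\,{\rm mas\,yr^{-1}}$ would shift the measured $B$ by nearly $5\,{\rm km\,s^{-1}\,kpc^{-1}}$.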

Kerr & Lynden-Bell (1986) reviewed the observational constraints on A 
and B and concluded that 


$$
A = 14.4 \pm 1.2\,{\rm km\,s^{-1}\,kpc^{-1}}, \qquad B = -12.0 \pm 2.8\,{\rm km\,s^{-1}\,kpc^{-1}}. \tag{10.35}
$$


From the proper motions of Cepheids that were measured by the Hipparcos 
satellite (§2.1.3), Feast & Whitelock (1997) found 


$$
A = 14.8 \pm 0.8\,{\rm km\,s^{-1}\,kpc^{-1}}, \qquad B = -12.4 \pm 0.6\,{\rm km\,s^{-1}\,kpc^{-1}}. \tag{10.36}
$$


Since Hipparcos measured proper motions on the ICRS, these estimates are
the most reliable values of the Oort constants, and should supersede all
earlier values. Since $A > |B|$ they imply a gently falling circular-speed curve
at $R_0$. Numerically, $v_c(R_0) = 218\,(R_0/8\,{\rm kpc})\,{\rm km\,s^{-1}}$.

Recently an intriguing alternative to equation (10.21) has arisen, namely 
measurement by the VLA of the proper motion of the compact radio source 
Sgr A*, which is generally thought to mark the Galactic center (§9.5). Radio 
observations at seven epochs between 1981 and 1994 of the position of Sgr 
A* relative to extragalactic sources show that it has proper motion (Backer 
1996) 

$$
(\mu_l, \mu_b) = (-6.55 \pm 0.17,\ -0.48 \pm 0.12)\,{\rm mas\,yr^{-1}}. \tag{10.37}
$$

This proper motion is the sum of the proper motion of Sgr A* with respect
to the Galactic center and the reflex of the Sun’s velocity, which is itself
the sum of $v_c(R_0)$ and the solar motion with respect to the LSR. When
we subtract the contribution of the latter for $R_0 = 8$ kpc, we are left with
$(\mu_l', \mu_b') = (-6.24 \pm 0.17,\ -0.30 \pm 0.12)\,{\rm mas\,yr^{-1}}$. Clearly $\mu_b'$ must reflect the
intrinsic motion of Sgr A*, and the fact that it does not differ significantly
from zero encourages us to believe that Sgr A* is essentially stationary with
respect to the Galactic center. In this case, from the first of equations (10.31)
it follows that

$$
-\mu_l' = A - B = 30.1 \pm 0.8\,{\rm km\,s^{-1}\,kpc^{-1}}, \tag{10.38}
$$


which is marginally inconsistent with equation (10.36). 
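
The degree of inconsistency can be quantified from the numbers quoted in equations (10.36) and (10.38). The sketch below combines errors in quadrature, which assumes that the errors on $A$ and $B$ are independent; that assumption is ours and is not made in the text. It also evaluates the two relations (10.31) for $R_0 = 8$ kpc.

```python
import math

# Hipparcos Cepheid values, equation (10.36), in km/s/kpc
A, sig_A = 14.8, 0.8
B, sig_B = -12.4, 0.6

# Sgr A* value of A - B, equation (10.38), in km/s/kpc
omega_sgr, sig_sgr = 30.1, 0.8

omega_hip = A - B
sig_hip = math.hypot(sig_A, sig_B)      # assumes uncorrelated errors on A and B
n_sigma = (omega_sgr - omega_hip) / math.hypot(sig_hip, sig_sgr)

v_c = 8.0 * omega_hip                   # km/s, first of equations (10.31)
slope = -(A + B)                        # km/s/kpc, second of equations (10.31)

print(round(omega_hip, 2), round(sig_hip, 2), round(n_sigma, 2))
print(round(v_c, 1), round(slope, 1))
```

Under this assumption the two determinations of $A - B$ differ by a little over two standard deviations, and the Hipparcos values give $v_c \simeq 218\,{\rm km\,s^{-1}}$ with a slope of about $-2.4\,{\rm km\,s^{-1}\,kpc^{-1}}$.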
10.4 The structure of the stellar disk


In the last section we saw that different spectral types have characteristically 
different velocity dispersions. The discovery of this and related correlations in 
the 1950s constituted one of the major advances in our understanding of the 
structure and history of the Milky Way. In this section we investigate these 
correlations in detail and explore their implications for Galactic structure 
and evolution. 

We speculated above that the correlations between spectral type and 
kinematics might be connected with the different mean ages of stars of dif- 
ferent spectral types. We now pursue this hint by looking at samples of stars 
to which ages can be assigned. 


10.4.1 Ages and metallicities of nearby stars 


The stars which can most readily and reliably be assigned ages are dwarfs 
of MK type G and earlier: such a star begins to turn off the ZAMS within 
10 Gyr (§5.1), and as it moves away from the ZAMS it can be dated from 
its position in the CM diagram. Moreover, an interesting upper limit can 
be placed on the age of any such star that is still on the ZAMS. For our 
present purposes, F and G stars are most interesting because their lifetimes 
are comparable to the long time-scales characteristic of Galactic evolution. 
Edvardsson et al. (1993) report the results of a major study of such 
objects. They obtained high-quality spectra and space velocities for 189 
nearby F and G dwarfs. By fitting sophisticated model atmospheres to their 
spectra they obtained for each star an age and the abundances of the elements 
O, Na, Mg, Al, Si, Ca, Ti, Fe, Ni, Y, Zr, Ba and Nd. From §5.2.1 we
recognize Na and Al as odd-light elements, Mg, Si and Ca as α elements,
Ti, Fe and Ni as iron-peak elements and Y, Zr, Ba and Nd as s-process
elements. Their sample of stars was selected by searching a large catalog
of stars with Strömgren uvbyβ photometry (§2.3.2) for stars of appropriate
spectral type and age. Consequently, no inferences can be drawn from their 
results concerning the fractions of stars that have given ages or abundances, 
but currently they provide some of the best information regarding the mutual 
dependencies between abundances, velocity dispersion and age. 


Correlations between abundances The left-hand panel of Figure 10.16 
demonstrates that the abundances of the light and heavy s-process elements 
Y and Ba are tightly correlated, as theory predicts. The right-hand panel of 
Figure 10.16 shows that the abundances of secondary elements such as Ba 
increase faster than those of primary elements such as Mg; the slope of the
mean relation in this panel implies that the abundance of Ba varies as the 
abundance of Mg to the power 1.7. In §5.2.1 we argued that, in the absence 
of neutron poisons, the slope of this relation would be 2, and will be smaller 
if poisons are present. Therefore a slope of 1.7 is entirely consistent with 
theory. 
Figure 10.16 Left panel: the abundances of yttrium and barium in disk stars are very
tightly correlated. This is interesting in itself and also, crucially, demonstrates that the
abundances derived by Edvardsson et al. have small experimental errors. Right panel:
the secondary element Ba increases faster than the primary element Mg. Symbols in each
panel indicate range of $R_m$ as in Figure 10.18. [After Edvardsson et al. (1993) from data
kindly supplied by B. Edvardsson]


We saw in §5.2 that plots of relative abundances such as [O/Fe] versus 
[Fe/H] provide clues as to the sources of heavy-element enrichment. In
particular, the abundances of O and the α-elements such as Mg are expected to
decrease relative to Fe with increasing [Fe/H] because they are quickly
synthesized by core-collapse supernovae, whereas the iron-group elements are
predominantly produced by type Ia supernovae, whose progenitors can be
long-lived. Figure 10.17 shows that this expectation is amply borne out in
the Edvardsson et al. data: in the most metal-poor stars, [O/Fe] and [Mg/Fe]
are both larger by $\sim 0.4$ than in the most metal-rich stars. By contrast, the
abundance of the iron-peak element Ni varies in step with [Fe/H], exactly as 
theory predicts. 


Correlations between age and abundance Figure 10.18 shows that
younger stars tend to be more metal-rich. However, there is a considerable
spread in metallicity at a given age. This spread cannot be attributed
to observational error: the tight correlation in the left-hand panel of
Figure 10.16 would not have been found if there were substantial uncertainties
in the metallicity estimates.¹⁰ Notice in the upper panel in Figure 10.18 that
stars that have a higher iron abundance than the Sun have been forming for
at least 9 Gyr. Nonetheless, $\tau$ and [Fe/H] are correlated for two reasons:
(i) stars older than 10 Gyr almost all have [Fe/H] $< -0.5$;¹¹ and (ii) stars
less than 5 Gyr old almost all have [Fe/H] $\gtrsim -0.5$. The pattern of barium
abundances shown in the lower panel of Figure 10.18 is similar.


¹⁰ Much of the small scatter in Figure 10.16 is also cosmic.
¹¹ Edvardsson et al. caution that the paucity of old, metal-rich stars may in part reflect
their selection criteria, which discriminate against them.




[Figure 10.17 plot: three stacked panels sharing the horizontal axis [Fe/H]]

Figure 10.17 The dependence upon the abundance of Fe of the ratios of Ni, O and Mg 
to Fe. The ratio of Ni to Fe is flat because these elements are produced alongside one 
another. The ratios of O and Mg to Fe decline with increasing Fe abundance because 
O and Mg, which are produced by short-lived stars, were formed before the disk became 
heavily polluted by iron from type Ia supernovae, which have long-lived progenitors. [After 
Edvardsson et al. (1993) from data kindly supplied by B. Edvardsson] 




From their space velocities and a model of the Milky Way’s potential,
Edvardsson et al. calculated the apo- and peri-centric radii of each star and
defined its characteristic Galactocentric radius, $R_m$, to be the average of
these quantities. If, as is probable, stars form on nearly circular orbits and
only later acquire random velocities, $R_m$ may be a reasonable estimate of the
radius of a star’s original orbit. In Figures 10.16 to 10.18 different symbols
indicate the range within which each star’s $R_m$ lies. From the concentration
of filled dots at large $\tau$ it is evident that the oldest stars in this sample have
the smallest characteristic Galactocentric radii. This correlation suggests
that the disk of the Milky Way did not extend as far as the Sun 10 Gyr ago,
and that we only see very old stars in the solar neighborhood because old
stars that formed at small radii sometimes pay us a fleeting visit.

Figure 10.18 The dependence upon age of the abundances of iron
and barium. The characteristic Galactocentric radius of each star,
$R_m$, is indicated by different symbols: ● ⇒ $R_m < 7$ kpc; ○ ⇒
7 kpc $< R_m <$ 9 kpc; × ⇒ $R_m > 9$ kpc. [After Edvardsson et al.
(1993) from data kindly supplied by B. Edvardsson]


The most metal-poor stars in Figure 10.18 have the smallest values of
$R_m$ and it has been suggested (Grenon 1989) that this could be because at
any given radius the metallicity of newly formed stars is always the same, but
is smaller at smaller radii. In this theory, the apparent correlation between
metallicity and age arises because only old stars have acquired sufficient ran- 
dom velocity to visit the solar neighborhood from small radii. The a priori 
probability of this theory is low, given that it requires metallicity to increase 
outwards, whereas all the indications are that metallicity decreases outwards 
in external galaxies (§§4.4.5 and 8.2.6), but it does nicely illustrate the dif- 
ficulty of reaching firm conclusions from studies of the solar neighborhood 
alone. All too frequently several physically distinct processes are active si- 
multaneously, and it is not clear what is cause and what is side-effect. 

To distinguish between temporal and metallicity gradients, Edvardsson
et al. examine means and dispersions of various quantities in each of 20 cells
in the $(R_m, \tau)$ plane. Since their entire sample consists of only 189 stars, the
statistical fluctuations within some cells are uncomfortably large. However,
they conclude that at a given radius and age there is a significant dispersion
in [Fe/H]. The standard deviation of this scatter increases from $\sim 0.15$ at
2 Gyr to $\sim 0.25$ for the oldest stars. This conclusion is strongly confirmed by
the observation that the Orion nebula, a major site of star formation at Ro 
in our own epoch, is significantly metal-weaker than the Sun, which formed 
about 5 Gyr ago. Moreover, we shall see in §10.4.2 below that studies of star 
clusters in the disk indicate that clusters of similar ages and Galactocentric 
radii differ significantly in [Fe/H]. 

Figure 10.19 $|v_z|$ versus $\tau$ (left-hand panel) and $|v_z|$ versus [Fe/H]
(right-hand panel). [After Edvardsson et al. (1993) from data
kindly supplied by B. Edvardsson]


To investigate further the relative importance of different enrichment
mechanisms at various times and places, Edvardsson et al. define a quantity

$$
[\alpha/{\rm Fe}] \equiv \tfrac{1}{4} \left( [{\rm Mg/Fe}] + [{\rm Si/Fe}] + [{\rm Ca/Fe}] + [{\rm Ti/Fe}] \right). \tag{10.39}
$$


They find that there is negligible scatter in [α/Fe] in a given cell of the
$(R_m, \tau)$ plane. Given the finding above that [Fe/H] does show scatter, this
conclusion is important. On the one hand, the scatter in [Fe/H] indicates that
the ISM at a given radius is imperfectly mixed, simultaneously containing
regions that differ in [Fe/H] by a factor 2. On the other hand, the absence
of scatter in [α/Fe] implies that the products of a massive supernova are
thoroughly stirred into the ISM before significant star formation can occur
in the region enriched by it.
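
As a concrete illustration of these quantities, the sketch below evaluates the mean of the four abundance ratios, [α/Fe] = ([Mg/Fe] + [Si/Fe] + [Ca/Fe] + [Ti/Fe])/4, for one hypothetical star, and converts between a logarithmic abundance difference in dex and the corresponding linear factor. The four abundance values are invented for illustration and are not taken from the Edvardsson et al. data.

```python
import math

def alpha_fe(mg_fe, si_fe, ca_fe, ti_fe):
    """Mean of the four logarithmic abundance ratios, in dex."""
    return 0.25 * (mg_fe + si_fe + ca_fe + ti_fe)

def dex_to_factor(dex):
    """Linear abundance ratio corresponding to a difference in dex."""
    return 10.0 ** dex

def factor_to_dex(factor):
    """Logarithmic difference in dex corresponding to a linear ratio."""
    return math.log10(factor)

# Hypothetical old, metal-poor star (values invented for illustration).
print(round(alpha_fe(0.40, 0.30, 0.20, 0.30), 3))

# A factor 2 in the iron abundance corresponds to about 0.3 dex in [Fe/H].
print(round(factor_to_dex(2.0), 3))
print(round(dex_to_factor(0.25), 3))
```

A factor 2 in abundance is thus about 0.3 dex, comparable to the standard deviations of 0.15 to 0.25 dex in [Fe/H] quoted above.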

The mean values of [α/Fe] tend to increase with age at a fixed value of
$R_m$. For the youngest stars, [α/Fe] $\simeq 0$, and for the oldest, [α/Fe] $\simeq 0.25$.
Since essentially all the oldest stars have small $R_m$, it is not possible to be
sure whether the main dependence of [α/Fe] is upon radius or age, but the
data suggest the latter. In fact, the natural interpretation of these data is
that early in the life of the Milky Way the star formation rate was high, and 
large numbers of stars formed after the ISM had been heavily polluted by 
massive supernovae but before type Ia supernovae had produced much iron. 

Figure 10.20 Velocity dispersion versus age as determined by
Jahreiss & Wielen (1983) (left panel) and by the Danish group
(right panel). Aside from a difference in the age scales used in
these studies, the two determinations are broadly consistent with
one another. [From data published in Jahreiss & Wielen (1983)
and in Strömgren (1987)]


Let us now return to random velocities. The variation of random velocity
with age is harder to study than the corresponding metallicity variations
because one cannot measure the velocity dispersion of a single star; many
stars are required for a single determination of $\sigma_z$. Since the Edvardsson
sample contains only 189 stars, subdividing the sample to the required extent
is scarcely practicable. Therefore in Figure 10.19 we plot for each star $|v_z|$
against [Fe/H] and $\tau$. By estimating the vertical extent of the distribution in
this figure at each value of $\tau$, the eye sees at a glance that the dispersion $\sigma_z$
increases with age. The right-hand panel suggests that the dispersion in $v_z$
is fairly constant for [Fe/H] $\gtrsim -0.3$ and increases rapidly with falling [Fe/H]
at [Fe/H] $< -0.3$.
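
What the eye does in Figure 10.19 can be imitated by binning stars in age and computing the sample standard deviation of $v_z$ in each bin. The sketch below does this for eight invented stars; the ages, velocities and bin edges are purely illustrative and are not the Edvardsson et al. data. With so few stars per bin the result is very noisy, which is the practical difficulty described above.

```python
import math

def dispersion_by_age(stars, edges):
    """Sample standard deviation of v_z in each age bin.

    stars is a list of (age_gyr, v_z_kms); edges are the bin boundaries.
    Bins with fewer than two stars are skipped.
    """
    result = {}
    for lo, hi in zip(edges[:-1], edges[1:]):
        v = [vz for age, vz in stars if lo <= age < hi]
        if len(v) >= 2:
            mean = sum(v) / len(v)
            var = sum((x - mean) ** 2 for x in v) / (len(v) - 1)
            result[(lo, hi)] = math.sqrt(var)
    return result

# Invented toy sample: (age in Gyr, v_z in km/s).
toy = [(1.0, 5.0), (1.5, -8.0), (2.0, 6.0), (2.5, -7.0),
       (6.0, 15.0), (7.0, -20.0), (8.0, 18.0), (9.0, -12.0)]

sigma = dispersion_by_age(toy, [0.0, 4.0, 10.0])
for (lo, hi), s in sorted(sigma.items()):
    print(lo, hi, round(s, 2))
```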


Additional light is cast on this problem by samples of stars that have accu- 
rate space velocities and estimates of age and/or metallicity that are based 
on low-dispersion spectra or intermediate-band photometry rather than on 
high-dispersion spectra. Such data are available for several thousand stars. 
Jahreiss & Wielen (1983) estimated the ages of stars in the Gliese Catalog by
one of two methods. For early-type stars they estimated ages from the stars’
positions in the CM diagram (all these stars have reliable distances). The
ages of those K and M stars which were in the McCormick Catalog (Vyssotsky
1963) were estimated from the strength of their Ca H and K lines. The
left-hand panel of Figure 10.20 shows their result: both $\sigma_R$ and $\sigma_z$ increase
steadily with age to quite large velocities ($\sigma_R \simeq 60\,{\rm km\,s^{-1}}$, $\sigma_z \simeq 30\,{\rm km\,s^{-1}}$).

Data from a large Danish program of uvbyβ photometry provide an
entirely independent check on the trend of velocity dispersion with age shown
in Figure 10.20 as well as a check on the variation of velocity dispersion
with metallicity that is suggested by Figure 10.19. The right-hand panel
of Figure 10.20 shows the relation between age and velocity dispersion that
emerges from this study. The adopted age-scale differs from that used by
Jahreiss & Wielen by a factor of almost 1.5, but when allowance is made for
this difference, the age–velocity-dispersion relations shown in the left- and
right-hand panels of Figure 10.20 are broadly consistent.


Figure 10.21 The dispersions in $v_R$ and $v_z$ and the mean age
$\langle\tau\rangle$ of 1294 stars binned by metallicity. [From data published in
Strömgren (1987)]

The lower panel in Figure 10.21 shows the variation of $\sigma_R$ and $\sigma_z$ with
[Fe/H] in data gathered by the Danish group. Both components of velocity
dispersion increase as [Fe/H] decreases, very much as Figure 10.19 suggests. 
The upper panel in Figure 10.21 shows the mean age of the stars in each 
metallicity group. We see that metal-poorer stars tend to be older, and, as 
Figure 10.20 implies, older stellar populations have larger velocity disper- 
sions. Hence, Figures 10.20 and 10.21 tell a consistent story. 

From Figures 10.19, 10.20 and 10.21, it is clear that the velocity
dispersion of a group of stars increases with its age and decreases with its
metallicity. What is the correct physical interpretation of this result? Are
metal-poor stars born with larger random velocities and at earlier times than
metal-rich stars? Or are both metal-rich and metal-poor stars born with
small random velocities and then gradually increase their random velocities
as a result of some dynamical process that is at work within the disk? In
this second case, the observed correlation between random velocities and
metallicity might be simply a consequence of the tendency of metal-rich
stars to be, on the average, younger than metal-poor ones.


Figure 10.23 The lower panel plots values of [Fe/H] for 44 old
disk clusters against estimates of the clusters’ ages. The upper
panel plots [Fe/H] against the Galactocentric radius of each cluster.
[From data published in Friel (1995)]

Figure 10.22 suggests that neither of these pictures represents the whole
truth by showing the variation with $\tau$ of $\sigma_R$ and $\sigma_z$ for a sample of relatively
metal-rich stars ($-0.15 <$ [Fe/H] $< 0.15$). Both components of velocity
dispersion increase with age for $\tau \lesssim 7$ Gyr, but there is a suggestion that
they level off at $\sigma_R \simeq 35\,{\rm km\,s^{-1}}$ and $\sigma_z \simeq 16\,{\rm km\,s^{-1}}$ for $\tau > 7$ Gyr. These
asymptotic values of the velocity dispersions are almost a factor of 2 lower
than the dispersions of the oldest stars in Figure 10.20. Hence it seems
that by merely hanging around in the disk, stars do not acquire velocity
dispersions as large as those observed in the more metal-weak stellar groups.
This conclusion is very much in line with the predictions of the theory of
the stochastic acceleration of disk stars; see §7.5 of BT. Indeed Jenkins
(1992), in a very complete and careful study of this problem, concluded
that a combination of excitation by spiral structure and scattering by giant
molecular clouds can nicely explain the trends seen in Figure 10.22, but
cannot plausibly account for the existence of groups of stars with
$\sigma_R \simeq 60\,{\rm km\,s^{-1}}$.
10.4.2 The old disk clusters


As we saw in Chapter 6, a star cluster can be dated relatively straightforwardly
from the structure of its CM diagram, and of course we can obtain a reason- 
able estimate of the mean metallicity of its stars. Moreover, many, perhaps 
most, field stars in the disk were formed in a cluster or association, which 
may by now have completely dissolved. Hence studies of old disk clusters 
provide valuable information about the history of the disk. 

In the lower panel of Figure 10.23 we plot [Fe/H] for 44 old disk clusters
against estimates of the clusters’ ages $\tau$. It is clear that clusters of very
different metallicities have formed at similar times. The upper panel suggests
that the further a cluster is from the Galactic center, the lower its metallicity
is likely to be, and it is found that metallicity also tends to decrease with
increasing distance from the plane. In fact, from a study of red giants in 63
open clusters, Piatti, Claria & Abadi (1995) estimate that

$$
\langle [{\rm Fe/H}] \rangle = 0.6 \pm 0.1 - (0.07 \pm 0.01) \frac{R}{\rm kpc}
  - (0.34 \pm 0.03) \frac{|z|}{\rm kpc} - (0.002 \pm 0.005) \frac{\tau}{\rm Gyr}. \tag{10.40}
$$

Notwithstanding these trends, clusters with very different values of [Fe/H] 
can be found at any given distance from the center and/or height above 
the plane (Nissen 1988, Boesgaard 1989, Garcia Lopez, Rebolo & Beckman 
1988). Thus old disk clusters display the same chemical inhomogeneity and 
mild metallicity gradients that we encountered above in the Edvardsson et 
al. (1993) sample of field stars. This finding adds weight to the idea that field 
stars were formed in clusters and subsequently either escaped their parent 
cluster, or became free when the cluster dissolved. 
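
The relative sizes of the three gradients in equation (10.40) are easiest to judge by evaluating the relation at a few points. The sketch below uses the central values only and reads the three gradient terms as multiplying $R/{\rm kpc}$, $|z|/{\rm kpc}$ and $\tau/{\rm Gyr}$ respectively; that reading of the variables is an assumption on our part, and the sample positions and ages are arbitrary.

```python
def mean_feh(R_kpc, z_kpc, age_gyr):
    """Central values of the mean cluster metallicity relation (10.40).

    Assumes the gradients apply to Galactocentric radius R, height |z|
    above the plane, and age, in kpc, kpc and Gyr respectively.
    """
    return 0.6 - 0.07 * R_kpc - 0.34 * abs(z_kpc) - 0.002 * age_gyr

print(round(mean_feh(8.0, 0.0, 5.0), 3))    # in the plane at R = 8 kpc
print(round(mean_feh(12.0, 0.0, 5.0), 3))   # 4 kpc further out
print(round(mean_feh(8.0, 0.5, 5.0), 3))    # 0.5 kpc from the plane
print(round(mean_feh(8.0, 0.0, 9.0), 3))    # 4 Gyr older
```

On this reading, moving 4 kpc outward or 0.5 kpc from the plane changes the mean [Fe/H] by 0.28 and 0.17 dex, whereas an age difference of 4 Gyr changes it by less than 0.01 dex, well within the quoted uncertainty of the age coefficient.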


10.4.3 Star counts and the thick disk 


The variation with distance $z$ from the Galactic plane of the density of
various types of stars can be determined by counting faint stars in fields centered
on a Galactic pole. The classic study of this problem is that of Gilmore &
Reid (1983), who determined the photometric distances of more than 12000
stars over 18 square degrees of the sky from photographic plates of the south
Galactic pole that had been taken with a Schmidt telescope. Their key plates
were in the $V$ and $I$ bands since they wished to obtain accurate results for
lower-MS stars, which are red in color. On these plates, an automatic
measuring machine determined the $V$ and $I$ magnitudes of all objects brighter
than $V = 19$ and $I = 18$. The measuring machine was able to determine
rather reliably which of these relatively bright images were fuzzy and
therefore galaxy images; in this way it produced a list of $\sim 12500$ stars with
known $V$ and $I$ magnitudes. Reasonably accurate absolute trigonometric
parallaxes, and therefore absolute $V$-band magnitudes $M_V$, were available for about
200 of these stars. These values of $M_V$ confirmed that most of the stars in
the sample are MS stars, and defined the relationship between $M_V$ and $V - I$
in the photometric system employed. Most of the stars that are giants rather
than MS stars will have color $V - I \simeq 2.3$, which on the MS is associated
with $M_V \simeq 6.5$. Since in the absence of spectra the analysis must proceed
under the assumption that all stars are dwarfs, the inferred stellar density
at $M_V \simeq 6.5$ is most suspect, and should be treated with caution.

Clearly, all stars with trigonometric parallaxes are located close to the
Sun, and it is not clear that the $M_V/(V - I)$ relation defined by such stars
applies well away from the plane. This doubt arises because the mean
metallicity of stars might be expected to decrease with distance $z$ from the plane,
and we know that a metal-poor MS star is fainter than a metal-rich one of
the same color (see §5.1.5). Since Gilmore & Reid did not know the
magnitude of the vertical metallicity gradient in the disk, they analyzed their
data twice, once under the assumption that the gradient was negligible, and
once under the assumption that $d[{\rm Fe/H}]/dz = -0.3\,{\rm kpc^{-1}}$ for $z < 5$ kpc and
vanishes at larger $z$. Fortunately, most of their conclusions stand for either
assumed metallicity gradient. For simplicity the results presented below are
for the case of no metallicity gradient.

Under the assumptions that every star is a MS star and that there is
negligible vertical metallicity gradient, the distance to each star can be
trivially derived from equation (2.36). When a metallicity gradient is assumed,
distances are obtained iteratively. $M_V$ is first obtained from $V - I$ under
the assumption of solar metallicity. Then $z$ is inferred from equation (2.36),
and $M_V$ is changed to the value appropriate to the metallicity of the stars
at the derived value of $z$. Now $z$ is updated, and the cycle repeated until the
derived values of $M_V$ and $z$ have converged.
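
The iteration just described can be sketched in a few lines. The sketch takes equation (2.36) to be the distance modulus $V - M_V = 5\log_{10}(d/10\,{\rm pc})$ with no extinction, and sets $z = d$ for a field at the pole. The linear color–magnitude relation `mv_solar` and the metallicity correction `DM_PER_DEX` are hypothetical placeholders, not the calibration used by Gilmore & Reid; only the gradient of $-0.3\,{\rm kpc^{-1}}$ out to 5 kpc comes from the text.

```python
GRAD = -0.3        # d[Fe/H]/dz in dex per kpc, applied for z < Z_MAX
Z_MAX = 5.0        # kpc; the gradient vanishes beyond this height
DM_PER_DEX = 1.0   # hypothetical: magnitudes fainter per dex of metal deficiency

def mv_solar(v_minus_i):
    """Hypothetical solar-metallicity MS relation M_V(V - I)."""
    return 2.0 + 3.5 * v_minus_i

def feh_at(z_kpc):
    """[Fe/H] at height z for the assumed vertical gradient."""
    return GRAD * min(z_kpc, Z_MAX)

def distance_kpc(V, M_V):
    """Distance from the distance modulus, ignoring extinction."""
    return 10.0 ** ((V - M_V + 5.0) / 5.0) / 1000.0

def photometric_height(V, v_minus_i, tol=1e-6, max_iter=200):
    """Iterate M_V and z to convergence; returns (z_kpc, M_V)."""
    z = distance_kpc(V, mv_solar(v_minus_i))       # first guess: solar metallicity
    for _ in range(max_iter):
        M = mv_solar(v_minus_i) - DM_PER_DEX * feh_at(z)   # fainter if metal-poor
        z_new = distance_kpc(V, M)
        if abs(z_new - z) < tol:
            return z_new, M
        z = z_new
    raise RuntimeError("iteration did not converge")

z_solar = distance_kpc(17.0, mv_solar(1.0))
z_grad, M_grad = photometric_height(17.0, 1.0)
print(round(z_solar, 3), round(z_grad, 3), round(M_grad, 3))
```

For this example star the assumed gradient makes the star intrinsically fainter and therefore brings it closer, from about 2.0 kpc to about 1.6 kpc. Simple fixed-point iteration converges here because the correction changes slowly with $z$; a steeper correction might call for a damped update.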

Once a value of $z$ has been associated with each star, a table can be
constructed in which each entry shows the number of stars observed in given
small ranges of $M_V$ and $z$. Down a column of fixed $z$, this table gives the
luminosity function $\Phi(M_V)$ that applies at a given distance from the plane.
Along a row of fixed $M_V$, the table gives the number of stars of a given kind
that are observed in thin slices of constant $z$ across the cone of observation;
clearly, this cone has the Sun at its apex.
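
Turning such a table into space densities requires the volume of each slice of the cone. The sketch below treats the survey as a narrow cone of solid angle $\Omega$ pointing at the pole, so that a slice between heights $z_1$ and $z_2$ has volume $\Omega (z_2^3 - z_1^3)/3$; this small-field approximation, the bin widths and the four toy stars are our assumptions.

```python
import math
from collections import Counter

def slice_volume_pc3(z_lo, z_hi, area_deg2):
    """Volume (pc^3) of a slice of a narrow cone with its apex at the Sun."""
    omega = area_deg2 * (math.pi / 180.0) ** 2     # solid angle in steradians
    return omega * (z_hi ** 3 - z_lo ** 3) / 3.0

def count_table(stars, dM, dz):
    """Count stars in bins of M_V and z; stars is a list of (M_V, z_pc)."""
    return Counter((math.floor(M / dM), math.floor(z / dz)) for M, z in stars)

def density(table, key, dM, dz, area_deg2):
    """Stars per pc^3 per magnitude in the bin labelled by key = (i, j)."""
    i, j = key
    volume = slice_volume_pc3(j * dz, (j + 1) * dz, area_deg2)
    return table[key] / (volume * dM)

# Invented toy sample of (M_V, z in pc); the survey covered 18 square degrees.
toy = [(4.2, 150.0), (4.7, 180.0), (4.9, 420.0), (6.1, 160.0)]
table = count_table(toy, dM=1.0, dz=200.0)

print(table[(4, 0)], table[(4, 2)], table[(6, 0)])
print(density(table, (4, 0), 1.0, 200.0, 18.0))
```

Because the slice volume grows as $z^3$, a fixed number of stars per bin corresponds to a rapidly falling density with height.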

Figure 10.24 shows the luminosity functions derived by Gilmore & Reid
(1983) for different distances $z$ from the plane; each luminosity function
is labeled by the logarithm of $(z/1\,{\rm pc})$. At large values of $z$, the luminosity
function is determined only at bright magnitudes because less luminous stars
fall below the apparent magnitude limit of the survey. Although the different
lengths of the curves in Figure 10.24 make it hard to compare the shapes
of the underlying luminosity functions, it is apparent that at $\log(z/1\,{\rm pc}) \lesssim 2.45$
the luminosity function falls less steeply at bright magnitudes than it
does further from the plane. Physically, stars brighter than $M_V \simeq 3.5$ are
more strongly confined to the plane than are fainter stars. Since stars with
$M_V \lesssim 3.5$ have lifetimes that are significantly shorter than the Hubble time,
they constitute a relatively young population. From Figure 10.22 we know
young stellar populations have smaller velocity dispersions. Thus these stars
are strongly confined to the plane because when they pass through the plane
they have insufficient kinetic energy to rise high above the plane.


[Figure 10.24 plot: $\log\Phi$ against $M_V$]

Figure 10.24 Each curve shows the luminosity function that was
determined by Gilmore & Reid (1983) for the distance in parsecs
from the plane whose logarithm is given at the right-hand end of
the curve. Notice that the three uppermost curves fall less steeply
to the right of $M_V \simeq 3.5$ than do the lower curves.

Figure 10.25 The space density as a function of distance $z$ from
the plane of MS stars with absolute magnitudes $4 < M_V < 5$. The
full lines are exponentials with scale heights $z_0 = 300$ pc (at left)
and $z_0 = 1350$ pc (at right). The dashed curve shows the sum of
these two exponentials. [From data published in Gilmore & Reid
(1983)]


Figure 10.25 shows the density profile $\nu(z)$ derived by Gilmore & Reid
for stars with absolute magnitudes in the range 4 to 5. This profile clearly
differs from an exponential $\nu \propto e^{-z/z_0}$, which in this log-linear plot would
be a straight line. The dashed curve in Figure 10.25 shows that the vertical
density profile can, however, be accurately represented as the sum of two
exponentials, one of which has scale height $z_0 = 300$ pc and dominates the
overall density distribution at $z < 1200$ pc, while the other exponential has
scale height $z_0 = 1350$ pc. The vertical density profiles of less luminous stars
can also be successfully modeled as a sum of these two exponentials, although
the evidence for the exponential with scale height $\sim 1350$ pc is weak or absent
for stars much less luminous than $M_V = 5$ because the density profile of these
stars cannot be followed far from the plane. The vertical density profiles of
stars that are more luminous than $M_V \simeq 3$ require a significantly smaller
scale height than $z_0 = 300$ pc for the exponential that dominates the model
density near to the plane. Moreover, for these luminous, short-lived stars
the requirement for a second exponential with a large scale height is not
clear-cut. Ojha et al. (1996) found that the vertical distribution of stars
fainter than $M_V = 3.5$ is well represented by the sum of two exponentials,
one having scale height $z_0 = 260 \pm 50$ pc and the other having scale height
$z_0 = 760 \pm 50$ pc. In the plane the luminosity density of the second component
is smaller than that of the first by a factor of about 13.5, which implies that $\sim 20\%$
of the luminosity of the disk resides in the thicker component.
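
The arithmetic behind the $\sim 20\%$ figure is short. For a density $\nu_0 e^{-z/z_0}$ the column through the disk is proportional to $\nu_0 z_0$, so the fraction in the thick component follows from the in-plane density ratio and the two scale heights. The sketch below uses the Ojha et al. scale heights of 260 pc and 760 pc and takes the in-plane ratio to be 13.5 (our reading of the quoted factor). It also assumes that luminosity traces star counts equally in both components.

```python
import math

def thick_fraction(ratio, h_thin, h_thick):
    """Fraction of the total column in the thick component.

    ratio is the in-plane density of the thin component divided by that
    of the thick component; each column is proportional to density x height.
    """
    thin_column = ratio * h_thin
    thick_column = 1.0 * h_thick
    return thick_column / (thin_column + thick_column)

def crossover_height(ratio, h_thin, h_thick):
    """Height at which the two exponentials have equal density."""
    return math.log(ratio) / (1.0 / h_thin - 1.0 / h_thick)

RATIO = 13.5                    # assumed in-plane density ratio (thin / thick)
H_THIN, H_THICK = 260.0, 760.0  # pc

print(round(thick_fraction(RATIO, H_THIN, H_THICK), 3))
print(round(crossover_height(RATIO, H_THIN, H_THICK), 1))
```

With these inputs the thick component holds a little under one fifth of the column and overtakes the thin component roughly 1 kpc from the plane.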


The thick disk

The fact that the vertical density profile $\nu(z)$ cannot be
fitted by a single exponential has two possible physical interpretations. In
the simplest picture, the Galaxy possesses a single disk, and it just happens
that the vertical density profile of this disk is non-exponential. A more
elaborate interpretation is that the Galaxy possesses both thin and thick disks. In
this picture both disks have approximately exponential profiles $\nu_i(z)$, and
the observed profile is non-exponential because it is dominated by the thin 
disk at small z and the thick disk at large z. Since we know of no reason 
why the vertical profile of a disk should be exponential rather than follow- 
ing some other functional form, the non-exponential nature of the observed 
density profile by itself hardly calls for decomposing the observed disk into 
two component disks. What would justify such a decomposition is evidence 
that the two disks are made of different kinds of stars. Hence we now ask 
whether it is possible to assign stars to either one disk or the other according
to their intrinsic properties, such as age or metallicity.

Figure 10.26 A potential division of stars between the thin and
thick disks. Stars above and to the left of the dotted lines are
assigned to the thin disk, while those below and to the right of the
lines are assigned to the thick disk.

Notice that to be meaningful, the division of stars between the thin
and thick disks must be independent of either position or velocity. To see
this, consider the possibility of decomposing a spherical stellar system whose
density profile $\nu(r)$ follows the modified Hubble law [equation (4.13)] into
a Plummer law $\nu(r) = \nu_0 (1 + r^2/r_0^2)^{-5/2}$ and the difference between this
profile and the Hubble law. This decomposition could be made, and then 
the observed system would be dominated by the Plummer component at 
small radii and the difference component at large radii. Nonetheless, if all 
the system’s stars were identical, it would be a senseless exercise that could 
only be inspired by some theoretical prejudice in favor of Plummer models. 
On the other hand, if the stars that comprised the Plummer model were all 
older than the remaining stars, the division into two components would be 
natural and convey important information about the manner of the system’s 
formation. 

A key point about the first, artificial, division of the modified-Hubble 
system is that it is not unique: if all stars are identical, a given star could
only be assigned a probability of belonging to one component rather than the
other on the basis of its position and velocity. For example, a star of low energy, which
is confined to small radii, would have a high probability of belonging to the 
Plummer component, while a high-energy star would have a high probability 
of belonging to the other component. However, two stars of identical ener- 
gies but different component assignments could always be swapped from one 
component to the other; it would be in principle impossible to know which 
was the ‘correct’ assignment. By contrast, if the stars of one component are 
older than those of the other, any uncertainty in the assignments will merely 
reflect imperfections in the data. 
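The probabilistic nature of the artificial decomposition can be made concrete. The sketch below assumes that the modified Hubble law has the form $\nu \propto (1 + r^2/r_0^2)^{-3/2}$ and that the Plummer component shares its central density and scale radius; both are assumptions made for illustration. It uses position only, whereas the argument above also involves velocity. With these choices the probability that a star at radius $r$ belongs to the Plummer component is simply the ratio of the two densities.

```python
def hubble_density(r, nu0=1.0, r0=1.0):
    """Modified Hubble law, assumed here to be nu0 * (1 + r^2/r0^2)^(-3/2)."""
    return nu0 * (1.0 + (r / r0) ** 2) ** -1.5

def plummer_density(r, nu0=1.0, r0=1.0):
    """Plummer law, nu0 * (1 + r^2/r0^2)^(-5/2)."""
    return nu0 * (1.0 + (r / r0) ** 2) ** -2.5

def plummer_probability(r, r0=1.0):
    """Probability that a star at radius r is assigned to the Plummer part."""
    return plummer_density(r, r0=r0) / hubble_density(r, r0=r0)

for r in (0.1, 1.0, 3.0, 10.0):
    print(f"r/r0 = {r:5.1f}   P(Plummer) = {plummer_probability(r):.3f}")
```

The probability falls from nearly unity at the center to one half at $r = r_0$ and to about 1% at $r = 10 r_0$, but at no radius does it tell us which individual star belongs to which component.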

With this example in mind, let us now return to the data plotted in 
Figures 10.17 to 10.19 to see whether we can distinguish stars of the thin 
disk from those of the thick disk. Figure 10.26 shows one of these data 
sets with a potential division into thin- and thick-disk stars. According to 
this division, thick-disk stars are older than 10 Gyr and more metal-poor
than [Fe/H] = −0.4. The great majority of the remaining stars, which
are by definition all younger than 10 Gyr, are also more metal-rich than
[Fe/H] = −0.4. We identify stars younger than 10 Gyr and more metal-rich
than [Fe/H] = −0.4 with the thin disk and reserve judgment on the small
number of stars that fall outside these two areas of Figure 10.26.
Interestingly, the middle panel of Figure 10.17 shows that nearly all stars with
[Fe/H] > −0.4 have [Mg/Fe] < 0.2, while most of the more metal-poor stars
have significantly larger over-abundances of the α-element Mg. This division
suggests that type Ia supernovae contributed significantly fewer iron-peak
elements to thick-disk stars than they did to thin-disk stars.

Table 10.4 Characteristic velocities of the disks (all values in km s^-1)

               ⟨v_φ⟩     σ_R     σ_φ     σ_z
Thin disk        −6       34      21      18
Thick disk      −36       61      58      39

SOURCE: Data published in Edvardsson et al. (1993)

Now that we have divided stars between the disks, it is interesting to turn 
to Figure 10.19 to see what the kinematic consequences of this division are. 
Although the case is not compelling, a widening of the velocity distribution 
to the right of $\tau = 10$ Gyr is discernible. This impression is confirmed by
calculating the velocity dispersions of stars to the right and left of the line
$\tau = 10$ Gyr: the 44 stars to the right of this line have a dispersion of 38 km s$^{-1}$,
while those to the left have a dispersion of 21 km s$^{-1}$. Indeed, even if we
exclude from the left-hand group stars with $\tau < 3$ Gyr, where there appears
to be an abrupt increase in the velocity dispersion, the velocity dispersion
of the left-hand group is still only 22 km s$^{-1}$, little over half that of the
right-hand group. Finally, if we refer all velocities to the LSR as defined by 
equation (10.11) and calculate the velocity dispersions of just the stars that 
can be definitely assigned to one disk or the other according to the definition 
of Figure 10.26, we find the numbers given in Table 10.4, which may be 
considered to be characteristic of the two disks. 

Is the thick disk a real physical entity or merely a useful short-hand for 
the tail of the stellar distribution of the disk as a whole? We doubt that this 
question can yet be answered conclusively. Figure 10.26 makes the strongest 
case for the thick disk of which we are aware. While it certainly suggests 
that the thick disk is real, data from just a handful of extra stars could 
substantially weaken the case. A man could not be safely hanged on such 
evidence. 


10.4.4 The local mass density of the disk 


An obvious question is ‘how much of the Galaxy’s mass resides in the disks, 
thick and thin?’ The only secure way to determine the mass of an astronomical
body is to measure the strength of the gravitational field $\mathbf{F}$ to which it
gives rise. In this section we explain how studies of the kinematics of stars
enable us to estimate $\mathbf{F}$ in the solar neighborhood.

Once $\mathbf{F}$ has been determined, we can (in principle) determine the local
mass density, $\rho$, from Poisson's equation,

$$\nabla\cdot\mathbf{F} = -4\pi G\rho. \tag{10.41}$$

If we assume that the Galaxy is approximately axisymmetric at $R_0$, the
natural coordinate system in which to write (10.41) is a cylindrical system
$(R,\phi,z)$. In this system, Poisson's equation for an axisymmetric system
reads

$$\frac{1}{R}\frac{\partial}{\partial R}(R F_R) + \frac{\partial F_z}{\partial z} = -4\pi G\rho. \tag{10.42}$$

Now the circular speed is given by $v_c^2/R = -F_R$, so we may rewrite equation
(10.42) in the form

$$\rho = -\frac{1}{4\pi G}\left(\frac{\partial F_z}{\partial z} - \frac{1}{R}\frac{\partial v_c^2}{\partial R}\right). \tag{10.43}$$

Observations tell us that $v_c$ is approximately independent of $R$ (see §9.2.3),
so the second term on the right-hand side of (10.43) is small compared to
the first (see Problem 10.10). Moreover, simple galactic mass-models suggest
that this term does not depend much on $z$ for $z \ll R$. Therefore it is an
excellent approximation to assume that it is independent of $z$, and to find
on integrating (10.43) with respect to $z$ that the mass per unit area within
distance $z$ of the plane is

$$\Sigma(R,z) = 2\int_0^z dz'\,\rho(R,z') \simeq -\frac{1}{2\pi G}\left(F_z - \frac{z}{R}\frac{\partial v_c^2}{\partial R}\right). \tag{10.44}$$


Here we have assumed that the Galaxy is symmetrical in $z$, with the
consequence that $F_z(R,0) = 0$. Near the plane we expect the mass density to
be dominated by the thin disk, because it is so flattened. Therefore we may
crudely identify, say, $\Sigma(R_0, 3z_0)$, where $z_0$ is the scale height of the thin disk,
with the local surface density of this disk. In a more sophisticated analysis
we can subtract the contribution of any massive halo from $\Sigma(R_0, 3z_0)$ to
obtain a more accurate estimate of the density of the disk; see Problem 10.11
and the discussion below.
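As a rough illustration of how equation (10.44) converts a vertical force into a surface density, the following sketch evaluates it numerically. The input force, the solar radius of 8 kpc, the circular speed of 220 km s$^{-1}$ and the rotation-curve slope are all illustrative assumptions rather than values taken from the text; $G$ is expressed in pc (km/s)$^2$ $M_\odot^{-1}$.

```python
import math

G = 4.30091e-3  # gravitational constant in pc (km/s)^2 / Msun

def surface_density(F_z, z_pc, R_pc, v_c=220.0, dvc_dR=0.0):
    """Sigma(R, z) in Msun/pc^2 from equation (10.44).

    F_z    : vertical force per unit mass at height z, in (km/s)^2 / pc
             (negative above the plane, since it points towards the plane)
    v_c    : circular speed in km/s (assumed value)
    dvc_dR : slope of the rotation curve in km/s per pc (0 for a flat curve)
    """
    dvc2_dR = 2.0 * v_c * dvc_dR          # d(v_c^2)/dR
    return -(F_z - (z_pc / R_pc) * dvc2_dR) / (2.0 * math.pi * G)

# Hypothetical input: a pull of 1.9 (km/s)^2/pc towards the plane at z = 1.1 kpc.
sigma_flat = surface_density(-1.9, 1100.0, 8000.0)
sigma_rising = surface_density(-1.9, 1100.0, 8000.0, dvc_dR=0.002)
print(f"flat rotation curve:   {sigma_flat:.1f} Msun/pc^2")
print(f"rising rotation curve: {sigma_rising:.1f} Msun/pc^2")
```

For a flat rotation curve the correction term vanishes and $\Sigma = |F_z|/2\pi G$; a gently rising curve increases the inferred surface density by a few per cent for the slope assumed here.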

From equation (10.44), it is clear that the key to estimating the surface
density of the layer that lies at $|z'| < z$ is the determination of $F_z(R,z)$.¹²
The physical idea that underlies the determination of $F_z$ is the following. We
identify some population of stars lying ~ $z$ from the plane and determine
both their typical random $z$-velocities $\sigma_z$, and how fast the number density
$\nu$ of this population declines with increasing $|z|$. We then argue that this
decline in $\nu$ arises because the slower stars observed at $z$ have insufficient
kinetic energy to reach $z + dz$, where the potential energy of stars is greater
than at $z$ by $|F_z|\,dz$. Hence the larger $F_z$ is, the more steeply $\nu$ declines with
$|z|$.

12 $F_z$ is often denoted $K_z$, where K stands for Kraft, the German for force.

Unfortunately, the accurate implementation of this simple physical idea 
is fraught with difficulty. The essential problem is that stars have significant 
kinetic energy associated with their radial motions, and this energy should 
not be completely neglected when deciding to what altitude a given star 
can rise. Most studies have neglected the coupling of the radial and vertical 
motions, however. 

The classical study of this problem was made by Oort (1960), who
worked from the Jeans equation (§4.2 of BT)

$$\nu F_z = \frac{\partial(\nu\sigma_z^2)}{\partial z} + \frac{1}{R}\frac{\partial}{\partial R}\left(R\,\nu\,\sigma_{Rz}^2\right). \tag{10.45}$$

Here $\sigma_{Rz}^2$ is the average over the population of the product of velocity
components $v_R v_z$, and the term containing it arises from the coupling between
radial and vertical motions. Oort dropped this term, as did Bahcall (1984)
in an extensive re-examination of the problem that will be described below.

Kuijken & Gilmore (1989) obtained a first estimate of $F_z$ by neglecting
the $v_R$-$v_z$ coupling, and then made a slightly obscure correction for it. We
shall describe their discussion in some detail, as it is the most recent major
study of the problem. Significant commentaries on it can be found in Gould
(1990b) and Kuijken & Gilmore (1991).

For the $z$ motions of stars to decouple from their radial motions, the
gravitational potential $\Phi(R,z)$ must be of the form

$$\Phi(R,z) = \Phi_R(R) + \Phi_z(z), \tag{10.46}$$

where $\Phi_R$ and $\Phi_z$ are suitable functions of one variable. If $\Phi$ is assumed to be
of this form, we can focus on the distribution of stars in the $(z,v_z)$ plane; as
we shall see below, it is in principle possible to determine the location within
this plane of every star in a suitable sample. Since our population should by
now have achieved a steady state, Jeans’ theorem (§4.2.1 of BT) assures us
that the density $f(z,v_z)$ of stars within this plane must be a function of $z$
and $v_z$ only through the energy of $z$-motion,

$$E_z = \tfrac{1}{2}v_z^2 + \Phi_z(z). \tag{10.47}$$

That is,

$$f(z,v_z) = f_z(E_z). \tag{10.48}$$

The stellar number density is

$$\begin{aligned}
\nu(z) &= \int_{-\infty}^{\infty} dv_z\, f(z,v_z) \\
       &= 2\int_{\Phi_z(z)}^{\infty} dE_z\, \frac{f_z(E_z)}{\sqrt{2[E_z-\Phi_z(z)]}},
\end{aligned} \tag{10.49}$$

where the second line has been obtained from the first by a simple change of
integration variable.
Since we may measure $\nu(z)$ fairly accurately, Kuijken & Gilmore argue
that we should regard equation (10.49) as an integral equation for $f_z(E_z)$
given $\nu(z)$. If $\Phi_z(z)$ were known, then $\nu$ could be considered to be a function
$\nu(\Phi_z)$. Equation (10.49) would then become an Abel equation (see §1B.4 of
BT) with solution

$$f_z(E_z) = -\frac{1}{\pi}\int_{E_z}^{\infty} d\Phi_z\, \frac{d\nu/d\Phi_z}{\sqrt{2(\Phi_z - E_z)}}. \tag{10.50}$$

If we were to use our known form $\Phi_z(z)$ and equation (10.47) to eliminate $E_z$
from equation (10.50) in favour of $z$ and $v_z$, we would obtain a prediction for
the entire distribution of the stars in the $(z,v_z)$ plane. This line of argument
motivated Kuijken & Gilmore to proceed as follows. They assumed that $\Phi_z$
is given by

$$\Phi_z(z) = 2\pi G\left[K\left(\sqrt{z^2+D^2} - D\right) + Fz^2\right], \tag{10.51}$$

where $D$, $F$ and $K$ are constants whose physical motivation will emerge
shortly. When this form of $\Phi_z$ is adopted, and the observed form of $\nu(z)$ and
the definition (10.47) are used in equation (10.50), we obtain a prediction
for $f(z,v_z)$ that depends on the parameters $(D,F,K)$. Kuijken & Gilmore
calculated the likelihood¹³ $L$ of their observed distribution of stars in $(z,v_z)$
for each set of values of the parameters. The optimum parameters are those
that maximize $L$. Once $\Phi_z(z)$ has been determined in this way, $F_z(z)$
follows by differentiation of equation (10.51), and then $\Sigma(z)$ can be found by
substitution of $F_z$ into equation (10.44).
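The Abel inversion (10.50) is easy to test numerically. The sketch below is not Kuijken & Gilmore's code. It evaluates the integral by substituting $\Phi_z = E_z + s^2$, which removes the square-root singularity, and compares the result with the analytic distribution function of an isothermal tracer, $\nu \propto \exp(-\Phi_z/\sigma^2)$, chosen here purely as a convenient test case in units where $\nu_0 = \sigma = 1$.

```python
import math

def abel_df(dnu_dphi, E, s_max=10.0, n=2000):
    """f_z(E) from equation (10.50), given the derivative dnu/dPhi_z.

    With Phi_z = E + s^2 the integral becomes
        f_z(E) = -(sqrt(2)/pi) * integral_0^inf dnu_dphi(E + s^2) ds,
    which is truncated at s_max and evaluated with Simpson's rule
    (n must be even).
    """
    h = s_max / n
    total = dnu_dphi(E) + dnu_dphi(E + s_max ** 2)
    for k in range(1, n):
        weight = 4.0 if k % 2 else 2.0
        total += weight * dnu_dphi(E + (k * h) ** 2)
    return -(math.sqrt(2.0) / math.pi) * total * h / 3.0

def dnu_dphi_isothermal(phi):
    """Derivative of nu(Phi_z) = exp(-Phi_z) (isothermal test case)."""
    return -math.exp(-phi)

E = 0.7
numerical = abel_df(dnu_dphi_isothermal, E)
analytic = math.exp(-E) / math.sqrt(2.0 * math.pi)
print(numerical, analytic)
```

The two numbers agree closely, which confirms that (10.50) with its leading minus sign is the inverse of (10.49) for this test case.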

The physical motivation of equation (10.51) is as follows. Differentiating
it we find

$$\frac{|F_z|}{2\pi G} = \frac{Kz}{\sqrt{z^2+D^2}} + 2Fz. \tag{10.52}$$

The first term on the right-hand side rises linearly as $Kz/D$ for $z \ll D$
and tends to $K$ at $z \gg D$. This is the characteristic $z$-dependence of the
gravitational force from a disk whose scale height is of order $D$ and whose
equatorial density is of order $K/D$; $K$ is the surface density of this disk. The
second term on the right of equation (10.52) represents the contribution to $F_z$
of the Galaxy’s spheroidal components. Indeed, the potential $\Phi_S$ of such a
component may be taken to be constant on spheroidal surfaces with axis ratio
$a:c = 1:q$, where $q$ is of order 1. By the chain rule, $\Phi_S(R^2 + z^2/q^2)$
contributes a vertical force

$$|F_S| = \left(\frac{\partial\Phi_S}{\partial m}\right)\frac{2z}{q^2}, \quad\text{where}\quad m = R^2 + z^2/q^2. \tag{10.53}$$


13 The likelihood of given data is simply the unnormalized probability of the data given 
that they are drawn from some model probability distribution.

Figure 10.27 The distribution in the $(z, v_z)$ plane of 472 K dwarfs
seen towards the SGP. [From data published in Kuijken & Gilmore
(1989)]


If we evaluate $(\partial\Phi_S/\partial m)$ at $(R_0, 0)$, equation (10.53) for $F_S$ becomes
identical with the last term in equation (10.52) with $F = (\partial\Phi_S/\partial m)/(2\pi G q^2)$. The
dimensions of $F$ are those of mass density, and we may loosely consider it to
be the local density contributed by the spheroidal components provided we
bear in mind that $F$ does not actually vanish in the complete absence of a
spheroidal component.
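The relation between the assumed potential (10.51) and the force law (10.52), and the two limits of the disk term, can be verified numerically. In the sketch below the values of $K$, $D$ and $F$ are arbitrary illustrative numbers, not fitted parameters, and both functions are written in units of $2\pi G$.

```python
import math

def phi_over_2piG(z, K, D, F):
    """Equation (10.51) divided by 2*pi*G."""
    return K * (math.sqrt(z * z + D * D) - D) + F * z * z

def force_over_2piG(z, K, D, F):
    """Equation (10.52): |F_z| / (2*pi*G)."""
    return K * z / math.sqrt(z * z + D * D) + 2.0 * F * z

# Illustrative parameter values only (Msun/pc^2, pc, Msun/pc^3).
K, D, F = 50.0, 200.0, 0.01

# A central finite difference of (10.51) should reproduce (10.52).
z, dz = 600.0, 1e-3
numeric = (phi_over_2piG(z + dz, K, D, F)
           - phi_over_2piG(z - dz, K, D, F)) / (2.0 * dz)
exact = force_over_2piG(z, K, D, F)

# Disk term alone (F = 0): close to K*z/D near the plane, close to K far above it.
near = force_over_2piG(0.01 * D, K, D, 0.0)
far = force_over_2piG(100.0 * D, K, D, 0.0)
print(numeric, exact, near, far)
```

The finite-difference derivative matches (10.52), the disk term is close to $Kz/D$ at $z = 0.01D$, and it has saturated at very nearly $K$ by $z = 100D$.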

After determining $F_z$ in the way we have described, Kuijken & Gilmore
(1989) corrected their raw value for the effects of the neglected $v_R$-$v_z$
coupling. The idea underlying this correction is that uncoupled motion in
the $(z, v_z)$ plane of the type they assumed is associated with a Jeans equation
that is identical with the correct Jeans equation (10.45) with the second term
on the right dropped. Therefore the raw value of $F_z$ determined by Kuijken
& Gilmore may be identified with $\nu^{-1}[\partial(\nu\sigma_z^2)/\partial z]$, and the true value of $F_z$
is the raw value plus the contribution from the second term on the right
of equation (10.45). The magnitude of this term (which turns out to be
negative) depends on the shape of the velocity ellipsoid (see §10.3.2) and the
way in which its orientation changes as one moves away from the plane;
see §4.2.1(c) of BT. Although the shape of the velocity ellipsoid is known,
the dependence of its orientation upon $z$ is unknown. It may, however, be
guessed with reasonable plausibility. In this way Kuijken & Gilmore arrive
at a range of plausible values for the required correction.

Kuijken & Gilmore applied their technique to a sample of 512 K dwarfs. 
These stars were selected from 11.5 square degrees at the south Galactic pole 
as follows. Spectra were obtained of all stars with $V < 19$ and $B < 20.06$ and
$B-V$ in the range (0.8, 1.2). From these spectra $v_z$ could be estimated to
better than 10 km s$^{-1}$. Giants were eliminated from this sample as apparently
bright stars that did not show significant MgH absorption; the quality of the
spectra made it difficult to identify MgH absorption in apparently fainter
stars, but it is inherently unlikely that a faint star is a giant. The
color-magnitude relation for solar-metallicity dwarfs was then used to determine
the distance $z$ to each star, and then the presence of a metallicity gradient
was allowed for by an iterative procedure very similar to that described in
§10.4.1. Finally, the standard correction for Malmquist bias [equation (3.34)]
was applied to the derived $M_V$ of each star.

Figure 10.27 shows the final distribution of stars in the $(z,v_z)$ plane.
The distribution in $z$ is determined by competition between the increase
$\propto z^2$ in the volume that lies within the cone of observation within $dz$ of $z$,
and the declining density of stars as one moves away from the plane. For
$z \lesssim 200$ pc there is little decline in the density, and the number of stars in the
sample increases rapidly. There follow a broad maximum and a long steady
decline. When the distribution of Figure 10.27 was analyzed as described
above, Kuijken & Gilmore (1991) concluded that within 1.1 kpc of the plane
the surface density of material is

$$\Sigma_{1.1}(R_0) = 71 \pm 6\ M_\odot\,\mathrm{pc}^{-2}. \tag{10.54}$$

Uncertainty regarding the variation of the velocity ellipsoid near the Sun
contributes an uncertainty of about 15% to this number. Bahcall et al.
(1992) obtained very similar estimates of $\Sigma(R_0)$ from a sample of 125 K
giants.

Significant controversy surrounds the question of how much of the
surface density (10.54) is attributable to the disks rather than to spheroidal
components (chiefly the ‘dark halo’). Mathematically, the problem is that
the data constrain the value of the right-hand side of equation (10.52) at
$z \sim 1.1$ kpc, but permit variations in the parameters $D$, $F$ and $K$ that are
consistent with this value. Since, as we have seen, the physical interpretation
of $K$ is the surface density of the disk, uncertainty in $K$ implies uncertainty
in the surface density of the disk.

Kuijken & Gilmore (1989) resolved this question by remarking that,
since the circular-speed curve is known, mass taken from the disk must be, by
and large, added to the spheroidal components. Hence acceptable solutions
with small $K$ should have large $F$, and vice versa. A study of specific Galactic
mass-models led them to the constraint

$$(15 \pm 3)\ M_\odot\,\mathrm{pc}^{-2} = 0.094K + F \times 1\,\mathrm{kpc}. \tag{10.55}$$

When $F$ and $K$ are constrained by equation (10.55) as well as by the
observationally determined value of equation (10.52) at $z \sim 1.1$ kpc, Kuijken &
Gilmore (1991) find that the surface density of the local disk is

$$\Sigma_d(R_0) = 48 \pm 9\ M_\odot\,\mathrm{pc}^{-2}. \tag{10.56}$$

Unfortunately, the uncertain behavior of the velocity ellipsoid near the Sun,
which contributes an uncertainty of only ~ 15% to equation (10.54), renders
equation (10.56) extremely uncertain. It appears that at present we cannot
strongly constrain the mass density near the Sun.
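It is instructive to check that the numbers quoted above hang together. The sketch below identifies $K$ with the disk surface density of about 48 $M_\odot$ pc$^{-2}$, takes $F$ from the central value of the constraint (10.55), and evaluates the right-hand side of (10.52) at 1.1 kpc. The scale height $D = 200$ pc is an assumed, illustrative value that is not given in the text, and the rotation-curve term of (10.44) is neglected.

```python
import math

K = 48.0                           # Msun/pc^2, disk surface density used as K
F = (15.0 - 0.094 * K) / 1000.0    # Msun/pc^3, from the central value of (10.55)
D = 200.0                          # pc; ASSUMED scale height, not from the text
z = 1100.0                         # pc

disk = K * z / math.sqrt(z * z + D * D)   # first term of (10.52)
spheroid = 2.0 * F * z                    # second term of (10.52)
total = disk + spheroid
print(f"disk {disk:.1f} + spheroid {spheroid:.1f} = {total:.1f} Msun/pc^2")
```

With these inputs the total within 1.1 kpc comes out at about 70 $M_\odot$ pc$^{-2}$, of which roughly a third is attributed to the spheroidal components. This is consistent with (10.54) within its quoted error, and the result is insensitive to $D$ so long as $D \ll 1.1$ kpc.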

When our estimate (10.56) of the local mass-density of the disk is com- 
bined with estimates of the disk’s local surface brightness (§10.1.1), we con- 
clude that the mass-to-light ratios of the disk are 


Ya(Ro) Y2.3Me Lz' 
Yx(Ro) ¥1.0Mg LG. 


The errors in these mass-to-light ratios are large because we are forming 
ratios of numbers that are themselves subject to large errors. 

How much mass do we know to reside in the local disk in the form
of known stars, interstellar gas and so forth? Figure 9.19 implies that the
ISM contributes $\Sigma_{\rm ISM}(R_0) \sim 5.5\ M_\odot\,\mathrm{pc}^{-2}$. The mass contributed by stars
is extremely difficult to assess. From the Gliese catalog, we have a fair idea
of what stars lie within 10 pc of the Sun. In particular, we saw in §3.6.4
that stars contribute about $\rho_* = 0.1\ M_\odot\,\mathrm{pc}^{-3}$ at the Sun. Star counts such
as those analyzed by Kuijken & Gilmore (1989) enable us to determine the
ratio of the number density of stars at altitude $z$ to their number density
near the Sun. Hence determining the contribution of stars to the local surface
density would be simple if the mix of stars were the same at all altitudes.
Unfortunately, the mix of stars must vary systematically with $z$, since, as
we saw in §10.3.2, different stellar types have different values of $\langle v_z^2\rangle$ in the
solar neighborhood. Thus each stellar population must have its own vertical
density profile. We cannot determine all these profiles by direct observation,
but must resort to dynamical models in which the vertical density profiles
of most populations are determined by requiring that each population is in
dynamical equilibrium in a common potential. The structure of the potential,
on the other hand, is determined by requiring that the density profile of a
selected population, for example K dwarfs, agrees with an observationally
determined profile. That is, the contribution of stars to the local surface
density can in practice not be determined independently of the problem of
determining $F_z$.

Since Kuijken & Gilmore (1989) consider only a single tracer population, 
their analysis cannot be used to predict the density of all types of stars 
throughout the disk. The methodology of Bahcall (1984) is, by contrast, well 
suited to this problem. In the standard way, Bahcall reduced the problem to a 
one-dimensional one by neglecting the $v_R$-$v_z$ coupling. Then he decomposed
the matter near the Sun into a series of isothermal populations; each such
population has a distribution function of the form¹⁴

$$f_z(E_z) = \frac{\rho_{i0}}{\sqrt{2\pi\sigma_i^2}}\exp(-E_z/\sigma_i^2), \tag{10.58}$$

14 It is now convenient to define the distribution function $f$ to be the mass density in
the $(z,v_z)$ plane.


(10.57)

Table 10.5 Estimates of the mass density of the disk at the Sun

Model                                          ρ_tot(0)     ρ_dark(0)    Σ_tot(∞)     Σ_dark(∞)
                                               (M⊙ pc^-3)   (M⊙ pc^-3)   (M⊙ pc^-2)   (M⊙ pc^-2)
Dark matter has ⟨v_z^2⟩^(1/2) = 4 km s^-1         0.89         0.79         70           20
Dark matter has ⟨v_z^2⟩^(1/2) = 20.3 km s^-1      0.21         0.10         88           38
Dark and luminous matter have
identical profiles                                0.26         0.16         84           34

SOURCE: Data published in Bahcall et al. (1992)


where $\rho_{i0}$ and $\sigma_i$ are constants characteristic of the $i$th population. It is easy
to show: (i) that $\rho_{i0}$ is the mass density of stars where $\Phi_z = 0$, which we may
place at $z = 0$; and (ii) that the velocity dispersion $\langle v_z^2\rangle^{1/2}$ of each isothermal
population is independent of $z$, and has value $\sigma_i$. With this result, equation
(10.45) simplifies to

$$F_z = \sigma_i^2\,\frac{\partial\ln\rho_i}{\partial z}, \tag{10.59}$$

where the density $\rho_i(z)$ is given by

$$\rho_i(z) = \rho_{i0}\exp(-\Phi_z(z)/\sigma_i^2). \tag{10.60}$$

Now the total density of matter near the Sun is just the sum over $i$ of $\rho_i$,
and $F_z = -\partial\Phi_z/\partial z$, so equation (10.43) becomes

$$4\pi G\sum_i \rho_{i0}\exp(-\Phi_z(z)/\sigma_i^2) = \frac{\partial^2\Phi_z}{\partial z^2} + \frac{1}{R}\frac{\partial v_c^2}{\partial R}. \tag{10.61}$$

Once the constants $\rho_{i0}$ and $\sigma_i$ have been chosen, equation (10.61) becomes an
ordinary differential equation for $\Phi_z(z)$. The density $\rho_i(z)$ of each component
follows when our solution, $\Phi_z(z)$, is substituted into equation (10.60).
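A minimal numerical sketch of this procedure is given below. It is not Bahcall's code: it integrates (10.61) outward from the plane with a fourth-order Runge-Kutta scheme, starting from $\Phi_z = 0$ and $\partial\Phi_z/\partial z = 0$ at $z = 0$, and checks the result against the known analytic solution for a single self-gravitating isothermal sheet, $\Phi_z = 2\sigma^2\ln\cosh(z/z_0)$ with $z_0 = \sigma/\sqrt{2\pi G\rho_0}$. The component densities and dispersions are illustrative values only.

```python
import math

G = 4.30091e-3  # gravitational constant in pc (km/s)^2 / Msun

def solve_phi(components, z_max, n_steps=2000, rot_term=0.0):
    """Integrate equation (10.61) from z = 0 to z_max with RK4.

    components : list of (rho_i0 [Msun/pc^3], sigma_i [km/s])
    rot_term   : (1/R) d(v_c^2)/dR in (km/s)^2/pc^2 (0 for a flat curve)
    Returns lists of z [pc] and Phi_z [(km/s)^2].
    """
    def second_derivative(phi):
        rho = sum(r0 * math.exp(-phi / s ** 2) for r0, s in components)
        return 4.0 * math.pi * G * rho - rot_term

    h = z_max / n_steps
    z, phi, dphi = 0.0, 0.0, 0.0
    zs, phis = [z], [phi]
    for _ in range(n_steps):
        k1p, k1v = dphi, second_derivative(phi)
        k2p, k2v = dphi + 0.5 * h * k1v, second_derivative(phi + 0.5 * h * k1p)
        k3p, k3v = dphi + 0.5 * h * k2v, second_derivative(phi + 0.5 * h * k2p)
        k4p, k4v = dphi + h * k3v, second_derivative(phi + h * k3p)
        phi += h * (k1p + 2.0 * k2p + 2.0 * k3p + k4p) / 6.0
        dphi += h * (k1v + 2.0 * k2v + 2.0 * k3v + k4v) / 6.0
        z += h
        zs.append(z)
        phis.append(phi)
    return zs, phis

# Check: one isothermal component (illustrative values) versus the sech^2 sheet.
rho0, sigma = 0.1, 20.0
zs, phis = solve_phi([(rho0, sigma)], z_max=1000.0)
z0 = sigma / math.sqrt(2.0 * math.pi * G * rho0)
phi_exact = 2.0 * sigma ** 2 * math.log(math.cosh(zs[-1] / z0))
print(phis[-1], phi_exact)
```

Passing several `(rho_i0, sigma_i)` pairs gives the common potential of a multi-component model, and each density profile then follows from (10.60).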

The density profiles p;(z)/pio of certain tracer populations can be de- 
termined from star counts. If the observationally determined profiles agree 
with the theoretical ones from equation (10.61), all is well. However, Bahcall 
found that the theoretical profiles decrease with increasing |z| less rapidly 
than do the observational profiles. This discrepancy is what would occur if 
some components had been missed out, for the more mass there is, the faster 
®, increases away from the plane, and the more steeply the densities (10.60) 
decrease with increasing |z|. Bahcall tried to determine how much missing 
matter there was by adding a single isothermal ‘dark-matter’ component and 
adjusting its mid-plane density $\rho_{\rm DM0}$ until an acceptable fit was obtained to
the observed profile of a tracer population. The amount of dark matter
required varies with the assumed dark-matter velocity dispersion $\sigma_{\rm DM}$: larger
values of $\sigma_{\rm DM}$ yield lower densities $\rho(0)$ in the plane but larger total column
densities $\Sigma(R_0)$ through the disk. The first two rows of Table 10.5 give
results from Bahcall et al. (1992) for the case that the dark matter has a
low velocity dispersion and a velocity dispersion characteristic of K giants. 
The third row gives densities for the case that the density of dark matter 
is a fixed multiple of the density of luminous matter. The volume densities 
are uncertain by at least a factor of 2, and the surface densities by at least 
0.2 dex. 

The difficult step in the practical application of equation (10.61) is the 
identification of populations, for they must be isothermal populations rather 
than mere collections of physically similar stars. Consider, for example, white 
dwarfs as a candidate population. Do they form an isothermal population? 
We have no reason to believe so a priori, so this question should be settled 
observationally by determining the velocity distribution of white dwarfs near 
the Sun, and deciding whether it is Maxwellian. If it proves to be significantly 
non-Maxwellian, we will have to treat the white dwarfs as a superposition of 
more than one isothermal component. For example, if there are more
high-speed white dwarfs than a Maxwellian predicts, we shall have to add to the
basic isothermal population a subsidiary population that has a larger value
of $\sigma_i$. Near the plane, the white dwarf density may be dominated by the
basic population, but at sufficiently large $|z|$ the subsidiary population will
be dominant. Therefore serious errors in $\rho_i(z)$ can arise from small deviations
of the local velocity distribution from a Maxwellian. Gould (1990a) discusses 
whether a given population is sufficiently Maxwellian. 

Kuijken (1991) describes a technique in which $\Phi_z(z)$ is estimated from
the distributions of tracer populations in the $(z,v_z)$ plane without making
any assumption of isothermality. By applying this technique to a sample of
K dwarfs, he concluded that $\rho_{\rm tot}(0) < 0.22\ M_\odot\,\mathrm{pc}^{-3}$ and that there is no
requirement for dark matter in the Galactic disk. At present the amount of
stellar matter in the disk is significantly uncertain. 


10.4.5 Distribution of the youngest stars 


The appearance at optical wavelengths of external spiral galaxies is pro- 
foundly affected by the large-scale distribution of young stars because these 
objects and their associated HII regions trace out spiral arms. What can we 
say about the large-scale distribution of young stars in the Milky Way? 
Much of the disk, within which stars form, is obscured from the Sun 
by several magnitudes of optical extinction. This extinction both confines 
our field of study to within a radius of ~ 4 kpc around the Sun, and distorts the
apparent distribution of young stars. Indeed, extinction is generally patchy,
so we can see to greater distances along some lines of sight than along other
nearby ones, and the apparent distribution of young stars will inevitably
be influenced by the distribution of extinction. To compound these already
formidable difficulties, the literature is deficient in systematic surveys for
objects associated with star formation, so we have to rely on catalogs that
have been put together from a variety of sources that contain all manner of
unquantified selection effects. For all these reasons, and the usual difficulty
of determining distances to astronomical objects, we shall outline a picture of
the distribution of young stars that is woefully out of focus so that a measure
of imagination will be required to make out features that may in reality be
well defined.

Figure 10.28 The distribution of Cepheid variables within 5 kpc of the Sun (left) and of
open clusters in Lynga (1987) with ages < 30 Myr (right). The upper two dotted curves in
each panel show the ridge-lines of the Sagittarius-Carina arm (top) and the Orion-Cygnus
arm (middle) from Figure 9.20.

Cepheid variables are useful tracers of star formation since their ages
are < 0.1 Gyr (half a rotation period of the Sun about the Galaxy), and
their positions within the disk can be determined with reasonable accuracy
from their period-luminosity relation (§5.1.10). The left-hand panel of
Figure 10.28 shows the distribution of 278 Cepheid variables with reliable
distances in a coordinate system in which the Sun is at the origin and the
direction to the Galactic center is upwards. Two dotted curves from
Figure 9.20 are superimposed on this distribution: the top curve shows the
ridge-line of the Sagittarius-Carina arm while the middle curve shows the
ridge-line of the Orion-Cygnus arm. The distribution of Cepheids is clearly
enhanced along the Sagittarius-Carina arm.

The right-hand panel of Figure 10.28 shows the distribution of open
clusters younger than 30 Myr. In addition to being enhanced near the
ridge-line of the Sagittarius-Carina arm, the density of young clusters shows signs
of being enhanced also along both the Orion-Cygnus arm and a third arm,
the Perseus arm, that passes the Sun in the anticenter direction at a distance
of ~ 2 kpc.

Additional evidence that star formation is enhanced along the Perseus 
arm is furnished by Figure 10.29, which shows the distribution of HII regions 
with photometric distances: there is a distinct enhancement in the density of
HII regions in an elongated region around $(y, x) = (3, -1)$. The
Sagittarius-Carina arm, by contrast, is not evident in Figure 10.29.

Alfaro et al. (1991) have shown that the Galaxy’s layer of young clusters 
is not perfectly flat but corrugated in much the same way that the Galactic 
gas layer is corrugated (§9.2.6). Thus, the region of highest cluster density
oscillates above and below the plane by ~ 50 pc with a characteristic
wavelength of ~ 2 kpc. Since clusters form by the fragmentation of dense
molecular clouds, the corrugation of the cluster layer is presumably a simple 
consequence of the poorly understood corrugation of the gas layer. 


10.5 The halo 


We saw in §4.5 that most galaxies have a globular cluster system that extends 
to beyond R,. It is obviously of interest to see how the Milky Way’s globular 
cluster system compares with those of external galaxies. The Milky Way’s
globular cluster system is especially important because it is associated with a
halo of field stars. We shall find that the field-star halo is nearly two orders of
magnitude more luminous than the globular cluster system, but its surface brightness
at the solar radius is so small (~ 30 mag arcsec$^{-2}$; see Problem 10.13) that
similar halos could not be studied at comparable radii in external galaxies.
Hence, if external galaxies do possess field-star halos, we can only trace them 
through the association between the local globular-cluster system and field- 
star halo, which we now investigate. 


10.5.1 The globular cluster system 


As we saw in 86.1, there are about 150 globular clusters in the Milky Way. 
Kinman (1959) pointed out that the metal-richer globular clusters seem to 
form a more highly flattened and more rapidly rotating system than the least
metal-rich globular clusters. Zinn (1985) established beyond doubt that such
a division of the globular clusters into sub-systems with [Fe/H] > −0.8 and
[Fe/H] < −0.8 is real: clusters of the former, metal-rich sub-system are
called disk globular clusters or G clusters, while clusters of the metal-weak
sub-system are called halo globular clusters or F clusters. Since
Zinn’s analysis, significant improvements in the quality of the data available
for the metal-rich clusters (which tend to occur in crowded and reddened
low-latitude fields) prompted Armandroff (1989) to re-evaluate the parameters
of the two sub-systems, and we shall largely follow his analysis.

Figure 10.30 The distribution of the disk (top) and halo (bottom) globular clusters as
viewed by an observer located at a large distance in the anticenter direction. The dotted
rectangle in the lower panel indicates the region shown in the upper panel. In the upper
panel we have superimposed the distribution that would be seen by an observer located
at a large distance in the direction $l = 270^\circ$. In each panel the Galactic center lies at the
origin. [From data published in Armandroff (1989)]

Figure 10.30 shows the spatial distribution of well-studied disk and halo 
globular clusters. In the upper panel each cluster is shown twice, once as 
it would be observed by an observer who was located far from the Milky 
Way in the anticenter direction, and once as viewed by an observer located
far down the line $l = 270^\circ$. Since the halo clusters are more numerous, the
lower panel shows them only as seen from the anticenter direction. From this
figure one sees that the halo clusters form an essentially spherical distribution
that extends to at least 25 kpc, whereas the disk clusters form a flattened
distribution that does not extend to very large radii. 

The disk clusters are too few in number to define adequately a density 
distribution in the (R, z) plane. However, the idea that the disk sub-system 
might be the analog for globular clusters of the thick stellar disk introduced 
in §10.4.3 suggests that we assume that the density of disk clusters falls off
with $z$ as $e^{-|z|/z_0}$ and try to determine the scale height $z_0$. Two facts make the
value of $z_0$ very uncertain. The first is the awkward presence of six points
along the upper and lower edges of the top panel of Figure 10.30. These
points, which are associated with the clusters NGC 6356, 47 Tuc and Pal 11,
lie at some remove from the main clump of disk clusters, and fit poorly with
the idea that the cluster density declines exponentially with $z$. The second
source of uncertainty in $z_0$ is the expectation that clusters are preferentially
missing from the data set at small $z$, because it is precisely near the plane that
obscuration and confusion make it hard to study clusters. Subject to these
large uncertainties, Armandroff (1989) concludes that $z_0 = 1.1 \pm 0.35$ kpc.
To within the errors this value is the same as the scale height $z_0 \simeq 1.3$ kpc
of the thick disk that is inferred from Figure 10.25.

There are too few globular clusters in each sub-system for the kinematics 
of these systems to be well defined. Nevertheless it is possible to estimate 
a mean rate of rotation and a characteristic velocity dispersion for each 
sub-system. In early work it was generally assumed that each sub-system
rotates like a solid body at an angular frequency $\Omega$, but, following the work
of Frenk & White (1980), it is now usually assumed that each sub-system
rotates differentially, in such a way that the rotation speed takes the same
value, $v_{\rm rot}$, everywhere. In addition to this systematic rotation, each globular
cluster has a random velocity $\mathbf{v}_{\rm rand}$. Compounding these velocities, the total
Galactocentric velocity of the $i$th cluster is

$$\mathbf{v}^{(i)} = v_{\rm rot}\,\hat{\mathbf{e}}_\phi^{(i)} + \mathbf{v}_{\rm rand}^{(i)}, \qquad (10.62)$$

where the expectation value of $\mathbf{v}_{\rm rand}^{(i)}$ is zero, and $\hat{\mathbf{e}}_\phi^{(i)}$ is the unit vector in
the azimuthal direction at the cluster’s location. While undoubtedly an
over-simplification, this kinematic model is a reasonable approximation to 
the observed rotation fields of elliptical galaxies (see §11.2.1), which are the 
archetypical hot stellar systems. 

Frenk & White (1980) fitted $v_{\rm rot}$ to the line-of-sight velocities of clusters
because only a few clusters have useful proper motions. Let $\hat{\mathbf{x}}^{(i)}$ be the
direction from the Sun to the $i$th cluster. The line-of-sight velocity of this
cluster is $v_{\rm los}^{(i)} = \hat{\mathbf{x}}^{(i)} \cdot (\mathbf{v}^{(i)} - \mathbf{v}_\odot)$. Taking the scalar product of equation
(10.62) with $\hat{\mathbf{x}}^{(i)}$ and rearranging, we obtain

$$\hat{\mathbf{x}}^{(i)} \cdot \mathbf{v}_{\rm rand}^{(i)} = v_{\rm los}^{(i)} - v_{\rm rot}\,\hat{\mathbf{e}}_\phi^{(i)} \cdot \hat{\mathbf{x}}^{(i)} + \mathbf{v}_\odot \cdot \hat{\mathbf{x}}^{(i)}. \qquad (10.63)$$

Figure 10.31 The distribution over luminosity of the Milky Way’s
globular clusters. The curve is the Gaussian distribution given by
equation (10.65). [From data published in Armandroff (1989)]


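By the argument explained in Box 10.1, we now choose the value of $v_{\rm rot}$ that
minimizes the sum over $i$ of the square of the left side of equation (10.63). One
easily finds that the required value is

$$v_{\rm rot} = \frac{\sum_i \hat{\mathbf{e}}_\phi^{(i)} \cdot \hat{\mathbf{x}}^{(i)} \left(v_{\rm los}^{(i)} + \mathbf{v}_\odot \cdot \hat{\mathbf{x}}^{(i)}\right)}{\sum_i \left(\hat{\mathbf{e}}_\phi^{(i)} \cdot \hat{\mathbf{x}}^{(i)}\right)^2}. \qquad (10.64)$$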


Substituting the derived value of $v_{\rm rot}$ into equation (10.63) one obtains the
line-of-sight component of the random velocity of each cluster, and can hence
derive $\sigma_{\rm los}$ for the clusters.
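The fit just described can be sketched in a few lines of Python. This is only an illustration of the least-squares estimate of equation (10.64) and of the residuals of equation (10.63), not Frenk & White's or Armandroff's actual procedure; the function names and the three made-up, noise-free clusters are assumptions introduced here.

```python
import math

def fit_rotation(cosines, v_los, v_sun_los):
    """Least-squares rotation speed, equation (10.64).

    cosines[i]   : e_phi^(i) . x^(i), the azimuthal unit vector projected on
                   the line of sight to cluster i
    v_los[i]     : heliocentric line-of-sight velocity of cluster i
    v_sun_los[i] : v_sun . x^(i), the Sun's Galactocentric velocity projected
                   on the same line of sight
    """
    num = sum(c * (v + vs) for c, v, vs in zip(cosines, v_los, v_sun_los))
    den = sum(c * c for c in cosines)
    return num / den

def los_dispersion(cosines, v_los, v_sun_los, v_rot):
    """RMS of the residuals x . v_rand of equation (10.63)."""
    resid = [v + vs - v_rot * c for c, v, vs in zip(cosines, v_los, v_sun_los)]
    return math.sqrt(sum(r * r for r in resid) / len(resid))

# Hypothetical, noise-free data: three clusters rotating at exactly 50 km/s.
cosines = [0.9, -0.4, 0.2]
v_sun_los = [100.0, -30.0, 60.0]
v_los = [50.0 * c - vs for c, vs in zip(cosines, v_sun_los)]

v_rot = fit_rotation(cosines, v_los, v_sun_los)
sigma_los = los_dispersion(cosines, v_los, v_sun_los, v_rot)
```

With real data the cosines and the projected solar velocity depend on the adopted cluster distances and on the assumed solar velocity, so both would need to be recomputed whenever those inputs change.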

The greatest source of uncertainty in equation (10.64) for $v_{\rm rot}$ is the
cosines $\hat{\mathbf{e}}_\phi^{(i)} \cdot \hat{\mathbf{x}}^{(i)}$, especially for clusters that lie near the Galactic center,
where the direction of $\hat{\mathbf{e}}_\phi$ changes rapidly, so these cosines depend sensitively
on the rather uncertain cluster distances. The derived value of $v_{\rm rot}$ also
depends upon the assumed value of $\mathbf{v}_\odot$.

For the halo sub-system, Armandroff (1989) finds $v_{\rm rot} = 43 \pm 29$ km s$^{-1}$
and $\sigma_{\rm los} = 116 \pm 11$ km s$^{-1}$, while for the disk sub-system he finds
$v_{\rm rot} = 193 \pm 29$ km s$^{-1}$ and $\sigma_{\rm los} = 59 \pm 14$ km s$^{-1}$. Thus the kinematics of the
globular-cluster sub-systems bear out the impression one gains from Figure 10.30 that
the halo sub-system is dominated by random motions while the disk sub-system
is flattened because it rotates rapidly. For Armandroff’s assumed
value of the local circular velocity, $v_c(R_0) = 220$ km s$^{-1}$, the rotation rates of
the halo and disk globular-cluster sub-systems relative to the LSR correspond
to $\langle v_\phi\rangle = -177$ km s$^{-1}$ and $\langle v_\phi\rangle = -27$ km s$^{-1}$, respectively.$^{15}$ Comparing
this last number with the value $\langle v_\phi\rangle = -36$ km s$^{-1}$ given in Table 10.4 for
the thick disk, one sees that the disk globular-cluster sub-system rotates
marginally faster than the local thick disk.
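The conversion from a sub-system's rotation speed to its mean azimuthal velocity relative to the LSR is a single subtraction. Written out with only the values quoted in this paragraph:

```latex
\begin{align*}
\langle v_\phi\rangle &= v_{\rm rot} - v_c(R_0),\\
\text{halo:}\quad \langle v_\phi\rangle &= 43 - 220 = -177\ {\rm km\,s^{-1}},\\
\text{disk:}\quad \langle v_\phi\rangle &= 193 - 220 = -27\ {\rm km\,s^{-1}}.
\end{align*}
```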

$^{15}$ Armandroff adopted a larger solar motion [20 km s$^{-1}$ towards $(l, b) = (57^\circ, 22^\circ)$] than
that given by equation (10.11).

The data are consistent with both sub-systems having a common luminosity
function. Figure 10.31 shows the histogram of luminosities for the two
sub-systems combined. It is adequately fitted by [cf. equation (4.49)]

$$\frac{dN}{dM_V} = A \exp\left[-\frac{(M_V - \overline{M}_V)^2}{2\sigma^2}\right] \quad \text{with } \overline{M}_V = -7.5,\ \sigma = 1.25. \qquad (10.65)$$

There is no statistically significant evidence for a correlation of luminosity 
with either Galactocentric distance or distance from the plane. 

Although there are accurate data for only 19 disk clusters, so that statis- 
tical uncertainties are large, the mean metallicity of the disk clusters seems 
to decrease outwards. There is no evidence of a corresponding metallicity 
gradient in the halo sub-system, even though it is more numerous and so 
gives rise to smaller statistical uncertainties. 

Figure 10.32 shows the metallicity distribution of 96 globular clusters 
for which reliable data are available. The division between the disk and the 
halo sub-systems at [Fe/H] = −0.8 is evident. The metallicities of the disk
clusters are strongly peaked around [Fe/H] = −0.5, while the halo clusters
are distributed over a much wider range in [Fe/H] that reaches down to
[Fe/H] = −2.6. In §10.7.2 below we shall see that the leaky-box model of
§5.3.2 provides a reasonable fit to the metallicity distribution of the halo 
clusters. 


10.5.2 Field halo stars 


Most globular clusters are more metal-poor than any star in the Edvards- 
son et al. (1993) sample that was discussed in §10.4. In the 1940s it was 
discovered that some field stars closely resemble globular-cluster stars, in 
particular by being very metal-poor. These stars are of considerable interest 
for two reasons. First, it is useful to identify very metal-poor stars in the 
solar neighborhood, which can be studied in greater detail than stars located 
in even the nearest globular cluster. Second, very metal-poor stars must have
either formed outside the Milky Way or before the Milky Way’s ISM became
significantly polluted with metals. In either case they must carry significant 
information about the Milky Way’s origins. In particular, we would like to 
know the connection between these stars and the globular-cluster system — 
for example, are they debris from shattered globular clusters? To answer 
such questions we need to know how they are distributed in real space and 
in velocity space, and their distribution in metallicity. 

In the solar neighborhood, the space-density of very metal-poor stars 
is at least two orders of magnitude smaller than the space-density of rela- 
tively metal-rich stars (Dahn et al. 1995). In view of this scarcity, effective 
strategies for locating these objects have had to be developed. A classical 
strategy is to look for RR Lyrae stars whose periods lie in the range defined 
by the RR Lyrae stars of halo globular clusters. These stars have the same 
metallicity distribution as the halo clusters (Suntzeff, Kinman & Kraft 1991) 
and can be identified from their characteristic variability (see §5.1.10). The
surfaces on which the density of these RR Lyrae stars is constant are approximately
spherical at $r > R_0$ but become flattened (axis ratio $q \simeq 0.6$) at
$r < R_0$ (Kinman, Mahaffey & Wirtanen 1982, Saha 1985, Hartwick 1987).
Interior to $r \sim 25$ kpc, the space-density of these stars falls off with Galactocentric
radius $r$ as $r^{-\alpha}$ with exponent $\alpha$ in the range $3 < \alpha < 3.5$. Beyond
25 kpc, the density may fall more steeply with increasing radius. The local
space-density of RR Lyraes is $\sim 10 \pm 5$ kpc$^{-3}$ (Layden 1995).

An objection to using RR Lyrae stars as tracers of the halo is that 
they make up only a fraction of the HB stars, and we know from §6.1.2 
that small changes in the abundances and/or ages of these stars give rise 
to significant and ill-understood changes in the disposition of stars on the 
HB. Consequently, one cannot safely infer the total number of halo stars at 
some location from the local density of RR Lyrae stars. Fortunately, from 
the ratio of the numbers of blue HB stars and RR Lyrae stars in a cluster 
one can predict with reasonable accuracy the numbers of red HB stars — see 
Lee, Demarque & Zinn (1994) and Chapter 6. Therefore, by combining a 
search for RR Lyrae stars with one for blue HB stars, one can estimate the 
total number of HB stars, and this, as we saw in §5.1, is tightly coupled to 
the density of all types of stars in an old population. 

Several large-scale searches for blue HB stars have been conducted in 
recent years (Sommer-Larsen, Christiansen & Carter 1989, Preston, Shect- 
man & Beers 1991, Arnold & Gilmore 1992, Kinman, Suntzeff & Kraft 
1994). These studies identify blue HB stars from a combination of objective- 
prism images and the broad-band colors of large numbers of stars. Kinman, 
Suntzeff & Kraft (1994) conclude that at distances $z$ above the plane in excess
of 5 kpc the space-density of HB stars falls off as $r^{-3.5}$. The density
$\nu_0$ of HB stars in the solar neighborhood has been determined by Green &
Morrison (1993) from $uvby\beta$ photometry followed by high-resolution spectroscopy;
they find $\nu_0 \simeq 51 \pm 17$ kpc$^{-3}$. This value is considerably larger
than is expected by extrapolating the trend at large z to the plane under the
assumption that the system is spherical or mildly elliptical. Hence it is likely
that a large contribution to the local space-density of HB stars comes from a
highly flattened component, such as the thick disk. Kinman (1994) estimates
that half to two-thirds of the Milky Way’s $\sim 150\,000$ blue HB stars lie in
this thick-disk component. Since the visual luminosity of globular clusters
is related to the number of their HB stars by $L_V/N_{\rm HB} \simeq 540\,L_\odot$ (Preston,
Shectman & Beers 1991), $\sim 75\,000$ HB stars should correspond to a total
halo luminosity $L_V \sim 4 \times 10^7\,L_\odot$. If the mass-to-light ratio of the halo
resembles that of a globular cluster, $\Upsilon_V \simeq 2.5\,\Upsilon_\odot$ (§6.1.11), the mass of the
halo is $\sim 1.1 \times 10^8\,M_\odot$.

Box 10.4: Velocity dispersions from line-of-sight velocities

Let the $k$th star in a survey lie in the direction $\hat{\mathbf{x}}^{(k)}$, and let the principal
axes of the velocity ellipsoid at its location have directions $\hat{\mathbf{e}}_1^{(k)}$, $\hat{\mathbf{e}}_2^{(k)}$ and
$\hat{\mathbf{e}}_3^{(k)}$. The line-of-sight velocity of this star is

$$v_{\rm los}^{(k)} = \hat{\mathbf{x}}^{(k)} \cdot \mathbf{v}^{(k)} = \hat{\mathbf{x}}^{(k)} \cdot \left(v_1^{(k)} \hat{\mathbf{e}}_1^{(k)} + v_2^{(k)} \hat{\mathbf{e}}_2^{(k)} + v_3^{(k)} \hat{\mathbf{e}}_3^{(k)}\right).$$

Squaring this equation we find

$$\left(v_{\rm los}^{(k)}\right)^2 = \sum_{i,j=1}^{3} v_i^{(k)} v_j^{(k)} a_i^{(k)} a_j^{(k)}, \qquad (1)$$

where $a_1^{(k)} \equiv \hat{\mathbf{x}}^{(k)} \cdot \hat{\mathbf{e}}_1^{(k)}$ etc. If we have many stars at a given location,
we can average (1) over these stars. Then $v_i^{(k)} v_j^{(k)}$ averages to zero for
$i \neq j$, since the components of velocity parallel to the velocity ellipsoid’s
principal axes are statistically independent, and we find that

$$\langle v_{\rm los}^2 \rangle = \sum_i \sigma_i^2 a_i^2.$$

Usually we have insufficient stars to average them point by point. In
this case we multiply (1) by $(a_1^{(k)})^2$ and average over the whole sample.
We argue that the resulting average will be dominated by intrinsically
non-negative terms. Hence

$$\langle v_{\rm los}^2 a_1^2 \rangle_{\rm sample} \simeq \sigma_1^2\, \overline{a_1^4} + \sigma_2^2\, \overline{a_2^2 a_1^2} + \sigma_3^2\, \overline{a_3^2 a_1^2}.$$

Two similar equations are obtained by multiplying (1) by $(a_2^{(k)})^2$ and
$(a_3^{(k)})^2$ before averaging, and then the resulting set of three equations
can be solved simultaneously for the velocity dispersions $\sigma_i$.
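The halo luminosity and mass quoted in the text are a short chain of multiplications, which the following sketch reproduces. The variable names are introduced here for illustration; the inputs are the rounded values from the text, so the mass comes out near $10^8\,M_\odot$, consistent with the quoted figure at about the 10% level.

```python
# Rounded inputs quoted in the text (Kinman 1994; Preston, Shectman & Beers 1991).
N_HB_HALO = 75_000     # halo HB stars, roughly half of the ~150 000 total
L_PER_HB = 540.0       # L_V / N_HB for globular clusters, in L_sun
UPSILON_V = 2.5        # assumed halo mass-to-light ratio, in solar units

halo_luminosity = N_HB_HALO * L_PER_HB     # L_sun
halo_mass = UPSILON_V * halo_luminosity    # M_sun

print(f"L_V ~ {halo_luminosity:.2e} L_sun, M ~ {halo_mass:.2e} M_sun")
```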

Table 10.6 Characteristic parameters of the halo

Density profile                            $L_V/L_\odot$     $\langle v_\phi\rangle$ (km s$^{-1}$)   Velocity dispersions (km s$^{-1}$)
$\propto r^{-3.5}$ ($r \lesssim 25$ kpc)   $4 \times 10^7$   $-185$                                  135   105   90

Since the local density of halo stars is rather small, many studies of the
halo are based on relatively distant stars. Reliable proper motions will often
not be available for such stars, so the structure of the halo’s velocity ellipsoid 
must frequently be derived from radial velocities alone. Box 10.4 explains 
how this is done. Unfortunately, the technique described in Box 10.4 requires 
that one know a priori the orientation of the principal axes of the velocity 
ellipsoid at the location of each star. In practice, one usually simply assumes 
that the velocity ellipsoid is oriented parallel to the coordinate directions 
of either cylindrical or spherical polar coordinates, since these are the most 
extreme of all plausible geometries. 
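The procedure of Box 10.4 amounts to building and solving a 3×3 linear system for the squared dispersions. The sketch below is a minimal illustration under stated assumptions: the direction cosines $a_i^{(k)}$ are taken as known inputs (which is exactly the a priori orientation assumption discussed above), the helper names are hypothetical, and the synthetic test sample uses a velocity ellipsoid aligned with fixed Cartesian axes and isotropically distributed lines of sight.

```python
import math
import random

def solve3(m, b):
    """Solve the 3x3 system m x = b by Gaussian elimination with pivoting."""
    a = [row[:] + [rhs] for row, rhs in zip(m, b)]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, 3):
            f = a[r][col] / a[col][col]
            for c in range(col, 4):
                a[r][c] -= f * a[col][c]
    x = [0.0, 0.0, 0.0]
    for r in (2, 1, 0):
        x[r] = (a[r][3] - sum(a[r][c] * x[c] for c in range(r + 1, 3))) / a[r][r]
    return x

def dispersions_from_los(cosines, v_los):
    """Return (sigma_1, sigma_2, sigma_3) from line-of-sight velocities.

    cosines[k] = (a1, a2, a3) for star k; v_los[k] is its line-of-sight
    velocity.  Builds the three sample-averaged equations
        <v_los^2 a_j^2> = sum_i sigma_i^2 <a_i^2 a_j^2>,  j = 1, 2, 3.
    """
    n = len(v_los)
    m = [[sum(a[i] ** 2 * a[j] ** 2 for a in cosines) / n for i in range(3)]
         for j in range(3)]
    b = [sum(v * v * a[j] ** 2 for a, v in zip(cosines, v_los)) / n
         for j in range(3)]
    return [math.sqrt(max(s2, 0.0)) for s2 in solve3(m, b)]

# Synthetic check with made-up dispersions (km/s) and isotropic sight lines.
rng = random.Random(1)
true_sigma = (135.0, 105.0, 90.0)
cosines, v_los = [], []
for _ in range(100_000):
    d = [rng.gauss(0.0, 1.0) for _ in range(3)]
    norm = math.sqrt(sum(x * x for x in d))
    a = tuple(x / norm for x in d)
    v = [rng.gauss(0.0, s) for s in true_sigma]
    cosines.append(a)
    v_los.append(sum(vi * ai for vi, ai in zip(v, a)))

recovered = dispersions_from_los(cosines, v_los)
```

For a real survey the sight lines are far from isotropic, so the matrix of averaged cosines can be poorly conditioned and the recovered dispersions correspondingly uncertain.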

From the radial velocities of a kinematically unbiased sample of 174 
red giants with [Fe/H] < −1.5, Carney & Latham (1986) concluded that
$(\sigma_r, \sigma_\theta, \sigma_\phi) = (154 \pm 18, 102 \pm 27, 107 \pm 15)$ km s$^{-1}$ and
$\langle v_\phi\rangle = -206 \pm 23$ km s$^{-1}$, where all velocities are with respect to the LSR. Norris (1986) finds
$(\sigma_r, \sigma_\phi, \sigma_\theta) = (131 \pm 6, 106 \pm 6, 85 \pm 4)$ km s$^{-1}$ and $\langle v_\phi\rangle = -183 \pm 10$ km s$^{-1}$
for objects more metal-poor than [Fe/H] = −1.2, with no indication that
any sub-population of such metal-poor objects (such as HB stars or globular
clusters) has different kinematics from the halo as a whole. Norris finds
that samples of stars more metal-rich than [Fe/H] = −1.2 have values of
$\langle v_\phi\rangle$ that rise steadily with [Fe/H] to $\langle v_\phi\rangle \simeq -60$ km s$^{-1}$ at [Fe/H] = −0.6.
The line-of-sight velocity dispersion of stars with −0.9 < [Fe/H] < −0.6 is
$\sigma_{\rm los} = 70 \pm 10$ km s$^{-1}$. Zinn (1985) derives similar numbers for the more
metal-rich RR Lyrae stars: stars with $\Delta S < 2$ ([Fe/H] $\gtrsim$ −0.8; §5.1.10)
yield $\langle v_\phi\rangle = -32 \pm 20$ km s$^{-1}$, $\sigma_{\rm los} = 64 \pm 16$ km s$^{-1}$. Layden et al. (1996)
find $(\sigma_r, \sigma_\theta, \sigma_\phi) = (168 \pm 13, 102 \pm 8, 97 \pm 7)$ km s$^{-1}$ and
$\langle v_\phi\rangle = -210 \pm 12$ km s$^{-1}$ for 162 RR Lyrae stars of mean metallicity [Fe/H] = −1.61, and
$(\sigma_r, \sigma_\theta, \sigma_\phi) = (56 \pm 8, 51 \pm 8, 31 \pm 5)$ km s$^{-1}$ and $\langle v_\phi\rangle = -48 \pm 9$ km s$^{-1}$ for 51
RR Lyrae stars of mean metallicity [Fe/H] = −0.76. Table 10.6 summarizes
the characteristic properties of the halo. 


Kinematically selected samples A widely employed strategy for identi- 
fying field halo stars is to select as candidates for spectroscopic investigation 
stars with large proper motions. The reasoning behind this method is that 
halo stars tend to have large heliocentric velocities because they are on orbits 
that differ greatly from the Sun’s. Consequently, they have large proper mo- 
tions out to a greater distance than do disk stars. This strategy is especially 
valuable for identifying halo stars that are still on the MS — the subdwarfs 
— because, unlike HB stars, such stars are too faint to be seen to great dis- 
tances. Hence we are obliged to study them when they pass close to the 
Sun, which is precisely when they have large proper motions; for a recent
determination of the solar-neighborhood density of subdwarfs, see Dahn et
al. (1995). Unfortunately, great care has to be taken in the interpretation of
kinematic data obtained from proper-motion samples. An example will best
explain the problem.

Figure 10.33 The velocity-space geometry of selecting stars with large
proper motions.

Let us imagine that we have a population of stars that all have the same 
absolute magnitude $M$. From this population we select all stars brighter
than some apparent magnitude $m_0$ with proper motions larger than $\mu_0$. At
each $m < m_0$ candidate stars lie on the spherical shell $S$ that is centered on
the Sun and whose radius $s$ satisfies equation (2.36). Let $\mathbf{u}$ be the heliocentric
velocity of a star and $\mathbf{u}_\perp$ be the component of this velocity perpendicular
to the direction $\hat{\mathbf{x}}$ to the star. Then from the stars on $S$ we select those with
$|\mathbf{u}_\perp| > \mu_0 s$. It is helpful to interpret this constraint in terms of the geometry
of velocity space: we select only stars that lie outside a cylinder of radius $\mu_0 s$
whose axis is parallel to $\hat{\mathbf{x}}$ and passes through the Sun’s velocity $\mathbf{v}_\odot$ (see
Figure 10.33). Now let us assume that $s \ll R_0$ so that we may assume that
stars are uniformly distributed over $S$. Then we may calculate the probability
of selecting a star at a given point in velocity space, irrespective of its direction
$\hat{\mathbf{x}}$, by averaging over directions $\hat{\mathbf{x}}$ the function which is unity outside
the cylinder and zero inside it. The result of this averaging is constant on
spherical shells in velocity space, and is therefore a function $P_s(u)$ of $u = |\mathbf{u}|$.
It is not hard to see that $P_s(u)$ is equal to the fraction of a spherical shell of
radius $u$ that lies outside the cylinder of radius $\mu_0 s$. A simple integral shows
that this fraction is

$$P_s(u) = \begin{cases} \sqrt{1 - \dfrac{\mu_0^2 s^2}{u^2}} & \text{for } u > \mu_0 s, \\ 0 & \text{otherwise.} \end{cases} \qquad (10.66)$$

Thus, our probability of selecting a star is zero if its heliocentric speed $u$ is
smaller than $\mu_0 s$ and rises gradually to unity as $u$ becomes large compared
to $\mu_0 s$.
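The geometric claim behind the selection probability can be checked numerically. The sketch below assumes the square-root form $P_s(u) = \sqrt{1 - \mu_0^2 s^2/u^2}$, which is the fraction of a sphere of radius $u$ lying outside a cylinder of radius $\mu_0 s$ whose axis passes through the sphere's centre, and compares it with a Monte Carlo estimate over isotropic directions. Function names and the sample values are illustrative assumptions.

```python
import math
import random

def selection_probability(u, mu0_s):
    """Fraction of the sphere |u| = u lying outside a cylinder of radius
    mu0_s whose axis passes through the sphere's centre."""
    if u <= mu0_s:
        return 0.0
    return math.sqrt(1.0 - (mu0_s / u) ** 2)

def monte_carlo_fraction(u, mu0_s, n=200_000, seed=0):
    """Estimate the same fraction by drawing isotropic directions."""
    rng = random.Random(seed)
    outside = 0
    for _ in range(n):
        cos_theta = rng.uniform(-1.0, 1.0)   # isotropic: cos(theta) is uniform
        u_perp = u * math.sqrt(1.0 - cos_theta ** 2)
        if u_perp > mu0_s:
            outside += 1
    return outside / n

# Example in arbitrary velocity units: heliocentric speed twice the threshold.
analytic = selection_probability(200.0, 100.0)
sampled = monte_carlo_fraction(200.0, 100.0)
```

Here `mu0_s` stands for the product $\mu_0 s$ expressed as a velocity, so the caller is responsible for converting proper motion times distance into the same units as `u`.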

Figure 10.34 Contours of equal stellar density in velocity space are
assumed to center on the velocity of the Galactic center. The Sun’s velocity
is located at the center of the shaded region. Stars with velocities
within that region are excluded from a proper-motion selected sample.
The mean rotation velocity of stars outside the shaded region is negative.

Figure 10.35 Stars selected from proper-motion surveys give biased
kinematic results. Here we plot the mean azimuthal velocity of a sample
selected by proper motion from a Maxwellian population of dispersion
$\sigma$. The population-mean of zero is attained only when the Sun’s rotation
velocity $v_c$ is much larger than $\sigma$. [Horizontal axis: $\sigma/v_c$, from 0 to 1.]

If the space-density of stars of the type we are studying is $n_0$, and of
these stars a fraction $f(u)\,du$ have heliocentric speeds in the range $(u + du, u)$,
then there are $4\pi n_0 s^2\,ds\,f(u)\,du$ stars in this speed-range in the shell of radius
$s$ around the Sun. Multiplying by $P_s(u)$, dividing by $du$ and integrating over
$s$, we obtain the number of stars we expect to pick up in our proper-motion
survey:

$$\frac{dN_{\rm cat}}{du} = 4\pi n_0 f(u) \int_0^{s_{\max}} ds\, s^2 P_s(u), \qquad (10.67)$$

where $s_{\max} = 10^{1 + 0.2(m_0 - M)}$ pc is the maximum distance at which stars are
bright enough to enter the catalog. For simplicity we now assume that our
magnitude limit $m_0$ is faint so that $P_{s_{\max}}(u) = 0$ for even the fastest stars;
in practice this assumption is realistic. Then the upper limit of the integral
in (10.67) can be set to $u/\mu_0$, and it is easy to see that the integral evaluates
to $\frac{\pi}{16} u^3/\mu_0^3$. We then have

$$\frac{dN_{\rm cat}}{du} = \frac{\pi^2 n_0}{4 \mu_0^3}\, u^3 f(u). \qquad (10.68)$$
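The step from equation (10.67) to equation (10.68) can be written out explicitly. The working below assumes the square-root form of the selection probability, $P_s(u) = \sqrt{1 - \mu_0^2 s^2/u^2}$, and uses the substitution $s = (u/\mu_0)\sin\theta$:

```latex
\begin{align*}
\int_0^{u/\mu_0} ds\, s^2 \sqrt{1 - \frac{\mu_0^2 s^2}{u^2}}
  &= \frac{u^3}{\mu_0^3} \int_0^{\pi/2} d\theta\, \sin^2\theta \cos^2\theta \\
  &= \frac{u^3}{\mu_0^3} \cdot \frac{1}{4} \int_0^{\pi/2} d\theta\, \sin^2 2\theta
   = \frac{\pi}{16} \frac{u^3}{\mu_0^3}, \\
\frac{dN_{\rm cat}}{du}
  &= 4\pi n_0 f(u) \cdot \frac{\pi}{16} \frac{u^3}{\mu_0^3}
   = \frac{\pi^2 n_0}{4 \mu_0^3}\, u^3 f(u).
\end{align*}
```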

Now consider the application of this formula in a simple case. Let the
stars have a Maxwellian distribution of velocities in the rest frame of the
Galactic center (in particular we are imagining that the population has no
net rotation around the Galaxy) and let us neglect the solar motion with
respect to the LSR. With these assumptions, we calculate the mean value
$\langle v_\phi\rangle$ of the azimuthal velocity $v_\phi$ for stars in our sample. Figure 10.34 makes
it intuitively clear that the result of this calculation is going to be negative,
because the mean of $v_\phi$ over the entire population vanishes, and we will be
calculating the mean of a sample from which stars with positive values of $v_\phi$
have been preferentially excluded. Quantitatively, $f(u)$ is the integral over
a spherical shell of radius $u$ of the probability density $p(\mathbf{v})$ of finding a star
at Galactocentric velocity $\mathbf{v}$. Thus

$$f(u) = u^2 \int d^2\Omega\, p(\mathbf{v}) \propto u^2 \int d^2\Omega\, e^{-v^2/2\sigma^2}. \qquad (10.69)$$

Substituting this function into equation (10.68) and bearing in mind that $v_\phi$
is a function $v_\phi(u, \Omega)$ of both $u$ and position $\Omega$ on the sphere of radius $u$
about the Sun’s velocity (see Figure 10.34), we conclude that the required
mean is

$$\begin{aligned}
\langle v_\phi\rangle &= \frac{\int du\, v_\phi\, (dN_{\rm cat}/du)}{\int du\, (dN_{\rm cat}/du)} \\
&= \frac{\int du\, u^5 \int d^2\Omega\, v_\phi\, e^{-v^2/2\sigma^2}}{\int du\, u^5 \int d^2\Omega\, e^{-v^2/2\sigma^2}} \\
&= \frac{\int d^3\mathbf{v}\, v_\phi\, u^3 e^{-v^2/2\sigma^2}}{\int d^3\mathbf{v}\, u^3 e^{-v^2/2\sigma^2}},
\end{aligned} \qquad (10.70)$$


where the third line follows from the second because $d^3\mathbf{v} = u^2\,du\,d^2\Omega$. Figure 10.35
plots $\langle v_\phi\rangle$ as a function of $v_c/\sigma$ (see Problem 10.14). From this
figure and the fact that $\sigma/v_c \sim 0.5$ for a typical halo population, it follows
that the rotation velocity of a proper-motion selected sample is liable to be
in error by $\sim 0.31 v_c \sim 60$ km s$^{-1}$. Thus, raw means and dispersions of samples
selected from proper-motion catalogs are in themselves meaningless; it
is essential to correct such quantities for biases introduced by the selection
criteria (Ryan & Norris 1993).
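The sign and rough size of this bias can be illustrated with a small Monte Carlo experiment. The sketch below assumes that the selection weights each star by $u^3$ (the factor multiplying $f(u)$ in equation (10.68)), draws Galactocentric velocities from an isotropic, non-rotating Gaussian, and places the Sun's velocity at $v_c$ in the azimuthal direction. The function name and the adopted numbers are illustrative assumptions; the output depends on this idealized setup and should be compared with Figure 10.35 only qualitatively.

```python
import math
import random

def biased_mean_vphi(sigma, v_c, n=200_000, seed=0):
    """Monte Carlo estimate of the u^3-weighted mean azimuthal velocity.

    Galactocentric velocities are drawn from an isotropic Gaussian of
    dispersion sigma with no net rotation.  Each star is weighted by u**3,
    where u = |v - v_sun| is its heliocentric speed and the Sun moves at
    v_c in the phi direction (solar motion w.r.t. the LSR neglected).
    """
    rng = random.Random(seed)
    num = den = 0.0
    for _ in range(n):
        v_r = rng.gauss(0.0, sigma)
        v_phi = rng.gauss(0.0, sigma)
        v_z = rng.gauss(0.0, sigma)
        u = math.sqrt(v_r ** 2 + (v_phi - v_c) ** 2 + v_z ** 2)
        w = u ** 3
        num += w * v_phi
        den += w
    return num / den

# Illustrative halo-like values in km/s: sigma / v_c = 0.5.
mean_vphi = biased_mean_vphi(sigma=110.0, v_c=220.0)
```

The unweighted mean of the same draws is zero to within sampling noise, so any non-zero value returned here is produced entirely by the selection weighting.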

Since the selection criteria have blanked off a part of velocity space, and 
the desired means and dispersions are over all velocity space, the necessary
corrections cannot, in principle, be determined from the data. In practice,
one uses a reasonable model to interpolate from the studied portions of
velocity space into the portion that was blanked off by the selection criteria. 

Samples of nearby high-velocity stars are particularly suited to the in- 
vestigation of the metallicities of high-velocity halo stars. Figure 10.36 shows 
plots of $v_\phi$ versus [Fe/H] for the samples of Nissen & Schuster (1989) and
Carney et al. (1996). At the top right corner of each panel there is a dense
clump of points that is centered on $([{\rm Fe/H}], \langle v_\phi\rangle) \simeq (-0.4, -40\ {\rm km\,s^{-1}})$.
Stars whose points fall in this clump are considered to be disk stars. In
both plots this clump is bounded at lower left by a lane of reduced stellar
density that slopes downwards from left to right and passes through the
points $([{\rm Fe/H}], \langle v_\phi\rangle) \simeq (-0.3, -225\ {\rm km\,s^{-1}})$ and $(-1.5, -50\ {\rm km\,s^{-1}})$. It is
significant that this lane is apparent in both plots because the two samples
plotted in Figure 10.36 have only 130 stars in common. Stars that lie below
and to the left of the lane are considered to be halo stars. Since the lane
does not run vertically, the halo and the disk overlap in metallicity, roughly
between [Fe/H] = −0.9 and [Fe/H] = −1.3.

Figure 10.36 [Fe/H] versus $v_\phi$ from Nissen & Schuster (1989) (left panel, 611 stars) and
Carney et al. (1996) (right panel, 1022 stars). [From data kindly supplied by B. Carney
and P. Nissen]

The full curve in Figure 10.37 shows the metallicity distribution of 372 
kinematically-selected halo MS stars. The distribution extends to lower val- 
ues of [Fe/H] than does the distribution of globular clusters plotted in Fig- 
ure 10.32. The dashed curve in Figure 10.37 shows that, on account of 
this fact, the leaky-box model of §5.3.2 can fit the data better than it can 
the metallicity distribution of the halo globular clusters. The prediction 
of the leaky-box model lies below the observed metallicity distribution at 
[Fe/H] $\gtrsim$ −1. This discrepancy indicates that the formation of the thick
disk involves processes that lie beyond the scope of the leaky-box model.
Presumably, the missing ingredients are the accumulation within the disk of
metal-rich debris from the formation of the halo, and the tendency for the
disk to form a closed rather than a leaky box.

678 Chapter 10: Components of the Milky Way

Figure 10.37 Full curve: a normalized, generalized histogram of
the metallicity distribution of 372 kinematically-selected halo MS stars.
Dashed curve: the distribution predicted by the leaky-box model with
$y_{\rm eff} = 0.025$. [After Ryan & Norris (1991) from data kindly supplied by
S. Ryan]

Since halo stars are much rarer in the solar neighborhood than disk stars, 
the detailed pattern of abundances in halo stars is harder to study than it 
is in disk stars. Notwithstanding the difficulties, Gilroy et al. (1988) have 
obtained high-resolution spectra of 20 halo stars and studied their elemental 
abundances in detail. They conclude that there are real variations from 
star to star in the relative abundances of the elements. In the most metal- 
weak stars, r-process elements are relatively more abundant than s-process 
elements. This result suggests that very massive and short-lived stars were 
relatively important contributors of heavy elements to the most metal-poor 
stars. 


10.6 Galaxy models 


Now that we have completed our survey of the structure of the Milky Way, we 
are ready to discuss models of the Galaxy. Such models play an important
role in astronomical research by facilitating the interpretation of observational
data. For example, one might use a model to estimate the location in
the Galaxy of an H II region from the wavelength of its emission lines, or one
might use a model to estimate how many stars of a given color and apparent
magnitude one would detect in a field at some given values of $l$ and $b$.

The construction of Galaxy models is also an important discipline in 
itself because it obliges one to assemble a coherent set of values of the various 
parameters that describe the Galaxy — the sizes, masses, shapes, and so 
forth, of each of its components. Many of these numbers must be inferred by 
rather indirect arguments, and there are frequently several different ways of 
estimating a given number. Building a model Galaxy forces one to decide on
a single value for each number, and to check that any relations that should
hold between these numbers are, in fact, valid.

Table 10.7 Some local parameters of the Milky Way

(km s$^{-1}$)   (km s$^{-1}$ kpc$^{-1}$)   ($M_\odot$ pc$^{-2}$)
$v_c(R_0)$   $A$    $B$      $\Sigma_{\rm tot}(R_0)$   $\Sigma_{\rm gas}(R_0)$   $\Upsilon_d(R_0)$
220          14.8   −12.4    48                        5.5                       2.3

Table 10.8 Location of Milky-Way parameters

Type of parameter                             Location
Distance to the Galactic center, $R_0$        Table 7.1
Photometric parameters, $R_d$ etc.            Table 10.1
Disk velocity dispersions, $\sigma_R$ etc.    Table 10.4
Tangent velocity as a function of $l$         Table 9.3
Halo velocity dispersions, $\sigma_r$ etc.    Table 10.6

Table 10.7 gives values for six of the Galactic parameters that relate 
to the solar neighborhood, while Table 10.8 indicates where the values of 
many other parameters are tabulated elsewhere in this book. Methods for 
determining all but one of these parameters have already been discussed 
elsewhere, and here we need only discuss $v_c(R_0)$, the local circular speed.


The local circular speed The obvious strategy for determining $v_c(R_0)$
is to measure the Oort constants and then to note from equations (10.23)
and (10.29) that $v_c(R_0) = R_0(A - B)$ [cf. equation (10.31)]. Using the values
of $A$ and $B$ determined by Feast & Whitelock (1997) from Hipparcos proper
motions [equation (10.36)], we find

$$v_c(R_0) = (218 \pm 8)(R_0/8\,{\rm kpc})\ {\rm km\,s^{-1}}. \qquad (10.71)$$
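As a quick arithmetic check of this relation, the sketch below evaluates $R_0(A - B)$ with $A = 14.8$ and $B = -12.4$ km s$^{-1}$ kpc$^{-1}$, which are taken here as rounded, assumed inputs. The function name is illustrative, and the result differs slightly from the quoted central value because of the rounding.

```python
# Oort constants in km/s/kpc; rounded values assumed for illustration.
A = 14.8
B = -12.4

def circular_speed(r0_kpc, a=A, b=B):
    """v_c(R_0) = R_0 (A - B), valid if stars stream on circular orbits."""
    return r0_kpc * (a - b)

v_c = circular_speed(8.0)       # km/s for R_0 = 8 kpc
v_c_85 = circular_speed(8.5)    # the estimate scales linearly with R_0
```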


The Oort constants describe the streaming of stars in a small patch of 
the disk around the Sun, so it is surprising that we can determine from them 
the rate at which the entire solar circle rotates. What allows us to make this 
leap from the small- to the large-scale is our hypothesis that stars stream 
precisely on circles. In §10.3.3 we saw that this hypothesis is supported
observationally by the fact that the quantities $C$ and $K$ defined by equation
(10.18) are zero to within the errors. Nonetheless, we know that the Galaxy is
not exactly axisymmetric, if only because it has a bar at the center and spiral
structure further out. So we should feel uncomfortable about estimating a
parameter as fundamental as $v_c(R_0)$ from equation (10.31) alone.

The measurement of the proper motion of Sgr A* mentioned at the end
of §10.3.3 provides a direct route to $v_c(R_0)$. From equation (10.38) we have

$$v_c(R_0) = (241 \pm 7)(R_0/8\,{\rm kpc})\ {\rm km\,s^{-1}}. \qquad (10.72)$$

In connection with this estimate, it should be stressed that the quoted error
is a formal one: it takes no account of the possibility that Sgr A* has a
non-zero peculiar velocity.

Prior to the advent of the Hipparcos catalog, $B$ was so uncertain as to
make equation (10.31) of little use in the determination of $v_c(R_0)$. Moreover,
it is only recently that accurate VLBI measurements of the proper motion
of Sgr A* have been available. Consequently, astronomers have traditionally
had to estimate $v_c(R_0)$ from a variety of more-or-less unsatisfactory arguments.
These include: an argument based on the assumption that the system
of halo globular clusters and RR Lyraes has no net rotation; an argument
based on the speeds of the fastest stars seen near the Sun (Problem 10.16);
an argument that requires an estimate of the systemic velocity of the Milky
Way with respect to the rest of the Local Group; and the use of the $AR_0$
formula (Box 10.3). Since none of these methods is competitive with the two
methods described above (both of which ultimately exploit radio-frequency 
VLBI), we shall say no more about them here. 


Mass models Several distinct types of model can be identified. Schmidt
(1956) and Caldwell & Ostriker (1981) pioneered the most basic type of
model, mass models. A mass model consists of specified density distributions
for one or more components of the Galaxy. The combined gravitational
force-field of these components is calculated and the parameters of the density
distributions are adjusted to optimize the agreement between the kinematic
predictions of the model and observables such as the values of the Oort
constants, the run of terminal velocities with radius, the total surface density of
matter near the Sun, $\Sigma_{\rm tot}(R_0)$, and so on.

In the mass models of Dehnen & Binney (1998b), the Galaxy is represented
by a bulge, a halo and three disks: thin and thick stellar disks and
the disk of the ISM. The density distributions of the two stellar disks are
given by the first of equations (10.3) with $R_d$ the same for both disks and
$z_0 = 180$ pc and $z_0 = 1$ kpc for the thin and thick disks, respectively. The
gas disk is assumed to contribute 25% of the local surface density and to
have a value of $R_d$ twice as great as that for the stellar disk, while its scale
height is small: $z_0 = 40$ pc. The gas disk is assumed to have a hole at its
center. The density distribution of the bulge is assumed to be given by the 
second of equations rmax = 1.9kpc and cp = 0. 

The density distribution of the halo is taken to be of the form

$$\rho(R, z) = \rho_0 \left(\frac{m}{r_0}\right)^{-\gamma} \left(1 + \frac{m}{r_0}\right)^{\gamma - \beta} e^{-m^2/r_t^2}, \quad \text{where } m \equiv (R^2 + z^2/q^2)^{1/2}. \qquad (10.73)$$

Hence the density is proportional to $r^{-\gamma}$ for $r \ll r_0$ and
proportional to $r^{-\beta}$ for $r \gg r_0$. Results are given for two values of the axis
ratio $q$: 0.8 and 0.3. The exponents are constrained to satisfy $-2 \le \gamma \le 1.8$
and $\beta \ge 1$ to limit the sharpness of the inner and outer edges of the halo,
respectively.
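A flattened double power-law halo of this kind is easy to evaluate numerically. The sketch below assumes the form $\rho = \rho_0 (m/r_0)^{-\gamma} (1 + m/r_0)^{\gamma - \beta} e^{-m^2/r_t^2}$ with $m = (R^2 + z^2/q^2)^{1/2}$; the outer truncation radius `r_t` is treated as optional, and the function name and the parameter values in the example are hypothetical rather than fitted values from any published model.

```python
import math

def halo_density(R, z, rho0, r0, gamma, beta, q, r_t=math.inf):
    """Flattened double power-law density with optional Gaussian truncation.

    Behaves as m**-gamma for m << r0 and as m**-beta for r0 << m << r_t,
    where m = sqrt(R**2 + (z/q)**2).  Not defined at m = 0 when gamma > 0.
    """
    m = math.sqrt(R * R + (z / q) ** 2)
    x = m / r0
    return (rho0 * x ** (-gamma) * (1.0 + x) ** (gamma - beta)
            * math.exp(-(m / r_t) ** 2))

# Hypothetical parameters, lengths in kpc, density in arbitrary units.
params = dict(rho0=1.0, r0=5.0, gamma=1.0, beta=3.0, q=0.8)

inner_ratio = halo_density(1e-3, 0.0, **params) / halo_density(2e-3, 0.0, **params)
outer_ratio = halo_density(1e5, 0.0, **params) / halo_density(2e5, 0.0, **params)
```

Doubling the radius well inside `r0` changes the density by a factor close to $2^{\gamma}$, and well outside `r0` by a factor close to $2^{\beta}$, which is one way to confirm that the two exponents play the roles described in the text.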

Figure 10.38 Circular-speed curves of four mass models. In each panel the full
curve shows $v_c(R)$ for a complete model, while the other curves show contributions:
short-dashed, bulge; dotted, disks; dot-dashed, disks plus bulge; long-dashed, halo.
All models assume $R_0 = 8$ kpc and halo axis-ratio $q = 0.8$. From top to bottom the
values of $R_d/R_0$ are 0.25, 0.3, 0.35 and 0.4. [After Dehnen & Binney (1998b)]

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The free parameters of the model (the values of the scale lengths and 
surface densities of the disks, the normalization of the bulge, and the nor- 
malization, scale length and exponents of the halo) are constrained by the 
following observational data: 

1. the observed terminal velocities (Figure 9.16);
2. the run of W(R) at R > R_0 (Figure 9.18);
3. the values of the Oort constants [equation (10.36)];
4. the total surface density Σ_1.1(R_0) [equation (10.54)];
5. the contribution of the disks to Σ_1.1, which was required to exceed
   40 M⊙ pc^-2;
6. the velocity dispersion of bulge stars in Baade’s Window (§10.2.4);
7. the total mass inside 100 kpc, which was taken to be (7 ± 2.5) × 10^11 M⊙.


Figure 10.38 illustrates the fact that very different mass models prove
compatible with these constraints. Each panel shows the circular speed of a
model (full curve) and the contributions to it from the bulge (short-dashed
curve), the disks (dotted curve) and the halo (long-dashed curve). The top
model has a small disk scale length (R_d = 2 kpc) and its circular speed is
dominated by the disks and bulge out to R ≈ 10 kpc. (The dot-dashed curves
show the contribution to v_c from bulge and disks together.) By contrast, the
bottom model has R_d = 3.2 kpc and a circular-speed curve that is dominated
by the halo at all radii. In fact, as one proceeds from top to bottom in
Figure 10.38, R_d steadily increases, and this causes mass to move from the
disks, first to the bulge and later to the halo. What drives this transfer of
mass is the fact that the shape of the terminal-velocity curve in Figure 9.16 is
appropriate for disks with R_d/R_0 ≈ 0.25 rather than any larger value. Since
the terminal velocities have been measured with high precision, they carry
great weight in the determination of the model’s parameters, with the result
that a component that can fit all terminal velocities at once receives plenty
of mass. If one specifies a value of R_d/R_0 that does not suit the terminal
velocities, the fitting procedure brings the bulge and the halo into play so
that it can make up the terminal velocities from a suitable combination of all
three components.
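Since each component contributes additively to the radial force, the contributions combine in quadrature: v_c^2 is the sum of the squares of the bulge, disk and halo curves. A minimal sketch of this bookkeeping follows; the function name and the three speeds are illustrative values of ours, not numbers read from Figure 10.38.

```python
import math

def total_circular_speed(*component_speeds):
    """Combine component circular speeds (km/s) in quadrature."""
    return math.sqrt(sum(v ** 2 for v in component_speeds))

# Hypothetical contributions at one radius: bulge, disks, halo.
v_bulge, v_disks, v_halo = 120.0, 150.0, 100.0
v_total = total_circular_speed(v_bulge, v_disks, v_halo)

# Share of the radial force that each component supplies at this radius.
force_fraction_halo = v_halo ** 2 / v_total ** 2
```

This is why a fit can trade mass between components: only the sum of the squared contributions is constrained at each radius.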

Notice that there are significant differences between the four circular-speed
curves in Figure 10.38 at R > R_0. These differences reflect the fact that
v_c has not been accurately measured at R > R_0; in fact, from Figure 9.18
it is clear that the data are essentially non-existent for R > 2R_0.

By showing that the inner Galaxy can be dominated by either the disks
or a nearly spherical halo, Figure 10.38 indicates that we know next to nothing
about the Galaxy’s vertical distribution of mass. This ignorance arises
because the only constraints that bear on this distribution are items (4)
and (5) above, and these constraints apply too close to the plane to provide
much information about what lies kiloparsecs above the plane. In the coming
years, this crucial gap in our knowledge may be filled by dynamical models of
the Galaxy (see below), and by studies of gravitational lensing by disk galaxies.


Star-count models In §10.1 we discussed models in which each compo- 
nent of the Galaxy consists of a smooth distribution of luminosity. Models 
of this type are suitable for analyzing data from satellites that use small 
telescopes to measure the Galaxy’s surface-brightness distribution with low 
angular resolution, but are of limited value for studies that employ large- 
aperture telescopes. In connection with such studies, one needs a model 
that predicts star counts, such as those described in §10.4.3. A model with 
this capability specifies not merely how much luminosity is produced at each 
point in the Galaxy, but the precise distribution over absolute magnitude of 
the stars that produce the light. That is, a star-count model specifies for all
M and x the luminosity function, Φ(M, x), defined by equation (3.20).

Star-count models, which are reviewed by Bahcall (1986), were pioneered
by Bahcall & Soneira (1980). They specified Φ(M, x) by decomposing the
Galaxy into two components and assigning a position-independent luminosity
function Φ(M) to each component. Since the proportion of the total
luminosity at x that comes from each component varies with x, this procedure
makes the overall luminosity function position-dependent. Bahcall &
Soneira (1984) concluded that star-count data for five widely dispersed fields
in several wavebands are consistent with standard disk and globular-cluster
luminosity functions provided:

1. the luminosity density of the disk is given by the first of equations (10.3)
   with R_d > 2.5 kpc, and z_0 = 350 ± 50 pc for MS stars fainter than
   M_V = 5, and z_0 = 250 ± 100 pc for more luminous stars;

2. the luminosity density of the halo obeys the R^{1/4} law [equation (4.18)]
   with effective radius R_e ≈ R_0/3 and axis ratio q ≈ 0.8, and is normalized
   such that near the Sun one star in ~500 is a halo star.
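The way in which position-independent luminosity functions combine into a position-dependent one can be sketched as follows. Everything here is a toy under stated assumptions: the disk is taken to be a double exponential in R and |z|, the halo is a simple power law standing in for the R^{1/4} law, and the function names, the power-law index and the scale values are ours.

```python
import math

R0 = 8.0  # assumed solar radius in kpc

def disk_density(R, z, Rd=3.0, z0=0.35):
    """Toy double-exponential disk, normalised to 1 at the Sun."""
    return math.exp(-(R - R0) / Rd - abs(z) / z0)

def halo_density(R, z, local_fraction=1.0 / 500.0):
    """Toy power-law halo (a stand-in, not the R^{1/4} law)."""
    r = math.sqrt(R ** 2 + z ** 2)
    return local_fraction * (r / R0) ** -3.5

def luminosity_function(M, R, z, phi_disk, phi_halo):
    """Phi(M, x): density-weighted sum of position-independent functions."""
    return disk_density(R, z) * phi_disk(M) + halo_density(R, z) * phi_halo(M)

def halo_fraction(R, z):
    """Fraction of stars at (R, z) that belong to the halo."""
    n_d, n_h = disk_density(R, z), halo_density(R, z)
    return n_h / (n_d + n_h)

near_sun = halo_fraction(R0, 0.0)   # about one star in 500
high_up = halo_fraction(R0, 5.0)    # halo dominates 5 kpc above the plane
```

Even though each component keeps the same Φ(M) everywhere, the mix changes from almost pure disk in the plane to almost pure halo a few kiloparsecs above it, so the total Φ(M, x) varies strongly with position.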


Bahcall & Soneira (1984) and Bahcall (1986) argue that star counts in dif- 
ferent colors are compatible with the existence of a thick disk only if the 
luminosity function of this component is similar to that of the halo. 


Kinematic models Many studies involve either proper motions or radial 
velocities in addition to positions and apparent magnitudes. Consequently, 
there is a need for models that predict the velocity distribution of stars at 
each point in the Galaxy, in addition to the number density of stars as a 
function of absolute magnitude. 

The simplest models with this capability assume that at each point x
the velocity distribution of any given component can be represented by the
Schwarzschild distribution [equation (10.17)]. In general, this Schwarzschild
distribution must be assumed to be centered on a mean-streaming velocity v̄
that depends on x, and the principal velocity dispersions σ_i will also depend
on x. We call such a model a kinematic model to distinguish it from models
that explicitly take into account dynamical constraints (see below). Ratnatunga,
Bahcall & Casertano (1989) describe a kinematic model and use it to predict
for several directions the distribution of proper motions of stars listed in the
Bright Star Catalog (Appendix B).
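To make concrete what such a model must supply, the sketch below evaluates a Schwarzschild (triaxial Gaussian) velocity distribution at a single point. We assume that the velocity components are already measured along the principal axes of the velocity ellipsoid; the function name and the numerical values of the mean velocity and dispersions are illustrative, not fitted values.

```python
import math

def schwarzschild_pdf(v, v_mean, sigma):
    """Triaxial Gaussian velocity density at one point x.

    v, v_mean, sigma are 3-tuples in km/s along the principal axes of
    the velocity ellipsoid; the result is a probability per (km/s)^3.
    """
    norm = 1.0
    exponent = 0.0
    for v_i, mean_i, sigma_i in zip(v, v_mean, sigma):
        norm *= math.sqrt(2.0 * math.pi) * sigma_i
        exponent += (v_i - mean_i) ** 2 / (2.0 * sigma_i ** 2)
    return math.exp(-exponent) / norm

# Hypothetical values for one component at one position.
v_mean = (0.0, 200.0, 0.0)
sigma = (35.0, 25.0, 20.0)

peak = schwarzschild_pdf(v_mean, v_mean, sigma)
one_sigma = schwarzschild_pdf((35.0, 200.0, 0.0), v_mean, sigma)
```

A full kinematic model would replace the constants `v_mean` and `sigma` by functions of x, and would also have to specify the orientation of the principal axes at each point.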


Dynamical models An accurate kinematic model is exceedingly hard to
construct because v̄, the σ_i and the orientation of the axes of the velocity
ellipsoid all have to be specified as functions of x. Moreover, Schwarzschild’s
ellipsoidal hypothesis is only an approximation to the truth, so after an
immense amount of work specifying the model, one is liable to be left with
an inadequate representation of the Galaxy.

If we assume that the Galaxy is in an approximately steady state, the
value of v̄ at one point, x, cannot be varied independently of the value
of v̄ at neighboring points, x’, because stars that are at x at one instant
are at x’ somewhat later. This argument suggests that, in a steady-state
Galaxy, v̄ and the σ_i satisfy differential equations. §4.2 of BT describes the
Jeans equations for v̄ and σ_i that would hold in a satisfactory model. If
there were a clean procedure for finding solutions of the Jeans equations, this
procedure would eliminate most of the freedom that we have when specifying
a kinematic model. We might then, at acceptable cost, be able to find a model
that provided a reasonable representation of the Galaxy. Unfortunately, the
Jeans equations do not pose a tractable boundary-value problem, so they
have proved of limited value in the construction of Galaxy models.

A more promising, but still undemonstrated, way to incorporate dynamical
constraints in Galaxy models is based on the ‘Jeans theorem’ that
is discussed in §4.4 of BT. Dehnen & Binney (1996) describe this approach
and present some preliminary results. An alternative, conceptually simpler,
approach is to construct an N-body model that looks like the Galaxy; see
Sellwood (1993) or Fux (1998) for promising examples of this procedure.
The central difficulty with this approach is the problem of choosing initial
conditions from which the N-body model will relax to a configuration like
that of the Milky Way.


10.7 Formation and evolution of the Milky Way 


What can we infer from the observations of disk, halo and bulge that we have 
studied in this chapter about the history of the Milky Way? Astronomers 
have now debated this question for several decades without reaching a con- 
sensus. In view of this unsatisfactory situation, we cannot present an au- 
thoritative discussion of this fascinating topic. We start by presenting some 
of the more frequently cited lines of argument. We then outline what seems 
to us the most likely route by which the Milky Way arrived at its present 
state, and discuss some respects in which this picture pulls together disparate 
lines of evidence from observations of both the Milky Way and of external 
galaxies. 


10.7.1 Formation scenarios 


Discussion of the formation of the Milky Way has been profoundly influenced 
by a paper by Eggen, Lynden-Bell & Sandage (1962). By studying model 
orbits of solar-neighborhood stars for which space velocities had been
determined, these authors reformulated the classical conclusion of Roman (1955)
that high-velocity stars near the Sun tend to be metal-poor.¹⁶ They showed
that the eccentricities, e, energies of vertical oscillation, E_z, and angular
momenta, L_z, of their model orbits are correlated with stellar metallicity in
the sense that as metallicity decreases from solar, e and E_z increase, while
L_z decreases.¹⁷ From these correlations Eggen et al. inferred that either the
Milky Way had a violent history, or most low-metallicity stars cannot have 
formed in a centrifugally supported disk. The argument from which they 
drew this conclusion is as follows. 

Both e and L_z are ‘adiabatic invariants’ in the sense that their values for
a given star change negligibly if the potential in which the star moves changes 
slowly — see §3.6 of BT for more detail. Hence, either low-metallicity stars 
formed on orbits that have large values of e, or the potential of the Milky 


16 The metal-poverty of a star was inferred from its ultra-violet excess — see §5.1.5. 

17 Since the possession of a large velocity with respect to the Sun is synonymous for a 
solar-neighborhood star with moving on either an eccentric or a highly inclined orbit, the 
correlations of Eggen et al. did not represent an important advance on the work of Roman 
(1955). 
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Way has changed rapidly at some time since they formed. In the former case 
these stars cannot have formed in a centrifugally supported disk, and in the 
latter case the Milky Way has had a violent history. 

On the strength of these observations, Eggen et al. presented a picture 
of the formation of the Galaxy. In this picture, the Milky Way formed from 
an approximately spherical, spinning protogalactic cloud. Initially this cloud 
was metal-poor and in near free-fall. As it collapsed, its rate of spin increased 
to conserve its angular momentum, and from it condensed the most metal- 
poor stars and halo globular clusters. The present eccentric orbits of these 
objects were a direct consequence of the free-fall of the protogalactic cloud. 
The approximately coeval nature of globular clusters (§6.1.4) was believed to 
be a consequence of the rapidity of this collapse. Supernovae increased the 
metallicity of the cloud as it collapsed. After shrinking in radius by a factor 
of order 10, the cloud became metal-rich and flattened into a centrifugally 
supported disk. At this point disk formation commenced and the Milky Way 
settled down to something like its present configuration. 

A very different picture of the formation of the Milky Way’s halo was 
suggested by Searle (1977). This picture was inspired by the discovery that 
the distribution of metallicities among halo globular clusters is wide and 
approximately independent of Galactocentric radius (see §10.5.1). This
phenomenon would be expected if the halo formed from a large number of
independent fragments of mass ~10^8 M⊙, which evolved independently of one
another. The metallicity of a given fragment would be determined by the
number of ‘enrichment events’ that took place within it before supernovae 
swept it clean of gas. Searle envisaged an enrichment event to consist of 
the formation of an entire globular cluster. This assumption ensured that 
each fragment experienced only a handful of enrichment events, and Pois- 
son statistics in the number of enrichment events that occurred in different 
fragments gave rise to significant scatter in the fragments’ final metallicities. 
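The size of this scatter follows from Poisson statistics alone: if a fragment experiences on average μ enrichment events, the fractional scatter in the number of events is 1/√μ. The sketch below verifies this by summing the Poisson distribution directly. The assumption that a fragment's final metallicity scales with its number of events is ours, made only for illustration, as are the function names.

```python
import math

def poisson_pmf(k, mu):
    """Poisson probability of k events, computed in logs to avoid overflow."""
    return math.exp(-mu + k * math.log(mu) - math.lgamma(k + 1))

def relative_scatter(mu):
    """Standard deviation over mean of the number of enrichment events."""
    k_max = int(mu + 20.0 * math.sqrt(mu) + 20.0)   # generous upper cut-off
    ks = range(k_max + 1)
    weights = [poisson_pmf(k, mu) for k in ks]
    mean = sum(k * w for k, w in zip(ks, weights))
    var = sum((k - mean) ** 2 * w for k, w in zip(ks, weights))
    return math.sqrt(var) / mean

few_events = relative_scatter(4.0)      # a handful of events: 50% scatter
many_events = relative_scatter(400.0)   # many small events: 5% scatter
```

Identifying an enrichment event with the formation of a whole globular cluster keeps μ small, which is what makes the predicted scatter in metallicity large.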

Since the papers of Eggen, Lynden-Bell & Sandage (1962) and Searle 
(1977), N-body simulations of gravitational clustering in an expanding Uni- 
verse have heavily influenced the conventional picture of galaxy formation. In 
these simulations, stellar systems are constantly merging to form larger ones. 
It is very tempting to identify Searle’s fragments with the stellar systems that 
early on merged to form the Milky Way. In simulations, as fragments merge, 
the potential of the forming Galaxy is very far from axisymmetric and the 
angular momenta of individual stars are by no means conserved, contrary to 
the assumption of Eggen et al. Hence, in its original form, the picture of
Eggen et al. is largely obsolete. 

It would, however, be a mistake to conclude that galaxies such as the 
Milky Way formed simply by the merging of pre-existing stellar systems. In 
particular, whatever may be the history of the halo, the disk certainly did 
not form in this way. Nor is it likely that stars of the thick disk and disk 
globular clusters formed outside the Galaxy as the simplest merger picture 
envisages. The origin of the bulge is very uncertain, but it is likely that the
material of which it is made entered the Galaxy as gas rather than as stars.
Thus inasmuch as Eggen et al. emphasized the formation of stars from gas
that had already been incorporated into the Galaxy, their picture describes
an important aspect of what actually happened.

Objects that look remarkably like young globular clusters have formed 
recently in the LMC. The LMC is known to be spiraling into the Milky 
Way, and will eventually be absorbed by our Galaxy (see §7.1 of BT). Once 
the LMC has been absorbed, its globular clusters will become Milky-Way 
globular clusters. Could all the Milky Way’s globular clusters have been 
acquired by the shredding of low-luminosity galaxies such as the LMC? We 
do not know the answer to this question, but there is some intriguing evidence 
to suggest that both globular clusters and field halo stars have been accreted 
(§6.1.4). 

In §5.3.2 we saw that the temperature of interstellar gas is unlikely to
lie in the interval 10^4 K < T < few × 10^6 K. Whatever its origin, gas that
cools to ≲ 10^4 K will sink inwards in the potential of a galaxy such as the
Milky Way until it forms a centrifugally supported disk: encounters between
gas clouds are inelastic, so clouds must sink inwards until encounters become
unimportant, which is possible only if all clouds are on similar circular orbits.

As gas accumulates in the disk, the latter is liable to become gravi- 
tationally unstable and to form stars. It is natural to expect stars to form 
from the densest gas clouds, and these clouds move on the most nearly circu- 
lar orbits (§9.2.6). Hence we expect, and observations confirm (§10.4), that 
stars form in the plane on nearly circular orbits. Now it is certain that stars 
will in time diffuse away from the thin layer in which they form, because a 
thin stellar disk is a fragile object. Indeed, in such a disk all orbits must 
be highly circular and confined to a common plane. Any random process 
that is capable of perturbing the orbits of stars is liable to heat and thicken the
disk by disturbing this highly ordered state. §7.5 of BT describes two such 
randomizing processes, namely scattering by molecular clouds and by spiral 
arms. Figure 10.22 presents observational evidence that these processes are 
at work. The best-available estimates (Jenkins 1992) suggest, however, that 
they are not powerful enough to effect the formation of the thick disk. 

A widely canvassed origin of the thick disk appeals to the accretion 
of low-luminosity stellar systems by galaxies such as the Milky Way. As 
a satellite galaxy spirals through the outer reaches of the Milky Way, its 
orbit is dragged into the plane (Quinn & Goodman 1986). Then as it spirals 
through the disk it can, in principle, convert a thin disk into a thick disk 
(Walker, Mihos & Hernquist 1996).¹⁸ Thus the following may be the origin
of the thick disk. About 10 Gyr ago most of the stars of the thick disk formed 


18 In this connection it is worth remarking that the fragility of a thin disk is such that it 
can be appreciably thickened by even a low-mass satellite galaxy (Toth & Ostriker 1992). 
In particular, the satellite’s stars could now be inconspicuous amid the more numerous 
stars of the thickened disk. 
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a thin disk. Then one or more dwarf galaxies were accreted by the Milky 
Way, and, as they spiraled through the disk, they thickened it. Subsequently 
a new thin disk formed within the thick disk from gas that had settled into 
the plane. 

We have seen that there is an intriguing similarity between the physical 
parameters of the thick disk and the disk globular-cluster system. Does this 
similarity reflect a similar physical origin? We do not know the answer to 
this question, but there is some evidence that globular clusters form when a 
gaseous disk is strongly disturbed (Zepf & Ashman 1993). Thus, it is possible 
that the disk globular clusters were formed from the Galaxy’s gaseous disk 
as the satellite(s) spiraled in, churning up the disk. This conjecture explains 
naturally why the metallicities of the disk globular clusters are narrowly
confined to the value that is characteristic of the thick disk ([Fe/H] = −0.6; see
Figure 10.32). Moreover, so long as the mass of the perturber is larger than
the mass of an individual globular cluster, this hypothesis also explains
why the disk globular clusters are distributed through the same volume as 
the thick disk stars: in these circumstances objects less massive than the 
perturber are scattered independently of their mass. 


Finally, what is the origin of the bulge? The first point to make is 
that both the thick disk and the innermost halo are likely to make signif- 
icant contributions to the luminosity that comes from the volume that we 
conventionally associate with the bulge. This fact is bound to complicate 
any discussion of the bulge. However, from observations in Baade’s Window 
(§10.2) it is clear that the bulge is more than a superposition of the thick disk 
and the halo, because it contains mostly stars that are more metal-rich than 
even the thick disk. It seems likely that much of the heavy-element content 
of the bulge was synthesized in stars that do not belong to the bulge, but 
to the halo or the disks. Metal-rich gas ejected by these stars would natu- 
rally have fallen onto the bulge and, at some stage, bulge stars formed that 
incorporated some of this accreted gas. 

The discovery that the bulge is non-axisymmetric must have a significant 
bearing on its origin. One relevant dynamical fact is that a disk in which
the radial velocity dispersion becomes large is liable to buckle and thicken
(Combes & Sanders 1981, Raha et al. 1991, Kalnajs 1996). This process 
might conceivably have formed the bulge from the center of the thin disk, 
and indeed there is some evidence that this kind of evolution is taking place 
in some external galaxies (§4.4.7). One possible cause of the large velocity 
dispersion required to buckle a disk is a bar that has formed in the disk 
and then ramped up the eccentricities of disk stars’ orbits. However, at the 
present time too little is understood about the structure of the bulge and 
the dynamics of bars to make further speculation on the origin of the bulge 
profitable.

Figure 10.39 The metallicity distribution of the more metal-poor globular
clusters shown in Figure 10.32 (histogram) is reasonably well fitted
by the prediction [equation (5.43)] of the leaky-box model with effective
yield p_eff = p/(1 + c) = 5.7 × 10^-4 (curve). (Horizontal axis: [Fe/H].)


10.7.2 Models of the chemical evolution of the Milky Way 


A measure of support for the theoretical framework outlined above can be 
deduced from the theory of chemical enrichment developed in §5.2. We start
by using the leaky-box model of §5.3.2 to model the chemical evolution of 
the halo. 


Chemical evolution of the halo Figure 10.39 compares the observed 
metallicity distribution of the halo globular clusters with that predicted by 
the leaky-box model [equation (5.43)]. The fit is moderately good with
effective yield p_eff = p/(1 + c) = 5.7 × 10^-4.

Pre-enrichment One clear implication of the picture developed above is 
that the present thin disk formed from material that was relatively metal- 
rich: we might reasonably conjecture that the ISM from which it started 
to form was as metal-rich as the thick disk. To test this conjecture, let us 
investigate the consequences of assuming that, on the contrary, the thin disk 
formed from metal-poor gas. 

At the end of §5.3.1 we showed that the closed-box model of chemical 
evolution predicts that one half of all stars near the Sun have less than a 
third of the metallicity of the most metal-rich stars. Since the latter have 
metallicities comparable to that of the Sun, the closed-box model implies 
that a half of solar-neighborhood stars should have Z ≲ Z⊙/3. Actually only
~2% of disk F and G stars in the solar neighborhood have Z < 0.25 Z⊙
(Sommer-Larsen 1991). This contradiction between the standard closed-box
model and observation is known as the G-dwarf problem.¹⁹ Evidently one
of the assumptions of the standard closed-box model is incorrect. 
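The size of the discrepancy is easily quantified. In the standard closed-box model with constant yield, Z = p ln[M_g(0)/M_g], so the mass of stars with metallicity below Z is M_g(0)[1 − exp(−Z/p)]. If the gas has now been reduced to a fraction x of its initial mass and has metallicity Z_1, the fraction of stars with Z < αZ_1 is (1 − x^α)/(1 − x). The sketch below evaluates this expression; the function name and the gas fraction x = 0.1 are our illustrative assumptions.

```python
def closed_box_fraction(alpha, gas_fraction):
    """Fraction of stars formed so far that have Z < alpha * Z_1.

    gas_fraction is M_g(now) / M_g(0); Z_1 is the present gas metallicity.
    """
    return (1.0 - gas_fraction ** alpha) / (1.0 - gas_fraction)

# Assumed present-day gas fraction of 10%.
frac_below_third = closed_box_fraction(1.0 / 3.0, 0.1)
frac_below_quarter = closed_box_fraction(0.25, 0.1)
```

With these assumptions roughly half of all stars should lie below a quarter to a third of the present metallicity, whereas the observed fraction below 0.25 Z⊙ is about 2%.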

Consider first the possibility that the yield p is not independent of Z, and 
hence that equation (5.37) is not the correct integral of equation (5.36). One 
way in which this might happen is if the IMF were metallicity-dependent. 
For example, if no low-mass stars are born when Z < 0.25 Z⊙, the G-dwarf


19 Mould (1982) has argued that a similar result applies to M dwarfs, although the data 
for M stars are less reliable because the ultraviolet excess δ(U − B) (see Box 5.4) cannot
be used as a metallicity indicator for these stars.

problem would be resolved within the framework of the closed-box model.
One problem with this proposal is that it implies that the yield p decreases 
with increasing Z, which is the reverse of the trend that is required if the 
closed-box model is to account satisfactorily for metallicity gradients within 
galactic disks (Peimbert & Serrano 1982). A second, and perhaps more 
serious, problem was pointed out by Thuan, Hart & Ostriker (1975), who 
examined a two-parameter family of closed-box models in which the IMF 
— and therefore the yield — as well as the overall rate of star formation are 
continuous functions of time. They found that such closed-box models can 
reproduce the observed scarcity of low-metallicity dwarf stars only if the 
IMF at early times is so strongly biased against the formation of low-mass 
stars that the rate of formation of these stars actually increases in time. 
Unfortunately, an increasing rate of formation of low-mass stars corresponds 
to an increasing rate of consumption of interstellar gas, so the closed-box 
models that are compatible with the stellar metallicity distribution predict 
that the interstellar gas will be exhausted within the next Gyr or so. It seems 
unlikely that we are privileged to live so close to the end of the epoch during 
which our galaxy forms new stars. Thus, the scarcity of metal-poor stars 
near the Sun obliges us to drop at least one of the other central assumptions 
of the standard closed-box model — either that the gas is initially metal-free 
or that gas neither enters nor leaves any particular annulus of the Galaxy. 

In §5.3.3 we showed how the scarcity of metal-poor stars near the Sun 
can be naturally explained within the framework of the accreting-box model 
of chemical evolution. Here we show that it may be just as naturally ex- 
plained by supposing that the thin disk inherits heavy elements from both 
the halo and the thick disk. As a crude short-hand, we refer to these two 
components as the ‘spheroid’. 

If the gas from which the Galactic disk formed started from metallicity
Z_i, the integral of equation (5.36) becomes

    Z(t) = Z_i + p \ln\left[\frac{M_g(0)}{M_g(t)}\right],    (10.74)


and there will be no disk stars with Z < Z_i. Furthermore, when we insert
the observed value of M_g(0)/M_g(t_0) and Z_i = 0.25 Z⊙ into this equation, we
deduce p ≈ 0.007, in better agreement than our previous solar-neighborhood
estimate, p ≈ 0.009, with the value that is deduced from observations of
Magellanic irregulars (§8.2.6). Therefore, we may retain many of the attractive
features of the closed-box model while clearing up the G-dwarf problem,
if we can explain why Z_i ≈ 0.25 Z⊙ in the solar neighborhood, and Z_i ≈ 0 in
Magellanic galaxies.
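The arithmetic behind these two yield estimates can be reproduced in a few lines. The sketch assumes Z⊙ = 0.02 and a present-day gas metallicity equal to solar, and it infers the gas-mass ratio from the earlier estimate p ≈ 0.009 rather than taking it from data; these are our assumptions, as is the function name.

```python
import math

Z_SUN = 0.02  # assumed solar metallicity by mass

def closed_box_yield(z_now, z_initial, gas_ratio):
    """Yield p from Z(t) = Z_i + p ln[M_g(0)/M_g(t)].

    gas_ratio is M_g(0) / M_g(t_0).
    """
    return (z_now - z_initial) / math.log(gas_ratio)

# Gas ratio implied by p = 0.009 when the disk starts metal-free.
gas_ratio = math.exp(Z_SUN / 0.009)

p_metal_free = closed_box_yield(Z_SUN, 0.0, gas_ratio)
p_pre_enriched = closed_box_yield(Z_SUN, 0.25 * Z_SUN, gas_ratio)
```

Starting from Z_i = 0.25 Z⊙ lowers the required yield by a quarter, from 0.009 to about 0.007, because only three quarters of the present metallicity then has to be produced within the disk.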

One difference between our galaxy and the Magellanic Clouds is that 
our galaxy has a significant spheroid and the Magellanic Clouds do not. In 
§5.4 we argued that the mass, M_h, of metals produced by an old stellar
population and the present bolometric luminosity L_bol of the population are
in the ratio M_h/L_bol ≈ (0.16 ± 0.05) M⊙ L⊙^-1 [equation (5.70)]. If we apply
this relation to the Galactic bulge, and assume that the disk was endowed
with a fraction f of these metals at its birth, we conclude that the initial
metallicity of the disk was

    Z_i = \frac{f M_h}{M_d} \simeq \frac{0.16 f}{\Upsilon_{d,\rm bol}} \frac{L_{\rm bulge}}{L_{\rm disk}},    (10.75)

where Υ_d,bol is the bolometric mass-to-light ratio of the disk and the ratio of
the luminosities of bulge and disk is L_bulge/L_disk ≈ 0.25 (Table 10.1). Thus
equation (10.75) implies


    Z_i \simeq 0.04 \frac{f}{\Upsilon_{d,\rm bol}} \simeq 2 \frac{f}{\Upsilon_{d,\rm bol}} Z_\odot.    (10.76)

Equation (10.57) suggests that Υ_d,bol lies in the range 1 to 2, so the G-dwarf
problem would be resolved if a fraction f ≈ 0.2–0.3 of the metals made in
the bulge polluted the primordial disk.

When applied to external galaxies, this simple model of pre-enrichment 
of the thin disk by the spheroid has two interesting consequences. First, since 
spheroids are more centrally concentrated than disks, Z; should be higher 
near the center of a spiral galaxy than at very large radii. Hence we expect 
the present value of Z to increase toward the centers of disks even more 
rapidly than equation (5.37) would suggest. That is, this model predicts that 
the effective yield p_eff [equation (8.62)] should increase towards the centers of
galaxies. In §8.2.6 we saw that this increase is exactly what is observed. We
also know that most of the luminosity of a rich cluster of galaxies such as the
Coma cluster comes from elliptical galaxies and the spheroids of S0 galaxies.
One possible explanation of why these galaxies did not form disks is that
encounters between neighboring galaxies tore the proto-disk material out of
the galaxies before the gas could fragment into stars (Binney & Silk 1978).
Our pre-enrichment picture suggests that this material should be metal-rich
(De Young 1978; Binney 1980). X-ray observations show that this material
has Z ≈ 0.3 Z⊙, as expected (e.g., Sarazin 1986).

10.1 If a spherical body has a density distribution ρ(r) = K_0(r), where K_0
is a modified Bessel function, show that its projected density is Σ(R) =
π e^-R (Merrifield & Kent 1989). [You may need the integral relation

    \int_0^\infty d\theta \, \cosh\theta \, K_0(R\cosh\theta) = \frac{\pi}{2R} e^{-R}.]


10.2 Freudenreich (1998) gives the L-band luminosity density at the center
of the Galactic bar [equations (10.4)] as j_L(0,0,0) = 4.54 MJy sr^-1 kpc^-1.
Use numbers listed in Table 2.1 to show that, at a distance of D pc, the
L-band flux from the Sun is 1400/D^2 Jy. Hence derive the value of the bar’s
central surface brightness, I_L, that is given in Table 10.1.

10.3 From Table 10.1, show that at R_0 the surface luminosity density of the
Galactic disk is ≈ 50 L⊙ pc^-2. Show from this result that the local K-band
surface brightness of the disk is ≈ 20.6 mag arcsec^-2.


10.4 Figure 10.2 shows that the integrated surface brightness of the Milky 
Way at its poles is dominated by stars brighter than B ~ 16. Explain this 
phenomenon using the data in Table 3.16 and Table 10.1. 


10.5 Consider a sample of stars with well determined parallaxes and proper
motions but no radial velocities. For each star, we define a 3 × 3 matrix A
by A_ij = δ_ij − r̂_i r̂_j, where r̂ = (r̂_1, r̂_2, r̂_3) is the unit vector from the Sun to
the star. Show that the data allow us to determine p = A · v for each star,
where v is the star’s heliocentric velocity. Show that ⟨v⟩ = ⟨A⟩^-1 · ⟨p⟩ is
the choice of u that minimizes the quantity S^2(u) = ⟨|p − A · u|^2⟩, where
angle-brackets imply sample averages. Explain why this calculation implies
that the Sun’s motion relative to the sample is given by v⊙ = −⟨v⟩, and
that the minimum value of S^2 is a measure of the intrinsic velocity dispersion
of the stars.


10.6 Use the formulae derived in §3.2.3 of BT to estimate the pericenter 
and apocenter of the Sun’s orbit around the Galaxy. 


10.7 In the construction of the Gliese (1969) catalog of nearby stars, the 
“solar neighborhood” was defined to be a sphere 40 pc in diameter. By how 
much does the standard of rest vary across this sphere? 


10.8 The vorticity of a fluid flow v(x) is ω = ∇ × v. In the case of a disk
galaxy such as the Milky Way, this flow represents the differential rotation
of the stars. Show that locally ω is simply related to the Oort constants by
|ω| = 2|B|.


10.9 The numbers of stars observed by Gilmore & Reid (1983) with 4 <
M_V < 5 at various distances z from the plane are 3, 7, 19, 22, 38, 50, 79,
107, 135, 149, 197, 207, 218, 233, 265, 301, 322. Each entry corresponds to 
an interval of width 0.1 in log z and the first entry is for the bin centered on 
log(z/1pc) = 2.05. Reproduce Figure 10.25 from these data. 


10.10 Show that the second term on the right-hand side of equation (10.43)
can be expressed in terms of the Oort constants as

    \frac{B^2 - A^2}{2\pi G}.

Show that with the Oort constants given by equation (10.36) this term is
≈ −0.0024 M⊙ pc^-3.


10.11 By writing Poisson’s equation, ∇ · F = −4πGρ, in spherical polar
coordinates, show that the density ρ_h of a spherical halo is connected to the
contribution v_h^2 that it makes to the Milky Way’s squared circular speed v_c^2
by

    \rho_h = \frac{1}{4\pi G r^2} \frac{d(r v_h^2)}{dr}.

Hence, show from equation (10.43) that a good estimate of the surface density
of the disk in the solar neighborhood, Σ_D(R_0), is given by

where v_d^2 = v_c^2 − v_h^2. If the disk is assumed to be thin and exponential,
then the left-hand side of this equation is a known multiple of Σ_D(R_0)
[see equation (2-169) of BT] and the right-hand side can be observationally
determined.


10.12 Subject to what boundary conditions should equation (10.61) be 
solved? 


10.13 Given that the halo is spherical and has luminosity 4x 10" Lo between 
$r = 0.1$ kpc and $r = 25$ kpc, show that at $R_0$ the surface-brightness of the
halo is of order $0.035\,L_\odot\,\mathrm{pc}^{-2}$. Show further that this surface brightness
corresponds to $(25.3 + M_\odot)$ mag arcsec$^{-2}$. Compare this value with a typical
sky surface brightness. 


10.14 Express the integrals of the last line of equation (10.70) in polar 
coordinates centered on the Galaxy’s systemic velocity and oriented such 
that the LSR’s velocity lies on the polar axis. Hence show that the mean 
azimuthal velocity of stars in the sample is 


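$$\langle v_\phi \rangle = \frac{v_c \int dx\, x^3 e^{-\alpha x^2} \int d\theta\, \sin 2\theta\, (x^2 + 1 - 2x\cos\theta)^{3/2}}{2 \int dx\, x^2 e^{-\alpha x^2} \int d\theta\, \sin\theta\, (x^2 + 1 - 2x\cos\theta)^{3/2}}, \tag{10.80}$$

where $\alpha = (v_c/2\sigma)^2$. Obtain a similar expression for $\langle v_\phi^2 \rangle$.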


10.15 Distribute 1000 objects uniformly within the unit sphere and then 
plot their locations in the meridional plane. Explain why contours of equal 
number density in this (R,z) plane are not circularly symmetric. What is 
the significance of this fact for studies of the globular-cluster sub-systems? 


10.16 Show that for any spherical Galactic potential, the escape velocity 
from the solar neighborhood, $v_e$, and the local circular speed, $v_c$, are related
by

$$v_c \le \frac{v_e}{\sqrt{2}}, \tag{10.81}$$

with equality in the case that there is no mass at $R > R_0$. Stars are observed
that have Galactocentric speeds up to $\sim 400\,\mathrm{km\,s^{-1}}$ (Carney et al. 1996).
From this observation, obtain an upper limit on $v_c$. Why might this upper
limit be misleading? 


11 Stellar Kinematics in External Galaxies


In §10.3 we have seen that when we study the kinematics of nearby stars 
in the Galaxy we are confronted by a wealth of dynamical information. We 
can measure the full three-dimensional velocities of these stars through the 
combination of Doppler shifts and proper motions, and we can also measure 
their three-dimensional locations via their angular positions and parallaxes. 
Thus, we can recover the full six-dimensional “phase space” coordinates of 
many stars in the Milky Way. The density of stars in this phase space, or 
distribution function (BT §4.1), fully specifies the dynamics of a stellar 
system. 

For external galaxies, we cannot obtain all the data required to recon- 
struct the distribution function directly. On account of their great distances, 
we cannot hope to measure proper motions and parallaxes for stars in ex- 
ternal galaxies, so only line-of-sight velocities and angular coordinates are 
observable. Further, individual stars cannot usually be resolved on account
of the finite angular resolution of telescopes: even with the imaging
capabilities of HST, the majority of the starlight from an external galaxy
still forms an unresolved continuum. Hence, observations of starlight from an
external galaxy can only reveal averages of the stellar properties of the vast
numbers of unresolved stars that lie along each line of sight. Finally, even
the largest telescopes have trouble collecting enough light to measure the
kinematics of
external galaxies with any precision. In this chapter we show that, despite 
these difficulties, a great deal can still be learned about the stellar kinematics 
of external galaxies. 


11.1 Measuring the kinematics of external galaxies 


When measuring the kinematics of individual stars in the Milky Way (§10.3)
or in globular clusters (§6.1.11), we used the Doppler shift of each star’s
spectral absorption lines to measure its line-of-sight velocity: if the star is
moving with line-of-sight velocity $v_\mathrm{los}$, then a spectrum of the star will reveal
that a feature which naturally occurs at wavelength $\lambda$ will be shifted by
$\Delta\lambda = (v_\mathrm{los}/c)\lambda$ (since $v_\mathrm{los} \ll c$). In terms of the spectral velocity

$$u = c \ln \lambda, \tag{11.1}$$

the shift of a spectral feature is simply the line-of-sight velocity:
$\Delta u = c\Delta\lambda/\lambda = v_\mathrm{los}$. Hence light that is received at spectral velocity $u$ was emitted
at spectral velocity $u - v_\mathrm{los}$.
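The convenience of the spectral velocity of equation (11.1) is that a Doppler shift becomes a simple additive offset, whatever the rest wavelength. A minimal sketch of this property follows; the function names, the 518 nm rest wavelength and the 200 km/s velocity are illustrative assumptions.

```python
import math

C_KMS = 299792.458  # speed of light in km/s

def spectral_velocity(wavelength):
    """Spectral velocity u = c ln(lambda) of equation (11.1), in km/s."""
    return C_KMS * math.log(wavelength)

def doppler_shifted(wavelength, v_los):
    """Observed wavelength for v_los << c, using the first-order Doppler shift."""
    return wavelength * (1.0 + v_los / C_KMS)

rest = 518.0      # nm; assumed rest wavelength near the Mg b feature
v_los = 200.0     # km/s; assumed illustrative velocity
shift_u = spectral_velocity(doppler_shifted(rest, v_los)) - spectral_velocity(rest)
print(f"shift in u: {shift_u:.3f} km/s for v_los = {v_los} km/s")
```

The small difference between the shift in $u$ and $v_\mathrm{los}$ is the second-order term of order $v_\mathrm{los}^2/2c$, which is negligible for stellar velocities.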

When light is collected from some point in an external galaxy, its spec- 
trum consists of the sum of the spectra of the unresolved stars along that 
line of sight through the galaxy. However, each individual star will generally 
have a slightly different line-of-sight velocity, so its spectral features will be
shifted by a different amount $\Delta u = v_\mathrm{los}$. The net effect of adding all these
shifted spectra together will be to produce a composite spectrum in which 
the stellar absorption lines appear shifted and broadened: this phenomenon 
is illustrated in Figure 11.1, which compares the spectrum of a galaxy to 
that of a star. 

To make a quantitative analysis of the shifts and broadening of the
absorption lines in a galaxy spectrum, we define the line-of-sight velocity
distribution (LOSVD), $F(v_\mathrm{los})$, such that the fraction of the stars
contributing to the spectrum which have line-of-sight velocities between $v_\mathrm{los}$
and $v_\mathrm{los} + dv_\mathrm{los}$ is given by $F(v_\mathrm{los})\,dv_\mathrm{los}$. At any given spectral velocity $u$ in
the galaxy’s spectrum, we will observe the light from individual stars that
was emitted at many different spectral velocities; for a star moving with a
line-of-sight velocity $v_\mathrm{los}$, this light will have originated at a spectral velocity
of $u - v_\mathrm{los}$, and been Doppler shifted to $u$. If we assume for the moment that
all of the stars have intrinsically identical spectra, $S(u)$, then the intensity
received at spectral velocity $u$ from a star with line-of-sight velocity $v_\mathrm{los}$ is
$S(u - v_\mathrm{los})$. Summing over all stars we obtain


$$G(u) \propto \int dv_\mathrm{los}\, F(v_\mathrm{los})\, S(u - v_\mathrm{los}). \tag{11.2}$$

Figure 11.1 Spectra of a K0 giant star (S) and the center of the
lenticular galaxy NGC 2549 (G). These data cover a small part of
the optical spectrum around the strong Mg b absorption feature at
518 nm.
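Equation (11.2) can be made concrete on a pixel grid that is uniform in spectral velocity. The following is a minimal sketch rather than a production routine: the 20 km/s pixel size, the single-line toy template, the LOSVD parameters and the helper names `gaussian_losvd` and `broaden` are all illustrative assumptions.

```python
import math

def gaussian_losvd(v_grid, v_mean, sigma):
    """Gaussian LOSVD sampled on v_grid and normalized to unit sum."""
    weights = [math.exp(-0.5 * ((v - v_mean) / sigma) ** 2) for v in v_grid]
    total = sum(weights)
    return [w / total for w in weights]

def broaden(template, losvd, offsets):
    """Discrete form of equation (11.2): G[i] = sum_j F[j] * S[i - offsets[j]]."""
    n = len(template)
    galaxy = []
    for i in range(n):
        acc = 0.0
        for f, off in zip(losvd, offsets):
            j = i - off
            if 0 <= j < n:       # pixels outside the observed window count as zero
                acc += f * template[j]
        galaxy.append(acc)
    return galaxy

du = 20.0                                   # km/s per pixel (assumed sampling)
n_pix = 400
# Toy continuum-subtracted template: one narrow absorption line at pixel 200.
template = [-math.exp(-0.5 * ((i - 200) * du / 30.0) ** 2) for i in range(n_pix)]

offsets = list(range(-40, 41))              # shifts in pixels
v_grid = [off * du for off in offsets]      # the same shifts in km/s
losvd = gaussian_losvd(v_grid, v_mean=100.0, sigma=160.0)
galaxy = broaden(template, losvd, offsets)

line_centre = min(range(n_pix), key=lambda i: galaxy[i])
print("template line at pixel 200, galaxy line at pixel", line_centre)
```

The broadened line is both displaced (by the mean velocity) and shallower and wider (through the dispersion), which is the behavior that Figure 11.1 illustrates for real spectra.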


As discussed above, a galaxy’s kinematics are encoded in the shapes 
of individual spectral lines. In order to focus on these lines, we separate 
each spectrum into the sum of a slowly-varying continuum component, and 
a rapidly-varying component containing the spectral lines. Thus, we write 
$S = S_\mathrm{cont} + S_\mathrm{line}$. The convolution integral in equation (11.2) means that $G$
is a somewhat smoothed version of $S$ (as Figure 11.1 illustrates). Thus, the
component of the galaxy spectrum generated by the integral over the
slowly-varying $S_\mathrm{cont}$ component must itself be a slowly-varying function, $G_\mathrm{cont}$.
Similarly, the rapidly-varying component of the stellar spectrum will produce the
rapidly-varying component of the galaxy spectrum, $G_\mathrm{line}$. Equation (11.2)
therefore remains valid if we replace $S$ and $G$ by $S_\mathrm{line}$ and $G_\mathrm{line}$, and it
is these rapidly-varying components that allow us to investigate the galaxy 
kinematics. Thus, it is customary in stellar-kinematic analyses to remove 
the continua of spectra by subtracting low-order polynomial fits from the 
raw data. Henceforth in this chapter, we will assume that all spectra have 
been pre-processed in this manner. 
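The continuum removal described above can be sketched with the simplest low-order polynomial, a straight line fitted by least squares; higher orders work the same way with more coefficients. The toy spectrum, its two absorption lines and the function name `fit_line` are assumptions made for illustration.

```python
import math

def fit_line(x, y):
    """Least-squares straight line y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    slope = (sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
             / sum((xi - mx) ** 2 for xi in x))
    return my - slope * mx, slope

pixels = list(range(200))
continuum = [10.0 + 0.02 * p for p in pixels]           # assumed smooth continuum
lines = [-2.0 * math.exp(-0.5 * ((p - 80) / 3.0) ** 2)
         - 1.5 * math.exp(-0.5 * ((p - 140) / 3.0) ** 2) for p in pixels]
spectrum = [c + l for c, l in zip(continuum, lines)]

intercept, slope = fit_line(pixels, spectrum)
# Continuum-subtracted spectrum: fluctuates about a mean of zero.
line_only = [s - (intercept + slope * p) for p, s in zip(pixels, spectrum)]
print(f"fitted slope = {slope:.4f}, residual mean = {sum(line_only) / len(line_only):.2e}")
```

Because the absorption lines themselves pull the fit down slightly, the fitted continuum is only approximately the true one; this is one way in which the wings of the lines become sensitive to the details of the subtraction.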

Equation (11.2) is the fundamental formula for the study of stellar
kinematics in external galaxies. We observe $G(u)$ for a line of sight through a
galaxy by obtaining its spectrum, and, if we know the type of star from which
the galaxy is made, we can estimate $S(u)$ using the spectrum of a similar
star from our own galaxy. Given these two quantities, the most general
kinematic property that we can hope to derive for each line of sight through
a galaxy is $F(v_\mathrm{los})$. Since the right-hand side of equation (11.2) is a
convolution integral, we can isolate the LOSVD by taking the Fourier transform
of this equation. In fact, we have

$$\tilde{F}(k) \propto \frac{\tilde{G}(k)}{\tilde{S}(k)}, \tag{11.3}$$

where tilded quantities denote the Fourier transforms of the original
functions. Thus, in principle we can solve directly for $F(v_\mathrm{los})$ by taking the inverse
transform of equation (11.3). Computationally, this approach is tempting
since the requisite Fourier transforms and their inverses can be calculated
efficiently using the Fast Fourier Transform (FFT) algorithm. Figure 11.2
shows why this technique fails in practice. The subtraction of the continuum
from the spectra leads to the low frequency information in the spectra being
lost, so the transforms are meaningless near $k = 0$. At high frequencies (large
$k$), both $\tilde{G}(k)$ and $\tilde{S}(k)$ are small, so any contribution to $\tilde{G}(k)$ from noise will
be greatly amplified in the ratio $\tilde{G}/\tilde{S}$ that appears in equation (11.3). Even
at intermediate values of $k$, there are frequencies at which $\tilde{S}(k)$ happens to
be small, so a similar amplification of noise will occur at these points as well.
Thus, the ratio shown in the bottom panel of Figure 11.2, although it clearly
contains information as to the form of the LOSVD, has been compromised
to a point where we cannot simply derive $F(v_\mathrm{los})$ from its inverse transform.
To date, therefore, most dynamicists have adopted less direct methods for
comparing galaxy and stellar spectra in order to derive at least some of the
properties of galaxies’ LOSVDs.

1 In practice, the finite resolution of any spectrograph means that lines in the spectra
will also be broadened by instrumental effects. However, as long as the star and galaxy
spectra are obtained with the same instrumental set-up, this extra broadening affects $S$
and $G$ equally, so equation (11.2) remains valid.

Figure 11.2 The amplitudes of the Fourier transforms of the spectra
illustrated in Figure 11.1, and the amplitude of their ratio. The
dotted line shows a model fit to the amplitude of equation (11.7),
with $\gamma = 0.8$ and $\sigma_\mathrm{los} = 160\,\mathrm{km\,s^{-1}}$, and the horizontal bar
indicates the range in $k$ over which the fit was made.
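The amplification of noise in the Fourier quotient can be seen in a small numerical experiment. This is a sketch under stated assumptions: a single Gaussian line stands in for the template, the LOSVD is a zero-mean Gaussian, the grid is treated as periodic, and a plain discrete Fourier transform (the hypothetical helper `dft`) replaces the FFT to keep the example dependency-free.

```python
import cmath
import math
import random

def dft(x):
    """Plain O(n^2) discrete Fourier transform; an FFT would be used in practice."""
    n = len(x)
    return [sum(x[m] * cmath.exp(-2j * math.pi * m * k / n) for m in range(n))
            for k in range(n)]

n = 128
sigma_s, sigma_f = 2.0, 3.0      # assumed widths (pixels) of template line and LOSVD
template = [math.exp(-0.5 * ((i - n // 2) / sigma_s) ** 2) for i in range(n)]

# Zero-mean Gaussian LOSVD, wrapped onto the periodic grid and normalized.
raw = [math.exp(-0.5 * (min(j, n - j) / sigma_f) ** 2) for j in range(n)]
norm = sum(raw)
losvd = [r / norm for r in raw]

# Equation (11.2) as a circular convolution, then noise added to the galaxy only.
galaxy = [sum(losvd[j] * template[(i - j) % n] for j in range(n)) for i in range(n)]
random.seed(1)
noisy = [g + random.gauss(0.0, 0.01) for g in galaxy]

s_k = dft(template)
clean_ratio = [abs(g / s) for g, s in zip(dft(galaxy), s_k)]
noisy_ratio = [abs(g / s) for g, s in zip(dft(noisy), s_k)]

for k in (2, 10, 30, 60):
    model = math.exp(-0.5 * (2.0 * math.pi * sigma_f * k / n) ** 2)
    print(f"k={k:3d}  model={model:.3e}  clean={clean_ratio[k]:.3e}  noisy={noisy_ratio[k]:.3e}")
```

At low $k$ the noise-free quotient follows the Gaussian amplitude expected for the input LOSVD, whereas at high $k$, where the transform of the template is tiny, the noisy quotient is dominated entirely by amplified noise.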


11.1.1 Mean velocities and velocity dispersions 


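The simplest properties of a LOSVD are its mean value,

$$\bar{v}_\mathrm{los} = \int dv_\mathrm{los}\, v_\mathrm{los} F(v_\mathrm{los}), \tag{11.4}$$

and its dispersion $\sigma_\mathrm{los}$, which is defined by

$$\sigma_\mathrm{los}^2 = \int dv_\mathrm{los}\, (v_\mathrm{los} - \bar{v}_\mathrm{los})^2 F(v_\mathrm{los}). \tag{11.5}$$

These quantities measure the mean streaming and random motions of the
stars averaged along the line of sight through the galaxy.

One straightforward approach to estimating these parameters is to
assume that the LOSVD has the Gaussian form,

$$F_\mathrm{mod}(v_\mathrm{los} | \bar{v}_\mathrm{los}, \sigma_\mathrm{los}) \propto \exp\left[ -\frac{(v_\mathrm{los} - \bar{v}_\mathrm{los})^2}{2\sigma_\mathrm{los}^2} \right]. \tag{11.6}$$

The Fourier transform of the LOSVD is then

$$\tilde{F}_\mathrm{mod}(k | \bar{v}_\mathrm{los}, \sigma_\mathrm{los}) = \gamma \exp\left[ -\tfrac{1}{2} (2\pi\sigma_\mathrm{los} k)^2 \right] \times e^{2\pi i \bar{v}_\mathrm{los} k}, \tag{11.7}$$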


where the parameter $\gamma$ is a normalization factor, which measures the average
strength of the absorption lines in the galaxy’s spectrum relative to those in
the star. Equation (11.3) implies that we can solve for the unknown
parameters, $\bar{v}_\mathrm{los}$ and $\sigma_\mathrm{los}$, by fitting this functional form for the Fourier-transformed
LOSVD to the observed ratio $\tilde{G}/\tilde{S}$: the variation in the amplitude of this
complex quantity with $k$ allows us to estimate $\gamma$ and $\sigma_\mathrm{los}$ (see Figure 11.2),
while its phase specifies $\bar{v}_\mathrm{los}$. This technique is known as the Fourier
Quotient Method, and was pioneered by Sargent et al. (1977). Its principal
disadvantage is that, as we have discussed above, the ratio $\tilde{G}/\tilde{S}$ is plagued
by large errors that vary from point to point. Thus, it is difficult to decide
objectively over what range in $k$ the fit should be performed, and how the
individual data points should be weighted.

Figure 11.3 The cross-correlation function between the galaxy and
star spectra shown in Figure 11.1. The dotted line shows the
auto-correlation function of the star. The functions have been
normalized by their peak values.
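The amplitude part of the Fourier Quotient fit can be sketched simply, because the logarithm of the amplitude of equation (11.7) is linear in $k^2$: $\ln|\tilde{F}_\mathrm{mod}| = \ln\gamma - 2\pi^2\sigma_\mathrm{los}^2 k^2$. The example below fits noise-free synthetic amplitudes with an unweighted straight line; the function names, the range of $k$ and the input values are assumptions, and the sketch deliberately ignores the weighting problem described above.

```python
import math

def model_amplitude(k, gamma, sigma_los):
    """Amplitude of equation (11.7); k in cycles per km/s, sigma_los in km/s."""
    return gamma * math.exp(-0.5 * (2.0 * math.pi * sigma_los * k) ** 2)

def fit_amplitude(ks, amplitudes):
    """Unweighted straight-line fit of ln(amplitude) against k^2."""
    xs = [k * k for k in ks]
    ys = [math.log(a) for a in amplitudes]
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    slope = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
             / sum((x - mx) ** 2 for x in xs))
    intercept = my - slope * mx
    gamma = math.exp(intercept)
    sigma_los = math.sqrt(-slope / (2.0 * math.pi ** 2))
    return gamma, sigma_los

ks = [0.0002 * j for j in range(1, 11)]                  # assumed fitting range in k
data = [model_amplitude(k, 0.8, 160.0) for k in ks]      # synthetic, noise-free "quotient"
gamma_fit, sigma_fit = fit_amplitude(ks, data)
print(f"gamma = {gamma_fit:.3f}, sigma_los = {sigma_fit:.1f} km/s")
```

With real data each point of the quotient has a different uncertainty, so a fit of this kind would need weights and a carefully chosen range in $k$, which is exactly where the method becomes subjective.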

An alternative approach to calculating the mean velocity of a galaxy is
provided by the cross-correlation method. This technique, pioneered by
Simkin (1974) and developed further by Tonry & Davis (1979), involves
calculating the cross-correlation function (CCF) between the galaxy and stellar
spectra,

$$\mathrm{CCF}(v_\mathrm{los}) = \int du\, G(u)\, S(u - v_\mathrm{los}). \tag{11.8}$$

Since this integral has the form of a convolution, it is most efficiently calculated by means
of Fourier transforms. A typical example of a CCF is shown in Figure 11.3.
Since the spectra that produced this figure have had their continuum levels
subtracted, each spectrum consists of a function that fluctuates to positive
and negative values around a mean of zero. For a random value of $v_\mathrm{los}$, the
integrand on the right-hand side of equation (11.8) is therefore as likely to
be negative as positive, so $\mathrm{CCF}(v_\mathrm{los})$ is small. However, when $v_\mathrm{los}$ is equal
to $\bar{v}_\mathrm{los}$, the arguments for $G$ and $S$ on the right-hand side of equation (11.8)
will be such that absorption features in the two spectra will be “lined up,”
so the integrand will be the product of either two negative or two positive
numbers, and the integral will evaluate to a large positive value, producing
the strong peak in Figure 11.3. We can therefore estimate $\bar{v}_\mathrm{los}$ by simply
finding the maximum value of $\mathrm{CCF}(v_\mathrm{los})$, a process which has the advantages
of being objective and readily automated. Since the absorption lines in the
spectra have finite widths, the features will remain correlated even if $v_\mathrm{los}$ is
not exactly equal to $\bar{v}_\mathrm{los}$, so the peak in $\mathrm{CCF}(v_\mathrm{los})$ has a finite width around
$\bar{v}_\mathrm{los}$. The width of this peak depends on how broad the lines in the galaxy
spectrum are, which in turn depends on $\sigma_\mathrm{los}$. We can therefore derive $\sigma_\mathrm{los}$
by measuring the width of the peak in $\mathrm{CCF}(v_\mathrm{los})$.

Figure 11.4 Comparison between the continuum-subtracted galaxy
spectrum of Figure 11.1 (dotted line) and the model obtained
by convolving the stellar spectrum in that figure with the best-fit
Gaussian model LOSVD (solid line).
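The peak-finding step of the cross-correlation method, equation (11.8), can be sketched as follows. The five-line toy template, the 30 km/s pixel size, the input shift of 5 pixels and the helper names are assumptions for illustration; the sums are evaluated directly rather than with Fourier transforms to keep the example short.

```python
import math

def make_template(n, centres, width):
    """Toy absorption-line spectrum with its mean removed (crude continuum subtraction)."""
    spec = [-sum(math.exp(-0.5 * ((i - c) / width) ** 2) for c in centres) for i in range(n)]
    mean = sum(spec) / n
    return [s - mean for s in spec]

def broaden(template, shift, sigma, half):
    """Apply a Gaussian LOSVD with mean `shift` and dispersion `sigma`, both in pixels."""
    n = len(template)
    offsets = range(-half, half + 1)
    weights = [math.exp(-0.5 * ((o - shift) / sigma) ** 2) for o in offsets]
    total = sum(weights)
    return [sum(w / total * template[i - o] for w, o in zip(weights, offsets) if 0 <= i - o < n)
            for i in range(n)]

def ccf(galaxy, template, max_lag):
    """Equation (11.8) as a sum over pixels, for integer lags measured in pixels."""
    n = len(galaxy)
    return {lag: sum(galaxy[i] * template[i - lag] for i in range(n) if 0 <= i - lag < n)
            for lag in range(-max_lag, max_lag + 1)}

du = 30.0                                   # km/s per pixel (assumed)
template = make_template(300, centres=[60, 95, 150, 170, 230], width=1.5)
galaxy = broaden(template, shift=5.0, sigma=4.0, half=25)
correlation = ccf(galaxy, template, max_lag=40)
best_lag = max(correlation, key=correlation.get)
print(f"CCF peaks at lag {best_lag} pixels, i.e. about {best_lag * du:.0f} km/s")
```

Weaker secondary peaks appear at lags where one line of the template happens to line up with a different line of the galaxy spectrum; the main peak dominates because all lines line up there simultaneously.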


Once we have obtained an estimate for the parameters of the LOSVD, it 
is instructive to substitute the corresponding model LOSVD, equation (11.6), 
into equation (11.2) to see how closely the resulting model for the galaxy 
spectrum matches the observed galaxy spectrum. As Figure 11.4 illustrates, 
when such comparisons are made, the fit between model and data is almost 
always found to be formally poor. One reason for this poor fit is our erroneous 
initial assumption that all the stars in the galaxy are identical. In reality, 
galaxies are made up from stars with a range of spectral types, so it is 
unlikely that our single “template” stellar spectrum will have line strengths 
that exactly match the composite galaxy spectrum. This error is known 
as template mismatch. A simple way to assess the importance of this 
phenomenon is to use one of the above methods to obtain $\bar{v}_\mathrm{los}$ and $\sigma_\mathrm{los}$ from
a variety of stellar template spectra; if the derived kinematics do not depend 
strongly on the spectral type of the template star, then the results have 
probably not been compromised. A more sophisticated analysis involves 
modeling the galaxy spectrum using a set of broadened template spectra 
from different stellar types. By leaving the relative contributions from the 
different spectra as free parameters, it is possible to produce a composite 
spectrum which matches the line strengths in the observed galaxy spectrum 
quite closely (Rix & White 1992). In principle, this approach might be used 
to measure the stellar composition of external galaxies, but in practice the 
limited quality of the data available leaves the problem under-determined: 
the sum of the spectra from an F star and a K star looks very much like
the spectrum of a G star. It is also worth noting that in the one system in
which we can study the properties of individual stars in some detail — the 
Milky Way — the observed kinematics vary with the spectral type of the star 
(see §10.3.2). It is reasonable to assume that such variations occur in other 
galaxies as well, and so we should really allow the individual stellar templates 
to be broadened by different LOSVDs when producing a composite galaxy 
model. However, such sophistication is not possible with the current quality 
of spectral data. 

A second reason why the fit between the galaxy model and the observed 
spectrum may be poor lies in our assumption that the LOSVD is Gaussian. 
Beyond its mathematical convenience, there is very little justification for this 
assumption. We have already found that the stars in the solar neighborhood 
have distinctly non-Gaussian velocity distributions (see Figure 10.14), and 
we shall see below that there are stellar systems in which the departures from 
Gaussian velocity distributions are even more dramatic. Further, integrating 
along a line of sight through a system in which the velocity distribution is 
at each point Gaussian, but in which the mean velocity and dispersion vary 
with position, will produce a non-Gaussian LOSVD. It should therefore not 
be surprising if a Gaussian model for the LOSVD does not reproduce in 
detail the observed broadening of galaxy spectral lines. 
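The statement that a superposition of Gaussians is not itself Gaussian is easy to verify numerically. In this sketch, two equally weighted Gaussian velocity distributions with the same mean but dispersions of 100 and 300 km/s (assumed values, standing in for two positions along a line of sight) are added, and the ratio $\mu_4/\mu_2^2$, which equals 3 for a Gaussian, is computed by direct summation.

```python
import math

def gaussian(v, mean, sigma):
    return math.exp(-0.5 * ((v - mean) / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))

def dispersion_and_kurtosis(v_grid, f):
    """Return (sigma, mu4 / mu2^2) of a sampled distribution f on a uniform grid."""
    total = sum(f)
    mean = sum(v * w for v, w in zip(v_grid, f)) / total
    mu2 = sum((v - mean) ** 2 * w for v, w in zip(v_grid, f)) / total
    mu4 = sum((v - mean) ** 4 * w for v, w in zip(v_grid, f)) / total
    return math.sqrt(mu2), mu4 / mu2 ** 2

v_grid = [-3000.0 + 5.0 * i for i in range(1201)]          # km/s
single = [gaussian(v, 0.0, 200.0) for v in v_grid]
mixed = [0.5 * gaussian(v, 0.0, 100.0) + 0.5 * gaussian(v, 0.0, 300.0) for v in v_grid]

sigma_single, kurt_single = dispersion_and_kurtosis(v_grid, single)
sigma_mixed, kurt_mixed = dispersion_and_kurtosis(v_grid, mixed)
print(f"single Gaussian: sigma = {sigma_single:.1f}, mu4/mu2^2 = {kurt_single:.2f}")
print(f"superposition:   sigma = {sigma_mixed:.1f}, mu4/mu2^2 = {kurt_mixed:.2f}")
```

The superposition is more centrally peaked, with broader wings, than a single Gaussian of the same dispersion, even though each contribution is exactly Gaussian.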


11.1.2 Analysis of line profiles 


Given the shortcomings of the Gaussian model, we need some way of fitting 
a more general functional form to the LOSVD. Essentially, we must return 
to the solution of equation (11.2) for $F(v_\mathrm{los})$ given $G$ and $S$. Disentangling
the LOSVD from this integral equation is an example of an inverse problem
(see Box 3.1). As we saw in §11.1, although the problem is soluble in principle, the presence
of noise in observed spectra prevents us from performing the inversion from
$G(u)$ and $S(u)$ to $F(v_\mathrm{los})$ directly. A number of methods have therefore been
developed which help to suppress the effects of noise in the galaxy spectra. 

The simplest approach to this problem involves smoothing the galaxy 
spectrum to reduce the high-frequency power contributed by random noise. 
In the Fourier domain, this process is equivalent to reducing the amplitude 
of the high-frequency terms in $\tilde{G}(k)$, so that the noise in the quotient of
equation (11.3) is similarly reduced. By making a relatively crude model 
for the contributions to the right-hand side of equation (11.3) from both 
the true LOSVD and the noise, it is possible to devise an optimal filter 
which, when applied to this equation, returns the best-fitting estimate for 
the LOSVD that is smooth. The first attempt to apply this technique to 
a galaxy spectrum was made by Simkin (1974). Her analysis showed that, 
although the method works in principle, it does not provide a magic solution 
to the problem: with the relatively low quality spectra then available, the 
derived LOSVD was still compromised by noise.

A related approach uses the relationship between the cross-correlation
function given by equation (11.8) and the stellar auto-correlation function
(ACF), which is the cross-correlation of the stellar template spectrum with
itself:

$$\mathrm{ACF}(v_\mathrm{los}) = \int du\, S(u)\, S(u - v_\mathrm{los}) \tag{11.9}$$


(see Figure 11.3). It is mathematically straightforward to show that the CCF 
is just the convolution of the ACF with the LOSVD (see Problem 11.1). In- 
tuitively, we can grasp this result by thinking of the galaxy spectrum as the 
sum of a large number of stellar spectra all slightly shifted in velocity. The 
CCF of the galaxy spectrum with the stellar template will thus be the sum 
of a similar number of stellar ACFs, all slightly shifted in their mean
velocities, producing in sum a function which is the stellar ACF broadened by
the galaxy’s LOSVD. We can therefore obtain the LOSVD by deconvolving 
the ACF from the CCF, just as we have previously attempted to deconvolve 
the stellar template from the galaxy spectrum. The principal advantage of 
working with correlation functions rather than with spectra directly is that 
all of the useful information in the correlation functions is concentrated in a 
single peak rather than being divided amongst the broadening of many spec- 
tral lines. In fact, the conflicting information from different spectral lines 
which can arise through template mismatch is rather effectively reconciled 
by forming the correlation function. Since the signal-to-noise ratio of the 
concentrated peak in the correlation function is higher than that of the spec- 
trum as a whole, the amount of filtering required to suppress the noise when 
obtaining the LOSVD is correspondingly reduced. This approach has been 
successfully implemented by Franx & Illingworth (1988) and Bender (1990), 
and has been shown to return reliable estimates for the LOSVDs of galaxies. 
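The relation between the CCF, the ACF and the LOSVD can be checked numerically. In the sketch below the spectra are treated as periodic, so that the identity holds to rounding error; the random stand-in template, the LOSVD parameters and the helper names `circ_conv` and `circ_corr` are assumptions for illustration.

```python
import math
import random

n = 64
random.seed(0)
# Stand-in for a continuum-subtracted template spectrum (assumed, purely random).
template = [random.gauss(0.0, 1.0) for _ in range(n)]

# Gaussian LOSVD with a mean shift of 3 pixels and dispersion 2.5 pixels, wrapped periodically.
shift, sigma = 3, 2.5
raw = [math.exp(-0.5 * ((((j - shift + n // 2) % n) - n // 2) / sigma) ** 2) for j in range(n)]
norm = sum(raw)
losvd = [r / norm for r in raw]

def circ_conv(a, b):
    """Circular convolution: out[i] = sum_j a[j] * b[i - j]."""
    return [sum(a[j] * b[(i - j) % n] for j in range(n)) for i in range(n)]

def circ_corr(a, b):
    """Circular correlation: out[lag] = sum_i a[i] * b[i - lag]."""
    return [sum(a[i] * b[(i - lag) % n] for i in range(n)) for lag in range(n)]

galaxy = circ_conv(losvd, template)          # equation (11.2)
ccf = circ_corr(galaxy, template)            # equation (11.8)
acf = circ_corr(template, template)          # equation (11.9)
broadened_acf = circ_conv(losvd, acf)        # ACF convolved with the LOSVD

max_diff = max(abs(x - y) for x, y in zip(ccf, broadened_acf))
print(f"largest difference between CCF and LOSVD-broadened ACF: {max_diff:.2e}")
```

For real, non-periodic spectra the identity holds only approximately near the ends of the wavelength range, which is one reason why spectra are usually tapered before being correlated.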

These methods for deriving the LOSVD are essentially non-parametric: 
they make no assumptions about the form that the LOSVD will take. As we 
have seen above, it is simpler to tackle this problem if a functional form is 
adopted for the LOSVD, and its parameters are varied until the best match 
is obtained between the observed galaxy spectrum and the stellar template 
convolved with the LOSVD. The single Gaussian model of equation (11.6) 
is in general not adequate, and so more flexible parameterizations of the 
LOSVD have been suggested. For example, Rix & White (1992) successfully 
modeled the LOSVD of an elliptical galaxy using the sum of two Gaussians 
with different amplitudes, means and dispersions. 

As a more general parameterization of the LOSVD, we can specify the
function in terms of its moments,

$$\mu_k = \int dv_\mathrm{los}\, (v_\mathrm{los} - \bar{v}_\mathrm{los})^k F(v_\mathrm{los}). \tag{11.10}$$

By the definition of $\bar{v}_\mathrm{los}$, $\mu_1 = 0$, and we have already encountered $\mu_2 = \sigma_\mathrm{los}^2$,
a measure of the width of the LOSVD. The higher-order moments tell us
more about the shape of the LOSVD. Such quantities also have the
physical motivation that, like the velocity dispersion (BT §4.2), the values of
these moments in stellar systems are constrained by Jeans equations (see,
for example, Merrifield & Kent 1990, Magorrian & Binney 1994).

Figure 11.5 Montage showing how a velocity distribution typically
changes depending on its shape parameters $\xi_3$ and $\xi_4$. The
central distribution is a pure Gaussian ($\xi_3 = 0$, $\xi_4 = 3$). All
distributions have identical values for the Gauss-Hermite
parameters $\bar{v}_\mathrm{los}$ (marked as the y-axis for each LOSVD) and $\sigma_\mathrm{los}$. The
right-hand and top axes are marked with the approximately
corresponding values of the quantities $h_3$ and $h_4$ that are defined by
equation (11.12).

The higher-order shape parameters are usefully made dimensionless by
dividing by the appropriate power of the dispersion:

$$\xi_k = \mu_k / \sigma_\mathrm{los}^k. \tag{11.11}$$

The first of these new shape parameters, $\xi_3$, is termed the skewness, and it
basically measures departures from symmetry in the LOSVD: a
positively-skewed LOSVD has a long tail that extends to positive velocities,
while a negatively-skewed LOSVD has a corresponding tail at negative
velocities. The second shape parameter, $\xi_4$, is the kurtosis, which measures
symmetric departures from a Gaussian (for which $\xi_4 = 3$): a more
rectangular distribution will have $\xi_4 < 3$, while a more centrally-peaked distribution
will have $\xi_4 > 3$. Examples of how the values of $\xi_3$ and $\xi_4$ depend on the
shape of the LOSVD are shown in Figure 11.5.

Determining just these first few moments does not uniquely specify the
LOSVD: in order to define the distribution uniquely, we would have to
measure all the moments $\mu_k$, $k = 2 \ldots \infty$. Further, even the first few moments
cannot be calculated reliably in practice. The weighting by $(v_\mathrm{los} - \bar{v}_\mathrm{los})^k$ in the
integrand of equation (11.10) means that, as $k$ increases, the integral becomes
strongly dependent on the form of the LOSVD at velocities far from $\bar{v}_\mathrm{los}$.
Since the number of stars far from the mean velocity is small, the LOSVD is
dominated by noise far from $\bar{v}_\mathrm{los}$, and the velocity moments calculated from
equation (11.10) have huge uncertainties.
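The sensitivity of the higher moments to the far wings can be illustrated with a deliberately artificial example: a unit Gaussian LOSVD to which 0.1 per cent of the total weight is added in a single bin at $+6\sigma$. The contamination level, its location and the function name are assumptions chosen only to make the effect visible.

```python
import math

def shape_parameters(v_grid, f):
    """Return (sigma, xi3, xi4) from direct sums, following equations (11.10) and (11.11)."""
    total = sum(f)
    mean = sum(v * w for v, w in zip(v_grid, f)) / total
    mu = {k: sum((v - mean) ** k * w for v, w in zip(v_grid, f)) / total for k in (2, 3, 4)}
    sigma = math.sqrt(mu[2])
    return sigma, mu[3] / sigma ** 3, mu[4] / sigma ** 4

dv = 0.01
v_grid = [-8.0 + dv * i for i in range(1601)]       # velocities in units of the true dispersion
clean = [math.exp(-0.5 * v * v) for v in v_grid]

# Put 0.1 per cent of the total weight into the single bin nearest +6 sigma.
spike = list(clean)
idx = min(range(len(v_grid)), key=lambda i: abs(v_grid[i] - 6.0))
spike[idx] += 0.001 / 0.999 * sum(clean)

for label, f in (("clean", clean), ("contaminated", spike)):
    sigma, xi3, xi4 = shape_parameters(v_grid, f)
    print(f"{label:>12}: sigma = {sigma:.3f}, xi3 = {xi3:.3f}, xi4 = {xi4:.3f}")
```

A perturbation that changes the dispersion by less than two per cent shifts the kurtosis by about a third, which is why directly measured high-order moments are so fragile.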

Gerhard (1993) and van der Marel & Franx (1993) have come up with an 
elegant method for describing the LOSVD that reduces this problem, whilst 
retaining much of the physical significance of the moment shape parameters. 
They have modeled the LOSVD as a truncated Gauss-Hermite series, 
which consists of a Gaussian multiplied by a polynomial: 


$$F_\mathrm{TGH}(v_\mathrm{los}) \propto e^{-w^2/2} \left[ 1 + \sum_{k=3}^{n} h_k H_k(w) \right], \tag{11.12}$$

where $w = (v_\mathrm{los} - \bar{v})/\sigma$ with $\bar{v}$ and $\sigma$ free parameters. In equation (11.12),
the $h_k$ are constant coefficients and $H_k(w)$ is a Gauss-Hermite function,
which is simply a polynomial of order $k$ with carefully-chosen coefficients.
The number of terms required in the sum over polynomials depends on how
grossly the LOSVD departs from a Gaussian, but, as Figure 11.5 illustrates,
even truncating the series at $n = 4$ allows quite a wide range of shapes. By
truncating the series at this point, the LOSVD can be estimated for any given
galaxy spectrum by varying the values of $\bar{v}$, $\sigma$, $h_3$ and $h_4$ in equation (11.12)
until the convolution of this function with a stellar template best reproduces
the observed galaxy spectrum [see equation (11.2)]. The optimal fit can be
found using a non-linear least-squares fitting algorithm [e.g. Press et al.
(1992) §15.5], and van der Marel & Franx (1993) demonstrated that such
fitting can quite robustly find the optimal values of the parameters. The
validity of truncating the series at $n = 4$ can be tested by extending the
series to $n = 6$ and checking that the coefficients $h_5$ and $h_6$ have values
consistent with zero to within the errors in the fit. If the LOSVD is close to
Gaussian, $\bar{v}$ and $\sigma$ will be approximately equal to $\bar{v}_\mathrm{los}$ and $\sigma_\mathrm{los}$, respectively.
In the case of a strongly non-Gaussian LOSVD, the parameters $\bar{v}$ and $\sigma$ no
longer have this physical interpretation (see Problem 11.2).

The advantage of couching the term in square brackets as a sum of
Gauss-Hermite polynomials rather than simply a generic $n$th order
polynomial is that $h_3$ and $h_4$ convey similar information to that contained in the
physically-motivated moments $\xi_3$ and $\xi_4$: the parameters $h_3$ and $\xi_3$ describe
how the distribution is skewed, while $h_4$ and $\xi_4$ describe its symmetric
departures from a Gaussian (see Figure 11.5). Further, van der Marel & Franx
(1993) demonstrated that the use of Gauss-Hermite polynomials is optimal
in the sense that covariances between the fitted parameters are minimized.
Thus, although noise in the spectra will affect the derived parameters, there
will be almost no correlation between the errors in the different parameters:
a higher-than-average estimate for $\bar{v}$ will not, for example, be
accompanied by a higher-than-average estimate for $h_3$. The only slight caveat to
this parameterization is that $h_4$ still depends significantly on the form of the
LOSVD far from the mean velocity. In spectral terms, it is thus dictated by
the wings of the absorption lines, whose exact form is sensitive to the manner
in which the continuum has been subtracted. Hence, the derived value of $h_4$
can contain a significant systematic error due to the process of continuum
subtraction.

Although the Gauss-Hermite series of equation (11.12) provides a model 
that can describe a range of departures in the LOSVD from the canonical 
Gaussian (see Figure 11.5), it works best when those departures are fairly 
modest. As we shall see below, there are stellar systems for which a Gaussian 
is not even a reasonable approximation to the LOSVD. If we wanted to use 
an equation of the Gauss-Hermite form to model such LOSVDs, we would 
have to increase the order of the polynomial which modifies the Gaussian. 
Given the uncertainties even in h4 described above, this approach is probably 
not viable. Further, the distributions returned by equation (11.12) will not 
be positive everywhere if the values of $h_k$ become large; since the number of
stars at any given velocity must be non-negative, such negative values of the 
LOSVD are unphysical. 
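The truncated Gauss-Hermite form of equation (11.12), and the loss of positivity just described, can be sketched in a few lines. The explicit polynomials $H_3$ and $H_4$ below follow the normalization commonly attributed to van der Marel & Franx (1993); since the text does not list the coefficients, they should be treated as an assumption, as should the function names and the parameter values.

```python
import math

def h3_poly(w):
    """H_3(w) in the assumed van der Marel & Franx normalization."""
    return (2.0 * math.sqrt(2.0) * w ** 3 - 3.0 * math.sqrt(2.0) * w) / math.sqrt(6.0)

def h4_poly(w):
    """H_4(w) in the assumed van der Marel & Franx normalization."""
    return (4.0 * w ** 4 - 12.0 * w ** 2 + 3.0) / math.sqrt(24.0)

def f_tgh(v_los, v_bar, sigma, h3=0.0, h4=0.0):
    """Unnormalized equation (11.12), truncated at n = 4."""
    w = (v_los - v_bar) / sigma
    return math.exp(-0.5 * w * w) * (1.0 + h3 * h3_poly(w) + h4 * h4_poly(w))

v_grid = [-800.0 + 10.0 * i for i in range(161)]     # km/s
for h3 in (0.0, 0.05, 0.5):
    profile = [f_tgh(v, 0.0, 200.0, h3=h3) for v in v_grid]
    print(f"h3 = {h3}: minimum of profile = {min(profile):.4f}")
```

With $h_3 = h_4 = 0$ the profile reduces to a Gaussian and is positive everywhere; for a small $h_3$ the negative excursions are confined to the far wings and are tiny, but for a large $h_3$ the model becomes substantially negative within a couple of dispersions of the mean.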

An alternative model for parameterizing the LOSVD has been suggested
by Kuijken & Merrifield (1993), who describe the LOSVD by the sum of a set
of $n$ Gaussians at different predetermined mean velocities $\bar{v}_k$ and dispersions
$\sigma_k$, but variable amplitudes $a_k$:

$$F_\mathrm{UGD}(v_\mathrm{los}) = \sum_{k=1}^{n} a_k \exp\left[ -\frac{(v_\mathrm{los} - \bar{v}_k)^2}{2\sigma_k^2} \right]. \tag{11.13}$$

By placing adjacent Gaussians such that their mean velocities are separated
by less than $2\sigma_k$, this unresolved Gaussian decomposition (UGD) of the
LOSVD can reproduce any distribution that varies smoothly on scales of less
than $\sigma_k$. Thus, the procedure for estimating the LOSVD is much as for the
Gauss-Hermite technique: the amplitudes of the Gaussians, $a_k$, are varied
until $F_\mathrm{UGD}(v_\mathrm{los})$, when convolved with a stellar template spectrum, best
reproduces the galaxy spectrum under analysis. The advantage of this functional
form is that by appropriate selection of the values of $\bar{v}_k$ and $\sigma_k$, it is possible
to maintain explicit control on the velocity range over which the LOSVD is
non-zero, and on the velocity scale on which the LOSVD is smooth. It is
also simple to calculate other properties of the complete LOSVD in this
parameterization such as its mean velocity, $\bar{v}_\mathrm{los} = \sum_k a_k \bar{v}_k / \sum_k a_k$. Since $F_\mathrm{UGD}$
depends only linearly on the unknown parameters, $a_k$, the best-fitting model
can be found by a simple application of linear least-squares analysis.
Unlike the Gauss-Hermite parameterization, the estimates for the parameters
$a_k$ in this analysis will be strongly correlated: if the estimate of one
Gaussian’s amplitude just by chance comes out abnormally high, the estimated
amplitudes of its near neighbors in velocity will on average be reduced to
compensate. Thus the analysis of the uncertainties in the properties of the
LOSVD is complicated for this model since it is necessary to consider the
covariances between the different estimated parameters as well as their
individual uncertainties.
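Because the model spectrum is linear in the amplitudes $a_k$, the fit reduces to solving a small set of normal equations. The sketch below is not the procedure of Kuijken & Merrifield (1993) in detail: it uses five equal-width Gaussians, a two-line toy template, noise-free data, and hypothetical helpers `convolve` and `solve`, all of which are assumptions made to keep the example self-contained.

```python
import math

def convolve(template, kernel, half):
    """Discrete equation (11.2); kernel[j] weights a shift of (j - half) pixels."""
    n = len(template)
    return [sum(kernel[j] * template[i - j + half]
                for j in range(len(kernel)) if 0 <= i - j + half < n)
            for i in range(n)]

def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    m = len(b)
    a = [row[:] for row in a]
    b = b[:]
    for col in range(m):
        pivot = max(range(col, m), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        b[col], b[pivot] = b[pivot], b[col]
        for r in range(col + 1, m):
            factor = a[r][col] / a[col][col]
            for s in range(col, m):
                a[r][s] -= factor * a[col][s]
            b[r] -= factor * b[col]
    x = [0.0] * m
    for r in range(m - 1, -1, -1):
        x[r] = (b[r] - sum(a[r][s] * x[s] for s in range(r + 1, m))) / a[r][r]
    return x

du = 50.0                                        # km/s per pixel (assumed)
half = 12                                        # kernel spans +/- 600 km/s
means = [-300.0, -150.0, 0.0, 150.0, 300.0]      # fixed means in km/s (assumed)
sigma_k = 100.0                                  # fixed width; spacing 150 < 2 * sigma_k
template = [-math.exp(-0.5 * (i - 40) ** 2) - 0.6 * math.exp(-0.5 * (i - 75) ** 2)
            for i in range(120)]

# One basis spectrum per Gaussian: the template broadened by that component alone.
basis = []
for mean in means:
    kernel = [math.exp(-0.5 * (((j - half) * du - mean) / sigma_k) ** 2)
              for j in range(2 * half + 1)]
    basis.append(convolve(template, kernel, half))

true_a = [0.0, 0.2, 1.0, 0.6, 0.1]               # amplitudes used to build the "galaxy"
galaxy = [sum(a * b[i] for a, b in zip(true_a, basis)) for i in range(len(template))]

# Normal equations of linear least squares for the amplitudes.
gram = [[sum(p * q for p, q in zip(bi, bj)) for bj in basis] for bi in basis]
rhs = [sum(p * g for p, g in zip(bi, galaxy)) for bi in basis]
fit_a = solve(gram, rhs)
v_mean = sum(a * m for a, m in zip(fit_a, means)) / sum(fit_a)
print("fitted amplitudes:", [round(a, 6) for a in fit_a], " mean velocity:", round(v_mean, 2))
```

The large off-diagonal elements of the matrix `gram` are the numerical counterpart of the strong correlations between neighboring amplitudes noted above; with noisy data they translate into anti-correlated errors on adjacent components.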

For the UGD analysis, Kuijken & Merrifield (1993) showed that it is 
possible to force the LOSVD to be non-negative everywhere (as is physically
required) by introducing a series of linear constraints on the values of $a_k$. These
extra constraints turn the least-squares analysis into a quadratic program- 
ming problem, for which efficient numerical algorithms exist. The positivity 
constraint proves to be a very efficient mechanism for suppressing noise in 
the analysis. Without it, the estimated LOSVD oscillates back-and-forth 
about zero at velocities where the true LOSVD is close to zero. However, 
by stopping the function from dipping below zero, the fitting process is pre- 
vented from following the noise features which introduce this oscillation. The 
price that we have to pay for this suppression is a slight bias in the result- 
ing LOSVD. To see that this is the case, consider a line-of-sight velocity at 
which there are no stars in a galaxy, so that $F(v_\mathrm{los}) = 0$. If we obtain a series
of spectra of this galaxy, and estimate the LOSVD at this velocity, we will 
obtain values close to zero, but with some error due to noise in the spectra. 
However, the positivity constraint means that each estimate will be zero or 
positive, and thus the mean estimate will be greater than zero. The estimate 
is thus biased in the sense that its mean value differs from the true value of 
the LOSVD. This trade-off between noise suppression and bias is a common 
statistical dilemma, but in this application noise suppression is very efficient, 
while the bias introduced is modest. 
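The trade-off between noise suppression and bias can be illustrated with a toy Monte Carlo experiment. Here the true value of the LOSVD is zero, each unconstrained estimate is a Gaussian random number, and the positivity constraint is mimicked by simply clipping negative estimates to zero. That clipping is a crude stand-in, assumed for illustration, for the quadratic-programming solution described above.

```python
import math
import random

random.seed(42)
n_trials = 20000
noise = 1.0      # rms error of a single unconstrained estimate (arbitrary units)

unconstrained = [random.gauss(0.0, noise) for _ in range(n_trials)]
constrained = [max(0.0, x) for x in unconstrained]   # enforce F >= 0 by clipping

def mean(xs):
    return sum(xs) / len(xs)

def rms(xs):
    """Root-mean-square deviation from the true value, which is zero here."""
    return math.sqrt(sum(x * x for x in xs) / len(xs))

print(f"unconstrained: mean = {mean(unconstrained):+.3f}, rms error = {rms(unconstrained):.3f}")
print(f"constrained:   mean = {mean(constrained):+.3f}, rms error = {rms(constrained):.3f}")
```

The constrained estimates are biased upward, with a mean that tends to $1/\sqrt{2\pi} \approx 0.4$ times the noise level in this idealized case, but their rms error about the true value is smaller than that of the unconstrained estimates.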

The methods described above by no means exhaust the list of techniques 
that have been developed to estimate stellar LOSVDs from absorption-line 
data. Some methods have been tuned to solving particular problems in 
galaxy dynamics, while others enforce only the most general constraints of 
positivity and smoothness on the derived LOSVD. However, the bottom line 
with all these techniques is that unless the galaxy spectra are of sufficient 
quality to contain usable information on the shape of the LOSVD, no amount 
of clever analysis will reveal the details of the LOSVD. Fortunately, with the 
advent of large telescopes and high-throughput spectrographs, it has now 
become possible to routinely obtain the requisite high-quality spectra. 


11.1.3 Position-velocity diagrams and data cubes 


Most astronomical spectrographs used for obtaining kinematic observations
of galaxies admit light through a long, narrow slit. This light is then
dispersed in the direction perpendicular to the long dimension of the slit.
The detector illuminated by the dispersed light thus records a
two-dimensional image in which the signal received at a given (X,Y) on the
detector corresponds to the intensity of light from a point X along the slit
that was emitted in a narrow range of wavelengths centered on λ(Y). By
taking a cut through this two-dimensional function at fixed X, we can
extract the spectrum emitted by the galaxy at a single position X along the
slit, and can apply the above analysis to derive the corresponding LOSVD,
F(X, v_los). If we weight this function by the total intensity of light,
I(X), at each point X along the slit, we obtain the quantity
PV(X, v_los) = I(X) × F(X, v_los). If the galaxy is made up from similar
stars at all positions, then PV(X, v_los) is proportional to the phase
density of stars as a function of position along the slit and line-of-sight
velocity. A plot of the function PV(X, v_los) is known as a
position-velocity diagram, or PV diagram. Clearly, a cut through this
function at constant X yields the LOSVD at that point on the slit (up to the
factor I(X)), while integrating this quantity over v_los yields the surface
brightness at that point.
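The construction PV(X, v_los) = I(X) × F(X, v_los) can be illustrated with a small numerical sketch. Everything about the galaxy below is hypothetical: the exponential light profile, the linearly rising mean velocity, the fixed dispersion and the Gaussian LOSVD shape are assumptions chosen only to make the bookkeeping visible.

```python
import math

def gaussian_losvd(v_grid, v_mean, sigma):
    """Gaussian LOSVD sampled on v_grid, normalized to unit integral (Riemann sum)."""
    dv = v_grid[1] - v_grid[0]
    f = [math.exp(-0.5 * ((v - v_mean) / sigma) ** 2) for v in v_grid]
    norm = sum(f) * dv
    return [value / norm for value in f]

# Hypothetical slit positions (arcsec) and velocity grid (km/s).
x_grid = [-20.0, -10.0, 0.0, 10.0, 20.0]
v_grid = [-600.0 + 10.0 * i for i in range(121)]
dv = v_grid[1] - v_grid[0]

# Toy galaxy: exponential light profile, linearly rising rotation, fixed dispersion.
intensity = [math.exp(-abs(x) / 10.0) for x in x_grid]
losvd = [gaussian_losvd(v_grid, v_mean=5.0 * x, sigma=100.0) for x in x_grid]

# PV(X, v_los) = I(X) * F(X, v_los): one row per slit position.
pv = [[i_x * f for f in f_x] for i_x, f_x in zip(intensity, losvd)]

# A row of pv is the LOSVD at that X scaled by I(X); integrating the row
# over v_los recovers the surface brightness I(X).
recovered_intensity = [sum(row) * dv for row in pv]
for x, i_true, i_rec in zip(x_grid, intensity, recovered_intensity):
    print(f"X = {x:+6.1f}  I = {i_true:.4f}  integral of PV = {i_rec:.4f}")
```

Because each LOSVD is normalized to unit integral on the same velocity grid, the integral of each row reproduces the input intensity.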

As we have discussed above, successful derivation of a LOSVD from a 
spectrum relies on our belief that the LOSVD should be a fairly smooth 
function of velocity. Since the kinematic properties of galaxies are generally 
found to vary on lengthscales comparable to their photometric scales (see 
below), we also expect a galaxy’s LOSVDs to vary slowly with position. We 
should thus expect PV(X, v_los) to be a smooth function in both dimensions.
This expectation has been exploited by Prada et al. (1996): instead of
deriving separate LOSVDs for each point along the major axis of a disk galaxy,
they solved for them all simultaneously, essentially constructing the full PV
diagram subject to the constraint that PV(X, v_los) should be a
smoothly-varying, positive function. This end is achieved by generalizing the
UGD algorithm described above, so that PV(X, v_los) is modeled by the sum of
a set of bivariate Gaussians that are placed at different mean values of X
and v_los such that the Gaussians are unresolved in both dimensions. The
extra smoothness imposed in the spatial dimension provides extra noise sup- 
pression in the extraction of velocity information, resulting in smaller errors 
in the derived kinematics. Merrifield & Kuijken (1994) have shown that it 
is also possible to use the full two-dimensional data set simultaneously in 
order to find a full dynamical model which satisfies the equations of stellar 
dynamics (BT Chapter 4) whilst producing the best fit to all the spectral 
data. In this case, noise suppression is achieved by forcing the kinematics to 
follow a consistent dynamical model. 

Clearly, for any galaxy that appears extended in two dimensions on 
the sky, we can extract PV diagrams for different cuts through the galaxy, 
ultimately producing a complete data cube for the number of stars as a 
function of two spatial coordinates and line-of-sight velocity. With techni- 
cal developments such as fiber-fed spectrographs which can simultaneously 
obtain spectra from a two-dimensional array of points, such data cubes will 
become increasingly straightforward to obtain. These data sets are directly
analogous to the HI data cubes that we have discussed in Chapter 8. Note, 
however, that in the case of HI, the gas is usually restricted to a thin layer, 
so, except for edge-on systems, the line of sight only intersects the gas in a 
small region. For a rounder stellar system such as an elliptical galaxy, each 
spectrum samples the kinematics of stars at many points along the line of 
sight. Further, the collisional nature of gas means that all such material at 
a particular point will be moving at approximately the same velocity. Stars, 
on the other hand, behave as an effectively collisionless medium, so the stars 
within a small spatial region can have a wide variety of velocities. Thus, the 
interpretation of a stellar data cube is more complicated than that of the cor- 
responding gaseous data. Nevertheless, we shall see below that studies of the 
detailed stellar kinematics of galaxies as derived from absorption-line spectra 
are opening up literally a new dimension in the study of galaxy dynamics. 


11.2 The stellar kinematics of elliptical galaxies 


We begin our discussion of stellar kinematics by looking at elliptical galaxies. 
Since these systems have relatively simple morphologies, we might reason- 
ably hope that their kinematics admit correspondingly simple interpreta- 
tions. Further, the absence of dust in ellipticals means that the observed 
kinematics will not be complicated by the effects of obscuration. On the 
down-side, the absence of gas in these systems means that we cannot use 
this simple kinematic tracer to study the structure of ellipticals. Since in 
this section we are concerned with the internal dynamics of galaxies, we cite 
all velocities relative to the systemic velocity, v_sys, of the galaxy under
consideration; that is, all quoted values of v_los are the result of subtracting v_sys
from the measured heliocentric velocity. 


11.2.1 Large-scale properties 


Major-axis kinematics The simplest measurements that we can make 
of the large-scale kinematics of an elliptical galaxy come from the analysis of 
spectra obtained from a long-slit spectrograph with the slit aligned along the 
major axis of the galaxy. As we have seen in §4.3, the outer parts of ellip- 
tical galaxies are very faint, so until quite recently it has only been possible 
to obtain crude estimates of their large-scale stellar kinematic properties by 
measuring v̄_los and σ_los (§11.1.1) as a function of position along the major
axis. However, with a large telescope, a modern spectrograph, and enough 
patience, it is now possible to obtain spectra of high enough quality to quan- 
tify the LOSVD more fully. As we shall see below, the distribution of stellar 
velocities in an elliptical is generally reasonably close to Gaussian, so spectra 
Figure 11.6 The large-scale major-axis kinematics for a sample of four giant elliptical
galaxies. The LOSVDs of these systems have been parameterized using the truncated
Gauss-Hermite expansion (§11.1.2). The dotted lines indicate the effective radius, R_e, for
each galaxy. The Hubble classification (shown in parentheses) is based on the galaxy’s
average ellipticity outside R_e/2. [After Carollo et al. (1995)]


of these systems are well suited to analysis by the Gauss-Hermite technique 
described in §11.1.2. 

Figure 11.6 shows a montage of major axis kinematic parameters, which 
have been derived out to beyond the effective radii for a sample of typical 
large elliptical galaxies. The first point that is evident from this figure is that 
the various kinematic parameters display an overall symmetry (or antisym- 
metry) about the center of each system. This symmetry implies that there 
is a large-scale coherence to the kinematics of these systems, so we may hope
that simple equilibrium models can explain their current dynamical state.
Further, the smooth variations between parameters derived from adjacent
independent spectra provide us with some confidence that the analysis is
returning meaningful estimates of properties of the LOSVD. 

The second feature that is apparent in Figure 11.6 is that in most cases 
v̄_los changes sign as we pass through the center of the galaxy. Thus, these
systems display net rotation about their minor axes. This finding is con- 
sistent with the simple idea that rotation tends to flatten systems. If the 
motions of the stars in an elliptical were entirely random, there would be no 
preferred axis in the system, and the galaxy would be spherical. However, 
Figure 11.7 Values of (v/σ)* for a sample of spheroidal systems
as a function of their absolute blue magnitudes, M_B. Elliptical
galaxies are shown as circles, with unfilled circles indicating dwarf
spheroidal systems. The bulges of disk galaxies are shown as circles
with lines through them. In a number of cases no rotation was
detected, so only upper limits are quoted for (v/σ)*. [From the
data published in Davies & Illingworth (1983), Davies et al. (1983)
and Bender & Nieto (1990)]


when there is some systematic rotation in addition to the random velocities, 
the rotational axis picks out a preferred direction in the galaxy and it is 
natural to expect the galaxy to be flattened parallel to this axis. The degree 
to which the system is flattened is then dictated by the balance between the 
rotational velocity and the random motion, σ_los. However, inspection of
Figure 11.6 shows that this cannot be the full story. Three of the galaxies have
very similar ellipticities (as indicated by their Hubble classifications), and
comparable random velocities, σ_los, yet the amplitudes of their rotational
velocities differ dramatically. 


In fact, the amount of rotation required to explain the observed flatten- 
ing of elliptical galaxies is generally much higher than the observed rotation 
speed. To make this point quantitatively, a galaxy is usually characterized 
by v_max, the maximum observed value of v̄_los, and σ_0, the average value of
Table 11.1 Values of (v_max/σ) as a function of ellipticity ε for the
rotationally-flattened model

    ε      v_max/σ       ε      v_max/σ       ε      v_max/σ
  0.050    0.228       0.325    0.688       0.475    0.941
  0.075    0.283       0.350    0.727       0.500    0.989
  0.100    0.331       0.375    0.767       0.525    1.040
  0.125    0.375       0.400    0.809       0.550    1.093
                       0.425    0.851       0.575    1.150
                                            0.600    1.211


σ_los in some small region close to the galaxy’s center.² Since the amount of
flattening that we would expect due to rotation depends on the balance
between ordered and random motions, we then measure the quantity v_max/σ_0.
If the galaxy were truly flattened by rotation, a simple relationship would
hold, essentially independent of the galaxy’s inclination, between v_max/σ_0
and the observed ellipticity of the galaxy (BT §4.3). Table 11.1 presents this
relation. The rotation of elliptical galaxies is conventionally characterized
by the ratio

    $$(v/\sigma)^* \equiv \frac{v_{\max}/\sigma_0}{(v_{\max}/\sigma_0)_{\rm model}}, \qquad (11.14)$$
where the denominator is the value of v_max/σ_0 that one obtains from
Table 11.1. Thus, we would expect (v/σ)* ≈ 1 if a galaxy is flattened
by rotation. In reality, for bright elliptical galaxies, (v/σ)* is
substantially less than unity (see Figure 11.7), so the flattening of these
galaxies cannot be attributed to rotation. Interestingly, Figure 11.7 shows
that intermediate-brightness (−20 < M_B < −18) elliptical galaxies lie much
closer to (v/σ)* = 1 than their brighter kin; these systems do seem to be
substantially flattened by their rotation. However, studies of the faintest
elliptical galaxies reveal that this trend does not continue. For compact dwarf
ellipticals, these studies are made using a long-slit spectrograph in the same
manner as for the giant galaxies, but for the very diffuse dwarf spheroidal
galaxies, the kinematics can only be measured by obtaining the velocities
of the individually-resolved stars. Analyses of the kinematics of these
systems show that, although flattened, many of them rotate very slowly if at
all (Bender & Nieto 1990); as Figure 11.7 shows, (v/σ)* is clearly less than
unity for these systems. There is also no strong evidence from this plot for
any kinematic distinction between the diffuse dwarf spheroidal systems and
other more compact faint galaxies. However, the number of faint galaxies
whose kinematics have been measured is still small, and better statistics
would be required to make a rigorous comparison.
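As a worked illustration of how (v/σ)* is evaluated in practice, the following sketch interpolates linearly between a subset of the Table 11.1 entries and divides an observed v_max/σ_0 by the result. The choice of linear interpolation, the function names, the refusal to interpolate across wide gaps in the listed values, and the example galaxy are all assumptions made for illustration; they are not part of the procedure described in the text.

```python
# A subset of the (ellipticity, v_max/sigma) pairs of Table 11.1 for the
# rotationally-flattened model, sorted by ellipticity.
TABLE = [
    (0.050, 0.228), (0.075, 0.283), (0.100, 0.331), (0.125, 0.375),
    (0.350, 0.727), (0.375, 0.767), (0.400, 0.809), (0.425, 0.851),
    (0.475, 0.941), (0.500, 0.989), (0.525, 1.040), (0.550, 1.093),
    (0.575, 1.150),
]

def model_v_over_sigma(ellipticity, max_gap=0.05):
    """Linearly interpolate the tabulated v_max/sigma at a given ellipticity."""
    for (e_lo, r_lo), (e_hi, r_hi) in zip(TABLE, TABLE[1:]):
        if e_lo <= ellipticity <= e_hi:
            if e_hi - e_lo > max_gap:
                # Do not interpolate across ranges with no listed values.
                raise ValueError("ellipticity falls in a gap of the listed values")
            t = (ellipticity - e_lo) / (e_hi - e_lo)
            return r_lo + t * (r_hi - r_lo)
    raise ValueError("ellipticity outside the listed range")

def v_over_sigma_star(v_max, sigma_0, ellipticity):
    """(v/sigma)* = observed v_max/sigma_0 divided by the model value."""
    return (v_max / sigma_0) / model_v_over_sigma(ellipticity)

# Hypothetical galaxy: v_max = 60 km/s, sigma_0 = 250 km/s, ellipticity 0.4.
print(round(v_over_sigma_star(60.0, 250.0, 0.4), 3))
```

For this hypothetical galaxy the ratio comes out well below unity, the situation the text describes for bright ellipticals.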


2 Note that this value can depend sensitively on the region over which the averaging is 
performed (see Figure 11.13), so it is not really a well-defined measure of random motions. 
Evidence as to the nature of the rotational motion in elliptical galaxies
comes from inspection of the skewness parameter, h_3. Galaxies with
substantial rotational velocities tend to have significantly skewed LOSVDs, in
the sense that h_3 has the opposite sign to v̄_los (see Figure 11.6). The simplest
way to produce such a combination of v̄_los and h_3 is if the LOSVD consists of
two components, one of which is a non-rotating broad distribution, and the
other of which is a rapidly-rotating narrow distribution. This combination
will produce both a non-zero value for v̄_los and a skewed total distribution
with the appropriate sign for h_3 (see Figure 11.5 and Problem 11.5), and can
reproduce the LOSVDs of rapidly-rotating elliptical galaxies [Cinzano & van
der Marel (1994), and references therein]. Support for this two-component
picture of ellipticals comes from the strong correlation between (v/σ)* and
the shape of a galaxy’s isophotes that was mentioned in §4.3.3: elliptical
galaxies with “disky” isophotes (a_4 > 0) have systematically larger rotation
velocities than those with more rounded or boxy shapes (see Figure 4.39).
A plausible explanation of this correlation is that many elliptical galaxies
harbor a rotationally-supported disk component. The size of this component
relative to the rest of the galaxy then dictates both the departure of
the isophotes from ellipses and the kinematics along the major axis.
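The two-component argument can be checked numerically. The sketch below superimposes a broad, non-rotating Gaussian and a narrow, rotating Gaussian and computes the mean and the classical skewness of the sum. The weights, velocities and dispersions are hypothetical, and the classical skewness is used only as a rough stand-in for the Gauss-Hermite coefficient h_3, which is not computed here.

```python
import math

def gaussian(v, mean, sigma):
    """Unit-area Gaussian evaluated at velocity v."""
    return math.exp(-0.5 * ((v - mean) / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))

def losvd_moments(components, v_lo=-1500.0, v_hi=1500.0, n=6001):
    """Mean, dispersion and skewness of a weighted sum of Gaussians.

    components is a list of (weight, mean, sigma) tuples; the moments are
    obtained by direct summation on a uniform velocity grid.
    """
    dv = (v_hi - v_lo) / (n - 1)
    grid = [v_lo + i * dv for i in range(n)]
    f = [sum(w * gaussian(v, m, s) for w, m, s in components) for v in grid]
    norm = sum(f)
    mean = sum(v * fv for v, fv in zip(grid, f)) / norm
    var = sum((v - mean) ** 2 * fv for v, fv in zip(grid, f)) / norm
    third = sum((v - mean) ** 3 * fv for v, fv in zip(grid, f)) / norm
    return mean, math.sqrt(var), third / var ** 1.5

# Hypothetical components (weight, mean velocity in km/s, dispersion in km/s).
components = [
    (0.7, 0.0, 250.0),    # broad, non-rotating component
    (0.3, 150.0, 80.0),   # narrow, rapidly rotating component
]
mean, sigma, skew = losvd_moments(components)
print(f"mean = {mean:.1f} km/s, dispersion = {sigma:.1f} km/s, skewness = {skew:.3f}")
```

With these numbers the mean velocity is positive while the skewness is negative, so the two have opposite signs, in the sense described above.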

It is not clear whether this simple picture of distinct disk and elliptical 
components reflects the true dynamical state of elliptical galaxies: it is pos- 
sible that the ‘disk’ of an elliptical galaxy is just the tail end of a continuous 
spectrum of populations that extends from a truly ellipsoidal, slowly rotat- 
ing population to a rapidly spinning and highly flattened population. This 
ambiguity is closely analogous to the one that we encountered in our study of 
the thin and thick disks of the Milky Way (§10.4.3). Whatever the resolution 
to this question, it is apparent that disk-like properties will be most strongly 
manifested along the major axis of the galaxy, so the above simple analysis, 
in which the value of (v/σ)* obtained from major-axis data is compared to
the overall shape of the galaxy, is likely to be misleading. It is only by fitting 
dynamical models which allow for the observed complexity of structure in 
both the kinematics and photometry that we can hope to reveal the intrinsic 
properties of elliptical galaxies. 

Finally in this section, we turn to the kurtosis parameter of the
major-axis stellar kinematics, h_4 (see Figure 11.5). It is apparent from
Figure 11.6 that h_4 is detectably non-zero on the major axes of many
ellipticals, tending to be positive within their effective radii, R_e, and
zero, or slightly negative, outside R_e. The reason for this transition is not
clear, and there are also exceptions to this general rule: Figure 11.6 also
shows the case of NGC 5018, in which h_4 remains positive to well beyond R_e.
At present, there are few elliptical galaxies on which sufficient care has
been lavished to measure h_4 beyond R_e, so it is not possible to draw any
general conclusions from these subtle variations in the shape of the LOSVD.
What is clear, however, is that beyond R_e both h_3 and h_4 are generally
quite small, indicating that the LOSVD does not depart dramatically from a
Gaussian. This observation has an important bearing on the question of whether
elliptical galaxies are surrounded by dark halos like their spiral kin (§8.2.4).


Detection of dark halos As we have seen, the kinematics of giant ellip- 
tical galaxies tend to be dominated by random motions. The amplitude of 
these random motions is dictated by the gravitational potential of the galaxy. 
The velocity dispersion and its variation with radius should thus provide us 
with information on the distribution of mass in elliptical galaxies. Perhaps, 
therefore, we can use stellar kinematics to provide evidence that elliptical 
galaxies possess massive dark halos. We face two problems when trying to 
find and interpret such evidence. First, the distribution of mass at large radii 
primarily affects the observed kinematics at similar projected radii. Since the 
outer parts of elliptical galaxies are very faint, it is difficult to obtain spectra 
of adequate quality to constrain the kinematics at these large radii. The sec- 
ond, more fundamental, problem is that a priori we do not know anything 
about the orbits that stars follow in elliptical galaxies. A poor fit between 
the observed kinematics and a dynamical model with no dark matter might 
imply that a massive halo is required, but it could also arise because the 
model contains the wrong distribution of stellar orbits. Neither of the above 
difficulties is encountered in the case of disk galaxies: 21-cm observations 
allow us to trace the kinematics of the ubiquitous HI in these systems out to 
large radii; and this gas is known to follow approximately circular orbits, so 
there is little ambiguity as to its orbital distribution. Consequently, it has 
proved much harder to confirm the presence of dark halos around elliptical 
galaxies than around disk systems. 

As is evident from Figure 11.6, the first of the problems outlined above
is now being overcome: there exist spectra for a number of elliptical galaxies
that allow their kinematics to be measured out to beyond 2R_e. These data
show that the line-of-sight velocity dispersions of elliptical galaxies decline
slowly or even remain flat to beyond 2R_e. Simple constant mass-to-light
ratio models for these systems predict that the velocity dispersion should
decline rapidly outside R_e, and the absence of such a decline has been cited
as evidence for surrounding dark halos.

Unfortunately, by adopting a more complicated distribution of stellar
orbits it is possible to reproduce the observed slow decline in the velocity
dispersion without invoking a dark halo. As Figure 11.8 illustrates, at
small projected radii, σ_los is dominated by the radial component, σ_R, of the
velocity-dispersion tensor σ, whereas at large radii its tangential component,
σ_φ, dominates. Thus, if the stellar orbits are primarily tangential, so that
σ_φ > σ_R, it is possible for σ_los to remain constant or increase with projected
radius, even if both σ_R and σ_φ are radially decreasing (as we would expect
in the absence of a dark halo). This ambiguity between the distribution of
stellar orbits and the distribution of mass in elliptical galaxies is discussed
in more detail in BT §4.2.
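The geometry behind this statement can be made explicit. For stars at radius r in the equatorial plane, seen along a sight line of projected radius R, the line of sight makes an angle α with the local radial direction, where sin α = R/r. Assuming (as an idealization) that the velocity ellipsoid is aligned with the radial and tangential directions, the local contribution is σ_los² = σ_R² cos²α + σ_φ² sin²α. The sketch below evaluates this expression; the function name and the dispersion values are hypothetical, and no integration along the line of sight is attempted.

```python
import math

def local_sigma_los(r, R, sigma_R, sigma_phi):
    """Local line-of-sight dispersion at radius r on a sight line of
    projected radius R (both in the same units, with r >= R).

    Assumes the velocity ellipsoid is aligned with the radial and
    tangential directions, so that
        sigma_los^2 = sigma_R^2 cos^2(alpha) + sigma_phi^2 sin^2(alpha),
    with sin(alpha) = R / r.
    """
    if r < R:
        raise ValueError("a sight line of projected radius R never reaches r < R")
    sin2 = (R / r) ** 2
    return math.sqrt(sigma_R ** 2 * (1.0 - sin2) + sigma_phi ** 2 * sin2)

# Hypothetical, tangentially anisotropic dispersions (km/s), held fixed for clarity.
sigma_R, sigma_phi = 100.0, 200.0
for r in (1.0, 1.5, 3.0, 10.0):
    value = local_sigma_los(r, 1.0, sigma_R, sigma_phi)
    print(f"r/R = {r:4.1f}  local sigma_los = {value:6.1f} km/s")
```

At the tangent point (r = R) only the tangential component is seen, while far along the sight line (r much larger than R) only the radial component contributes. A sight line through the center therefore samples mainly radial motions, and one at large projected radius, where most of the light comes from near the tangent point, samples mainly tangential motions.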

Happily, we can distinguish between the effects of orbital anisotropy 
(σ_φ ≠ σ_R) and those of a dark halo by looking at the shape of the LOSVD.
Figure 11.8 Schematic diagram of the variation with radius of the random
motions in the equatorial plane of an elliptical galaxy. Dashed lines
mark two lines of sight. Projections of the velocity ellipsoid (Box 10.2)
at various points in this plane are represented by shaded ellipses. Note
that although both σ_φ and σ_R decrease with radius, the line-of-sight
dispersion actually increases as we move out in projected radius from A
to B.


In a system dominated by tangential motions, many stars will be moving at
close to the local circular speed, v_c, leading to strong concentrations in the
LOSVD at ±v_c. At their most extreme, these concentrations would produce a
double-peaked LOSVD, but even in less extreme cases they will generally result
in a distribution that has a significantly lower kurtosis than a Gaussian, so
that the kurtosis ξ_4 < 3 (see Figure 11.5). The small observed values of |h_4| at
large radii in elliptical galaxies (see Figure 11.6) effectively rule out the
possibility that the slow decline in σ_los(R) can be attributed to orbital
anisotropy, and thus provide the best stellar-kinematic evidence that these
systems are embedded in massive dark halos.
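The claim that concentrations at ±v_c lower the kurtosis below the Gaussian value of 3 can be verified with a simple idealization: a LOSVD made of two equal Gaussians of width s centered on +v_c and −v_c. This two-Gaussian form is an assumption made for illustration, not a model taken from the text. Its variance is s² + v_c² and its fourth moment is 3s⁴ + 6s²v_c² + v_c⁴, so the kurtosis is 3 − 2v_c⁴/(s² + v_c²)².

```python
def kurtosis_of_symmetric_pair(v_c, s):
    """Kurtosis of two equal Gaussians of width s centered on +v_c and -v_c."""
    variance = s ** 2 + v_c ** 2
    fourth_moment = 3.0 * s ** 4 + 6.0 * s ** 2 * v_c ** 2 + v_c ** 4
    return fourth_moment / variance ** 2

# Hypothetical intrinsic width of 100 km/s and increasing peak separation.
for v_c in (0.0, 100.0, 200.0, 400.0):
    print(f"v_c = {v_c:5.0f} km/s  kurtosis = {kurtosis_of_symmetric_pair(v_c, 100.0):.3f}")
```

The kurtosis equals 3 only when the two peaks coincide, and falls steadily as they separate.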


Kinematic mapping   Valuable information can also be obtained from
elliptical galaxies’ kinematics away from their major axes. For example,
Figure 11.9 shows the major- and minor-axis kinematics of the E3 galaxy
NGC 4406. Except very near the center, the rotation velocity measured along
the minor axis, v_min, exceeds the major-axis rotation velocity, v_maj. When this
phenomenon was first discovered, it was interpreted as evidence that the
galaxy is a prolate, spindle-like system which rotates about its longest axis
(Davies & Birkinshaw 1986).

In NGC 4406 the kinematic minor axis (§8.2.4) is nearly perpendicular 
to the photometric minor axis. Whenever these two axes are significantly 
misaligned there is little justification for assuming that the galaxy under 
observation is intrinsically axisymmetric. When a system is triaxial, the 
kinematic and photometric minor axes are liable to be misaligned for two 
reasons: first, the photometric minor axis usually does not coincide with the 
Figure 11.9 Major- and minor-axis kinematics of the E3 galaxy
NGC 4406. [From data published in Franx, Illingworth & Heckman 
(1989b)] 


Figure 11.10 The distribution of 
the estimated kinematic misalign- 
ment angles for a sample of elliptical 
galaxies. [From the data published 
in Franx, Illingworth & de Zeeuw 
(1991)] 


projection on to the sky of any of the galaxy’s three principal axes (§4.2.3); 
second, in three dimensions the rotation axis of a triaxial system can lie 
anywhere in the plane containing the galaxy’s longest and shortest principal 
axes.³ Given adequate two-dimensional velocity information, we can measure


3 The freedom in the orientation of the rotation axis arises because the system’s angular 
momentum is the sum of contributions from long- and short-axis tube orbits — see BT §3.4. 
the angle Ψ between the kinematic and photometric minor axes; Ψ is called
the projected kinematic misalignment. Figure 11.10 shows a plot of the
estimated kinematic misalignment, Ψ_est = tan⁻¹(v_min/v_maj), where the
velocities are measurements of v̄_los at some fiducial point on the minor and
major axes. Simple models of the pattern of mean streaming in a galaxy
suggest that Ψ_est provides a reasonable estimate of Ψ (Franx, Illingworth
& de Zeeuw 1991), and since spectra have been obtained along the minor
and major axes for quite a number of systems, Ψ_est has the merit of being
measurable for a reasonable sample of elliptical galaxies. Figure 11.10 is a
histogram of values of Ψ_est for a sample of 44 galaxies. The highest density
of objects occurs near the value Ψ_est = 0 characteristic of axisymmetric
systems, but a significant fraction of galaxies have values of Ψ_est that are
significantly greater than zero. This finding strongly suggests that at least
some ellipticals are triaxial.
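The estimator tan⁻¹(v_min/v_maj) is simple to evaluate. In the sketch below, the use of absolute values (so that the angle is folded into the range 0° to 90°) and the use of `atan2` to handle v_maj = 0 are conveniences assumed here rather than conventions stated in the text, and the example velocities are hypothetical.

```python
import math

def estimated_misalignment_deg(v_min, v_maj):
    """Estimated kinematic misalignment arctan(v_min / v_maj), in degrees.

    v_min and v_maj are mean line-of-sight velocities (same units) at
    fiducial points on the minor and major axes. Absolute values fold the
    result into the range [0, 90] degrees.
    """
    return math.degrees(math.atan2(abs(v_min), abs(v_maj)))

# Hypothetical examples (km/s).
print(estimated_misalignment_deg(0.0, 80.0))    # rotation seen only on the major axis
print(estimated_misalignment_deg(40.0, 40.0))   # equal rotation on both axes
print(estimated_misalignment_deg(60.0, 0.0))    # rotation seen only on the minor axis
```

A galaxy with no minor-axis rotation returns 0°, while one with rotation only along its minor axis returns 90°.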

In §4.3.3 we encountered photometric evidence that elliptical galaxies are 
triaxial, but we found that observations of just the projected distribution of 
light in these systems are not sufficient to specify uniquely the distribution of 
their three-dimensional shapes. Does the extra information provided by the 
distribution of N(Ψ_est) give us a better handle on the shapes of ellipticals?
Unfortunately, this dynamical approach also introduces a new unknown into 
the analysis: as mentioned above, the angular momentum of the galaxy need 
not align with any of the galaxy’s principal axes, so we must also specify 
the intrinsic kinematic misalignment, which is the angle between the 
angular momentum vector and the direction of the galaxy’s shortest axis. On 
account of this extra free parameter, the observed kinematic misalignment 
does not provide much new information on the shapes of ellipticals, and a 
wide range of shape distributions is found to be consistent with both the 
projected kinematic misalignments and the projected ellipticities of these 
systems (Franx, Illingworth & de Zeeuw 1991). 

Perhaps we can obtain a deeper understanding of the intrinsic shapes of 
elliptical galaxies from more extensive kinematic observations. For example, 
Statler (1994) has explored the possibility that the shape of individual galax- 
ies can be estimated by supplementing the major- and minor-axis rotation 
data with mean line-of-sight velocities along several intermediate position 
angles. This approach seems to work well for simple triaxial galaxy models, 
but it produces rather disappointing results when applied to more realistic 
models generated by N-body simulations. 

Since mean streaming motions make only a small contribution to the to- 
tal kinetic-energy budget of ellipticals, it might be more informative to look 
at the dominant random motions away from the major and minor axes. This 
possibility could be explored by producing a kinematic map of σ_los(X,Y)
along all lines of sight through an elliptical. Ultimately, we could even
construct the complete LOSVD at all points across the system’s face. We could
thus observe the density of stars as a function of three variables (two spatial
coordinates and line-of-sight velocity): the kinematic data cube discussed
in §11.1.3. The stellar dynamics of a galaxy are fully specified by its
distribution function, and this function also depends on only three variables
(the isolating integrals of motion; see BT §4.4.1). Thus, there is at least
some hope that we could transform the kinematic data cube directly into 
the distribution function, allowing us to obtain a complete understanding of 
the structure of an elliptical galaxy from its observable properties. However, 
this hope has yet to be realized. 


11.2.2 Core properties 


Decoupled cores The kinematics of elliptical galaxies are not infre- 
quently strange at small radii. Figure 11.9 illustrates this phenomenon with 
the case of NGC 4406. As we have already discussed, the main body of 
NGC 4406 rotates about its photometric major axis, but there is a strong 
antisymmetric signal in the major-axis data, and this signature implies that 
the core of this galaxy rotates about its minor axis. Thus, this system con- 
tains a kinematically-distinct core (KDC), whose angular momentum 
vector lies perpendicular to that of the bulk of the galaxy. Strangely, the 
lower panels of Figure 11.9 show no corresponding peculiarities in the veloc- 
ity dispersion. Figure 11.11 shows the kinematics of IC 1459, which presents 
an even more extreme example of a KDC. In this otherwise-normal giant E3 
galaxy, the mean velocity on the major axis changes sign at ~ 10 arcsec from 
the center, implying that the stars at these small radii rotate in the oppo- 
site direction to the main body of the galaxy. Studies reveal that KDCs are 
fairly common, occurring in maybe a quarter of all ellipticals (Franx, Illing- 
worth & Heckman 1989b). Kinematic mapping of these systems also shows 
that almost all KDCs show no projected kinematic misalignment: with the 
exception of NGC 5982, in which the core rotates about the photometric major
axis (Wagner, Bender & Möllenhoff 1988), all the detected KDCs rotate
about their galaxies’ minor axes. 
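A counter-rotating core of the kind described for IC 1459 shows up as a sign change in the mean velocity along one side of the major axis. The sketch below locates such sign changes by linear interpolation between adjacent measurements. The data values are invented for illustration and are not measurements of any galaxy; a real analysis would also need to account for the measurement errors.

```python
def sign_changes(radii, velocities):
    """Radii at which the mean velocity crosses zero.

    radii and velocities are equal-length sequences ordered by radius; each
    crossing is located by linear interpolation between adjacent points.
    """
    crossings = []
    points = list(zip(radii, velocities))
    for (r0, v0), (r1, v1) in zip(points, points[1:]):
        if v0 * v1 < 0.0:
            crossings.append(r0 + (r1 - r0) * (-v0) / (v1 - v0))
    return crossings

# Hypothetical major-axis data on one side of the nucleus (arcsec, km/s).
radii = [2.0, 5.0, 8.0, 12.0, 20.0, 30.0]
velocities = [90.0, 60.0, 20.0, -20.0, -40.0, -45.0]
print(sign_changes(radii, velocities))
```

For these invented data the single crossing lies midway between the measurements at 8 and 12 arcsec.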

KDCs rotate rapidly, having values of (v/o)* near unity. This find- 
ing makes one suspect that they are small disk components in their hosts. 
Support for this suspicion comes from the shapes of their LOSVDs, which 
generally show significantly skewed shapes [see, for example, Franx & Illing- 
worth (1988) for the detailed kinematics of IC 1459]. As we saw in §11.2.1, 
such skewed LOSVDs will occur when a rapidly-rotating disk with a small 
intrinsic dispersion is superimposed on a “normal” slowly-rotating elliptical 
galaxy. However, high-resolution studies of the photometry of these cores 
using HST do not show the disky isophotes that we might expect from such 
components, so they must be fairly faint (Forbes, Franx & Illingworth 1995). 
Even though the disk component does not contribute much to the overall 
surface brightness of the galaxy, its rapid rotation causes it to dominate 
the observed value of v̄_los at small radii, whereas its small velocity
dispersion means that it does not have much impact on the radial variation in
Figure 11.11 The mean line-of-sight velocity as a function of
position along the major and minor axes of IC 1459. [From data
published in Franx, Illingworth & Heckman (1989b)]


σ_los. Small disks thus provide an explanation for the peculiar kinematics of
decoupled cores. 


Detection of central black holes As we have seen in §4.6.2, it is now 
fairly universally accepted that the highly energetic phenomena seen in ac- 
tive galactic nuclei are ultimately powered by matter accreting on to super- 
massive black holes. One question that this conclusion raises is whether 
galaxies that show no signs of nuclear activity are quiescent because they 
are short of fuel, or whether they lack central massive black holes entirely. 
Stellar kinematics may be able to provide the answer: a quiescent massive 
black hole will dominate the gravitational force at small radii in a galaxy, so 
we would expect it to produce a detectable effect on the motions of the stars 
in this region. This possibility, and other techniques for detecting inactive 
galactic nuclei, have been reviewed extensively by Kormendy & Richstone 
(1995). 
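To give a sense of the size of the effect, the circular speed at radius r around a point mass M is v_c = (GM/r)^(1/2). The sketch below evaluates this for a purely illustrative black-hole mass; the mass, the radii and the neglect of the surrounding stellar mass are all assumptions made here, not values taken from the text.

```python
import math

# Gravitational constant in units of pc (km/s)^2 per solar mass.
G = 4.30091e-3

def circular_speed(mass_msun, radius_pc):
    """Circular speed in km/s at radius_pc (parsecs) around a point mass
    of mass_msun (solar masses), ignoring any other mass."""
    return math.sqrt(G * mass_msun / radius_pc)

# Hypothetical 1e9 solar-mass black hole.
for radius_pc in (1.0, 10.0, 100.0):
    print(f"r = {radius_pc:6.1f} pc  v_c = {circular_speed(1.0e9, radius_pc):7.1f} km/s")
```

Because the speed falls only as the inverse square root of radius, the stellar motions respond most strongly very close to the center, which is why the search concentrates on the innermost kinematics.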

A central massive black hole should cause the stellar velocity dispersion
to increase in its vicinity. Observations of the E0 galaxy M87 (whose nuclear
activity has been assumed to be caused by a massive black hole) do, indeed,
show an increase in σ_los at small radii (Figure 11.12). It has therefore been
argued that these data imply that M87 contains a central black hole (Sargent
et al. 1978). Unfortunately, here we encounter the same problem that afflicts
the search for massive dark halos discussed above: the observed kinematics
might also be explained without a black hole by an appropriate orbital
arrangement of stars. Indeed, it has been demonstrated that the observed
rise in the line-of-sight velocity dispersion at small radii would also occur
in a system with no black hole if the stellar orbits are predominantly radial
(BT §4.2; cf. Figure 11.8, in which the central line-of-sight dispersion is
depressed by having predominantly tangential motions). Thus, measurements
of the radial variation of σ_los do not alone suffice to reveal unambiguously
the presence of a central massive black hole.
Figure 11.12 The variation in properties of the LOSVD derived
by applying the truncated Gauss-Hermite algorithm (§11.1.2) to 
spectra obtained close to the center of M87. [From the data pub- 
lished in van der Marel (1994)] 


As in the case of dark halos, this ambiguity can be largely resolved if 
the complete LOSVD is known: radial orbits yield a LOSVD that is sharply 
peaked about the systemic velocity, while a black hole accelerates stars in its 
vicinity to produce a LOSVD with long tails to high velocity. Unfortunately, 
as we have discussed previously, such high-velocity tails are hard to detect 
in studies of stellar absorption lines, since they tend to be suppressed by the 
process of continuum subtraction. 

In practice, the shape of the LOSVD has provided some assistance in the 
search for black holes. In the case of M87, for example, van der Marel (1994) 
has shown that the small departures in the LOSVD from a Gaussian implied 
by the small values of $h_3$ and $h_4$ in Figure 11.12 are well-reproduced by a
model which contains a central black hole of mass $5 \times 10^9\,M_\odot$. However, he
was not able to rule out the possibility that the observed kinematics could be 
explained by a black-hole free model with a mainly radial orbit distribution. 

Rather more success has been achieved in studies of the kinematics of 
more rapidly rotating elliptical galaxies such as M32. Their kinematics are 
easier to interpret because net rotation can only arise from tangential mo- 
tions. Thus, the range of possible stellar orbit distributions is much smaller 
for rapidly rotating systems than for almost non-rotating systems such as
M87. The price that you have to pay for this extra information is that these 
systems are not spherically symmetric because rotation picks out a preferred 
axis. This extra complication led van der Marel et al. (1994) to produce a 
fairly complete kinematic map of M32 by aligning a long-slit spectrograph at 
several different position angles across the object. They fitted the resulting 
LOSVDs (derived from the truncated Gauss-Hermite method) to the pre- 
dictions of a number of dynamical models, each at an assumed value of the 
system’s unknown inclination. They were able to produce respectable fits to 
the LOSVDs along all the observed position angles with models containing 
a central mass of $\sim 3 \times 10^6\,M_\odot$.

One of the difficulties in detecting a central black hole from stellar kine- 
matics is that it only dominates the mass of the galaxy at small radii, so 
it only influences the kinematics of the innermost parts of the galaxy (see 
Problem 11.4). Thus, black-hole induced phenomena such as enhancements 
in the velocity dispersion will only be visible at small radii. Further, the 
effects of seeing on optical observations (§4.2.2) will contaminate spectra ob- 
tained at small radii with light from larger radii, so the kinematic signature of 
the black hole will appear superimposed on the normal kinematics at larger 
radii, diluting its significance. Spectra from HST have therefore proved a 
boon to the black-hole hunters. Figure 11.13 illustrates the advantage of 
HST’s superior resolution: the amplitude of the enhancement in both $\bar v_{\rm los}$
and $\sigma_{\rm los}$ at small radii in M32 is much greater in the higher-resolution
observations. As we probe to smaller radii, it becomes more difficult to explain
away the enhanced kinematics in terms of orbital anisotropy, so such HST 
data provide the best stellar-kinematic constraints on the masses of central
black holes. 
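
The scale on which a central black hole matters can be estimated with a short calculation. The sketch below uses the characteristic radius $R_{\max} \approx G M_{\rm BH}/v_c^2$ quoted in Problem 11.4 and converts it to an angle on the sky. The function names, the illustrative resolutions (1 arcsec for ground-based seeing, 0.1 arcsec for HST), and the criterion that $R_{\max}$ need only exceed the resolution are assumptions made for illustration; they are not values taken from the text.

```python
import math

# Gravitational constant in units of pc (km/s)^2 / Msun.
G = 4.30091e-3
ARCSEC_PER_RADIAN = 180.0 * 3600.0 / math.pi


def influence_radius_pc(m_bh_msun, v_c_kms):
    """Radius R_max ~ G M_BH / v_c^2 inside which the black hole dominates."""
    return G * m_bh_msun / v_c_kms ** 2


def angular_size_arcsec(radius_pc, distance_mpc):
    """Small-angle size of a length radius_pc seen from distance_mpc."""
    return radius_pc / (distance_mpc * 1.0e6) * ARCSEC_PER_RADIAN


def min_resolvable_mass_msun(resolution_arcsec, distance_mpc, v_c_kms):
    """Mass whose R_max just matches the angular resolution (assumed criterion)."""
    radius_pc = resolution_arcsec / ARCSEC_PER_RADIAN * distance_mpc * 1.0e6
    return radius_pc * v_c_kms ** 2 / G


if __name__ == "__main__":
    # Resolutions below are illustrative assumptions, not measured values.
    for label, res in (("ground, 1 arcsec", 1.0), ("HST, 0.1 arcsec", 0.1)):
        mass = min_resolvable_mass_msun(res, distance_mpc=10.0, v_c_kms=200.0)
        print(f"{label}: M_BH >~ {mass:.1e} Msun")
```

Under this criterion the threshold mass scales linearly with the angular resolution, which is why a tenfold gain in resolution lowers the detectable mass by the same factor.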

Despite the convincing arguments that have been made for central black 
holes in a number of stellar systems, the cases are by no means watertight. 
For a start, most analyses to date have only considered simple two-integral
distribution functions (BT §4.5.2), whereas in general axisymmetric systems 
will have distribution functions that depend on three integrals of motion 
(BT §4.5.3). Further, the inner parts of ellipticals may not be axisymmetric. 
Hence, while there is no reason to think that sophisticated models that take 
third integrals and triaxiality into account will dramatically change the con- 
clusions we have reached above, we cannot yet make definitive statements 
about the central masses of elliptical galaxies. 

The data shown in Figure 11.13 should also serve as a warning. The 
low velocity dispersion observed in one of the HST observations close to the 
nucleus appears to be a genuine result. The absence of a similar glitch in 
the mean velocity suggests that the deviant point in the dispersion is not 
the result of anything simple like a patch of dust obscuring the nucleus, so 
it is hard to understand how a plausible model of M32 can account for this 
datum. It is very worrying that the only galaxy for which we have data of 
this quality displays such an apparently inexplicable phenomenon. 

Finally, it should be borne in mind that even the best stellar-kinematic 
“detections” of black holes are, in fact, no such thing. Although it is reason- 
ably well established that several galaxies have a central mass concentration 
which cannot be attributed to the normal luminous stellar component, there 
is no direct evidence from the stellar kinematics that this mass must take the 
form of a black hole. In the case of M32, for example, if its $\sim 3 \times 10^6\,M_\odot$
central mass concentration were uniformly distributed through the $\sim 0.3$ pc
radius region within which the HST observations imply that it is confined,
then its density would be only $\sim 10^{-12}$ kg m$^{-3}$. This astrophysically “ultra-high”
density is directly comparable to the density of material in the best
room-temperature vacua attained in terrestrial laboratories; it is 18 orders
of magnitude smaller than the mean density inside the Schwarzschild radius
of a $3 \times 10^6\,M_\odot$ black hole! Clearly, even HST does not probe to anywhere
near the Schwarzschild radius of the putative black hole, so there is no cast- 
iron guarantee that the mass is in this ultra-compact form. The inference 
that the central mass in an inactive galactic nucleus is in the form of a black 
hole is thus necessarily indirect. It relies on dynamical arguments as to the 
short timescale on which any alternative (such as a dense cluster of neutron 
stars) would itself collapse to form a massive black hole (e.g. Goodman & Lee 
1989). Such indirect reasoning is less satisfying than a true detection, but it 
does mean that Occam’s Razor strongly favors the black-hole interpretation 
of the data. 
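
The density comparison made above can be checked with a few lines of arithmetic. The sketch below takes the mass ($3 \times 10^6\,M_\odot$) and confinement radius (0.3 pc) quoted for M32, spreads the mass uniformly over a sphere, and compares the result with the mean density inside the Schwarzschild radius $2GM/c^2$. The physical constants are standard rounded values and the variable names are illustrative assumptions.

```python
import math

G = 6.674e-11       # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8         # speed of light, m/s
M_SUN = 1.989e30    # solar mass, kg
PARSEC = 3.0857e16  # parsec, m


def mean_density(mass_kg, radius_m):
    """Mean density of a mass spread uniformly through a sphere."""
    return mass_kg / (4.0 / 3.0 * math.pi * radius_m ** 3)


def schwarzschild_radius(mass_kg):
    """Schwarzschild radius 2GM/c^2 of a non-rotating black hole."""
    return 2.0 * G * mass_kg / C ** 2


mass = 3.0e6 * M_SUN
rho_observed = mean_density(mass, 0.3 * PARSEC)
rho_horizon = mean_density(mass, schwarzschild_radius(mass))
orders = math.log10(rho_horizon / rho_observed)

if __name__ == "__main__":
    print(f"density within 0.3 pc:          {rho_observed:.1e} kg/m^3")
    print(f"density within Schwarzschild r: {rho_horizon:.1e} kg/m^3")
    print(f"difference: {orders:.1f} orders of magnitude")
```

With these inputs the observed region has a mean density of order $10^{-12}$ kg m$^{-3}$, about 18 orders of magnitude below the mean density inside the Schwarzschild radius.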

In spite of these caveats, there are now sufficient numbers of nearby galaxies
with central black hole mass estimates for it to be worth seeing if the
mass of the black hole correlates with other properties of the galaxy.
Figure 11.14 shows the mass estimates for the black holes, $M_{\rm BH}$, in a number
of nearby galaxies plotted as a function of the absolute magnitude of the
spheroidal component of the galaxy, $M_{B,{\rm bulge}}$: its bulge luminosity if it is a
disk galaxy, or its total luminosity if it is an elliptical. It is heartening that
there is no discernible systematic difference between the black hole masses 
inferred from stellar kinematics and those inferred from the kinematics of 
nuclear gas disks or masers (see §7.2.4). There appears to be a correlation 
between bulge luminosity and black hole mass, implying, not unreasonably, 
that brighter galaxies contain larger central black holes. However, some care 
must be taken in interpreting such apparent correlations. Larger galaxies 
are relatively scarce, so they will generally lie at larger distances. At these 
larger distances, it would not be possible to resolve the small linear scale over 
which a small black hole influences the stellar dynamics, so only large black 
holes will be detectable. Thus, there will be a positive correlation between 
$M_{B,{\rm bulge}}$ and detected values of $M_{\rm BH}$, even if the quantities are intrinsically
uncorrelated. However, the observed relation between $M_{\rm BH}$ and $M_{B,{\rm bulge}}$
appears to be much stronger than can be explained by such selection effects 
(Kormendy & Richstone 1995), so the correlation is probably intrinsic. As 
black-hole mass estimates improve with better data and modeling, and as 
larger samples of galaxies are observed, the demographics of the black-hole 
population will provide the first firm observational basis for testing theories 
of the formation of these bizarre objects. 

Figure 11.15 Stellar position-velocity diagram for the major axis
of the edge-on S0 galaxy NGC 7332 [...] grayscale indicates the density of
stars as a function of both line-of-[...] the major axis. The kinematics
were derived using the UGD algo[...]

11.3 The stellar kinematics of disk galaxies


On account of the multiplicity of components that make up disk galaxies, 
the interpretation of the kinematics of these systems is more complicated 
than that of ellipticals. Figure 11.15 shows the PV diagram for the major 
axis of an edge-on disk galaxy. Close to the center of this galaxy, the light is 
dominated by the bulge component, so the PV diagram shows a high density 
of stars which are spread by their random motions to both positive and 
negative velocities (where, once again, we are measuring velocities relative 
to the systemic velocity of the galaxy). At large radii, the signature of the 
rotationally-supported disk is readily apparent in the high density of stars 
traveling at close to the local circular speed. However, as we shall see below, 
there are significant random stellar motions even in the disk. Further, the 
line-of-sight component of the circular speed varies along the line of sight 
through this edge-on galaxy. We therefore find that the populated region of 
the PV diagram for this system has a finite spread in line-of-sight velocities 
even where the disk component dominates. At intermediate radii, there are 
contributions to the PV diagram from both the disk and the bulge, and the 
LOSVDs become quite complex. Indeed, analyses of the full LOSVDs of 
disk galaxies at such radii have shown that these distributions are highly 
non-Gaussian — the observed skewed distributions have been successfully 
decomposed into a rapidly-rotating disk component plus a slowly-rotating 
bulge component [Kuijken & Merrifield (1993), Scorza & Bender (1995); see 
Problem 11.5]. 
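
The way a disk-plus-bulge mixture produces a skewed LOSVD can be illustrated with a toy model. The sketch below is not the decomposition method of the papers cited above: it simply treats the LOSVD as the sum of two Gaussians and evaluates the moments of the mixture analytically. The bulge dispersion $v_c/\sqrt{2}$ follows the setup of Problem 11.5, while the values $v_c = 200$ km/s and a disk dispersion of 30 km/s, and all function names, are assumptions chosen for illustration.

```python
import math


def mixture_moments(components):
    """Mean, dispersion and skewness of a sum of Gaussians.

    components is a list of (weight, mean, dispersion) tuples.
    """
    total = sum(w for w, _, _ in components)
    mean = sum(w * m for w, m, _ in components) / total
    var = sum(w * (s ** 2 + (m - mean) ** 2) for w, m, s in components) / total
    # Third central moment of a Gaussian offset by d from the overall mean
    # is 3 s^2 d + d^3.
    mu3 = sum(
        w * (3.0 * s ** 2 * (m - mean) + (m - mean) ** 3)
        for w, m, s in components
    ) / total
    return mean, math.sqrt(var), mu3 / var ** 1.5


def disk_plus_bulge(f_bulge, v_c=200.0, sigma_disk=30.0):
    """Toy LOSVD: non-rotating bulge plus a disk streaming at v_c (assumed values)."""
    bulge = (f_bulge, 0.0, v_c / math.sqrt(2.0))
    disk = (1.0 - f_bulge, v_c, sigma_disk)
    return mixture_moments([bulge, disk])


if __name__ == "__main__":
    for f in (0.0, 0.1, 0.3, 0.5, 0.9):
        mean, sigma, skew = disk_plus_bulge(f)
        print(f"f = {f:.1f}: mean = {mean:6.1f}  sigma = {sigma:6.1f}  skewness = {skew:+.3f}")
```

For a disk-dominated mixture the mean velocity is positive while the skewness is negative, because the bulge contributes a tail of stars stretching back toward the systemic velocity.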

The complex kinematic structure of a disk galaxy becomes more appar- 
ent when one obtains a complete stellar-kinematic data cube by mapping 
the LOSVD all across its face (see §11.1.3). Figure 11.16 shows kinematic 
parameters derived from such a data cube in the central region of the edge- 
on disk galaxy M104 (the Sombrero Galaxy). Along the major axis, the 
signature of the disk shows up clearly in both the mean streaming motion 
and the skewness parameter, $h_3$. The difference in sign between $\bar v_{\rm los}$ and $h_3$

Figure 11.16 A kinematic map of the central region of M104 (the Sombrero
Galaxy), showing the variation in mean velocity, line-of-sight dispersion,
and skewness with position. The major axis of the system runs horizontally
across each panel, with the photometric center of the galaxy indicated. The
range of the grayscale is shown to the right of each panel. [After Emsellem
et al. (1996) courtesy of E. Emsellem]


reflects the concentration of stars at close to the circular speed, with a tail of 
stars stretching back towards the galaxy’s systemic velocity (see Figure 11.5, 
and Problem 11.6). Since the disk component is dominated by streaming 
motions, it depresses the values of $\sigma_{\rm los}$ close to the major axis. Away from
the plane of the disk, the light comes mostly from the bulge component, in
which random motions dominate, and so $\sigma_{\rm los}$ is larger. The enhancement
in $\sigma_{\rm los}$ at very small radii in this galaxy has been successfully modeled by
Emsellem, Monnet & Bacon (1994) as arising from a central black hole (see
§11.2.2) with a mass of $\sim 10^9\,M_\odot$.


11.3.1 Bulge kinematics 


The kinematics of a galactic bulge are somewhat harder to determine than 
are the kinematics of an elliptical galaxy: light from the surrounding disk 
contaminates the observable properties of the bulge, and obscuration due 
to dust frequently further complicates the analysis. On the positive side, 
however, the presence of a disk with measurable inclination defines a direction 
which might reasonably be expected to define one of the principal axes of 
the bulge — a piece of information that is generally difficult to obtain in the 
case of elliptical galaxies. 

Observations of the bulge of a disk galaxy that lies reasonably close to 
face-on allow us to measure its central velocity dispersion with little con- 
tamination from the disk. Combining such kinematic measurements with 
the bulge’s gross photometric properties suggests that these systems respect 
the same fundamental plane as elliptical galaxies (see §4.3.4), and therefore
satisfy the standard $D_n$-$\sigma$ relation of equation (4.43) (Dressler 1987). This
similarity between the properties of bulges and ellipticals presumably reflects 
some common aspect of their formation and evolution, but since we do not 
yet have a clear understanding of the nature of the fundamental plane, the 
significance of this discovery is still unclear. 

As we have discussed above, the contribution to the total luminosity 
from the disk increases as we look to larger distances from the center of 
the galaxy. On these larger scales it therefore becomes difficult to isolate 
the bulge kinematics in the PV diagram. One solution to this problem is 
to observe early-type disk galaxies in which the bulge is so dominant that 
disk contamination is small, but the large bulges which can be observed 
by this approach may not be representative. Alternatively, observations of 
the bulges of edge-on galaxies, such as that shown in Figure 11.16, can be 
used: the surface brightness of the disk drops rapidly with distance from the 
galaxy’s plane (§4.4), so we see fairly pure bulge kinematics in regions away 
from the major axis of an edge-on system. 

Kinematic maps derived from observations of edge-on galaxies reveal 
that bulge kinematics vary smoothly with distance from the galaxies’ central 
planes. Consequently, some estimate can be made of a bulge’s major-axis 
kinematics by extrapolating off-axis observations down to the plane of the 
galaxy. We can thus estimate values of $v_{\max}$ and $\sigma_0$ for galaxy bulges. Such
analysis has shown that rotational velocities are, on average, higher for bulges
than for comparable elliptical galaxies (Davies & Illingworth 1983, Kormendy
& Illingworth 1982). Figure 11.7 shows that in bulges $(v/\sigma)^*$ is close to unity,
implying that these systems are flattened by rotation. 

Confirmation that bulges can be adequately described as rotationally- 
flattened systems has come from more complete model fitting. After allowing 
for the gravitational influence of the disk component, Jarvis & Freeman 
(1985) have shown that a simple rotationally-flattened model can reproduce 
both the photometric properties of edge-on bulges and the kinematics derived 
from spectra at many locations away from their major axes. 

Study of the kinematics of edge-on bulges using multiple observations 
away from their major axes has also revealed that systems with boxy iso- 
photes (§4.2.2) rotate cylindrically: their mean streaming velocities do not 
decline with distance from their central planes (Kormendy & Illingworth 
1982). Such bulges have been modeled as rotationally-flattened axisymmet- 
ric systems (Rowley 1988), but the properties of boxiness and cylindrical 
rotation can also be ascribed to a prolate bar-like structure (see §4.4.7). 


11.3.2 Disk kinematics 


As we have discussed above, we can only measure uncontaminated stellar disk 
kinematics by looking away from the central bulge region. Unfortunately, 

Figure 11.17 The mean rotation velocity for both gas (open circles)
and stars (filled squares) in the disk galaxy NGC 488. The
gaseous rotation curve is adapted from Peterson (1980a), and the
stellar data are taken from Gerssen, Kuijken & Merrifield (1997).
The dotted line shows the circular speed for this galaxy predicted
purely from the stellar kinematics.


since the luminosity of a disk drops exponentially with radius (§4.4), these 
outer parts of disks are quite faint. We must therefore either make very long 
observations to obtain the requisite signal in the outer parts of galaxies, or 
try to decompose the complex kinematics at smaller radii into the separate 
disk and bulge contributions. 


Rotational motion Kinematic studies of the disks of external galaxies reveal
that, in all but face-on systems, $\bar v_{\rm los} \gg \sigma_{\rm los}$ at all radii: the stars in other
disk galaxies, like those in the disk of the Milky Way (§10.3), follow approximately
circular orbits with little random motion. We can therefore apply
methods developed for calculating rotation curves from gas disks (§8.2.4) to
the values of $\bar v_{\rm los}$ obtained from stellar spectra. Generally, the rotation curves
derived from stellar observations have similar properties to those obtained 
from gas: the stellar rotation increases rapidly with radius near the center of 
the galaxy, and then flattens out to an approximately constant value. The 
similarity between the rotation curves from these two different populations 
confirms that the disk stars follow orbits similar to the circular motions of 
the gaseous component. Note, though, that the outer parts of disk galaxies 
are optically very faint, so it is not possible to measure stellar kinematics 
to the large radii that the HI data allow. Stellar rotation curves do not, 
therefore, provide any evidence for dark halos. 

Figure 11.17 shows the rotation curves of both the gaseous and stellar
components of the disk of the Sb galaxy NGC 488. The stellar kinematics
are limited in extent by the faintness of the disk at large radii, and by
contamination from the bulge component for R < 20 arcsec. The gaseous kinematics
contain a gap at radii around 20 arcsec since there are no measured emission
lines in this region. Where the data do overlap, it is clear that there is a
small systematic offset between the two rotation curves. This offset is the
asymmetric drift that we have already encountered in the solar neighborhood
(§10.3.1). It arises because stars do not follow purely circular orbits,
but have some random component to their motions (BT §4.2). As in the
Milky Way, the measured asymmetric drift can be used to characterize the
size of this random motion relative to the ordered streaming (see below).

Figure 11.18 Stellar PV diagram for the major axis of the Sa galaxy
NGC 3593 (as marked on the accompanying image). The kinematics were
derived using the UGD algorithm. [DSS image from the Palomar/National
Geographic Society Sky Survey, reproduced by permission; the kinematic
data were obtained by M. Merrifield and K. Kuijken]

As we have seen in §8.2.5, comparisons between the gaseous and stellar
rotation curves of S0 galaxies have turned up the unexpected result that,
in a significant minority of cases, these species rotate in opposite directions.
These small counter-rotating gas disks in S0 galaxies could well arise from
the late accretion and tidal disruption of small satellite galaxies. Rather
harder to explain is the case of NGC 3626, in which as much as $10^9\,M_\odot$ of
hydrogen (atomic and ionized) counter-rotates relative to the stars across a
wide radial range (Ciri, Bettoni & Galletta 1995).

The strangest counter-rotating disk phenomenon of them all, however, is 
exemplified by NGC 4550. Absorption features in spectra of this apparently
normal edge-on S0 galaxy were found to have a double structure (Rubin,
Graham & Kenney 1992), and analysis of the galaxy’s LOSVDs showed that
these features arise because 50% of the stars in the disk of this galaxy follow
approximately circular orbits in one direction, while the other 50% follow
similar orbits in the opposite direction (Rix et al. 1992).⁴ A handful of
similar systems have now been identified. For example, Bertola et al. (1996) 
found clear signs of a small counter-rotating stellar disk in the kinematics of 


4 Such a bimodal velocity distribution provides an example of a case where the kine- 
matics cannot be derived using a method which assumes that the LOSVD is anywhere 
near Gaussian in shape. 
the Sa galaxy NGC 3593. As Figure 11.18 illustrates, this counter-rotation
shows up clearly in the galaxy’s stellar PV diagram. Since there are relatively
few galaxies for which spectra have been obtained of the quality necessary to
produce PV diagrams, the fraction of galaxies that contain counter-rotating
disks remains uncertain. However, a study by Kuijken, Fisher & Merrifield
(1996) indicates that less than ~ 10% of S0 galaxies contain even modest
counter-rotating components, so these objects are probably quite rare. 


Random motions As discussed above, we can infer the presence of non-circular
stellar motions by observing asymmetric drift. In principle, we can
obtain a direct measure of the random component of the stellar motions by
extracting values of $\sigma_{\rm los}$ from spectra. If we define the usual cylindrical polar
coordinate system, $(R,\phi,z)$, aligned with the axis of symmetry of the disk,
then spectra of face-on galaxies provide a measure of the z-component of
the stars’ velocity dispersion, $\sigma_z$, whereas observations of edge-on systems
allow us to measure a combination of the R- and $\phi$-components, $\sigma_R$ and $\sigma_\phi$.
Of course, all that we can observe directly is the average velocity dispersion
integrated along the line of sight. Further, the usual techniques for measuring
kinematics do not allow for the differences that we would expect to find in
stars of different ages (cf. the Milky Way, §10.3.2), but rather they produce
some weighted average kinematics depending on the choice of template star.
Some care must therefore be taken when converting the observed kinematics 
into localized measurements of the dynamics of disks. 

In practice, even line-of-sight averaged quantities are difficult to observe. 
Face-on disks are intrinsically very faint, and the random motions in the 
z-direction tend to be small. It is therefore difficult to obtain spectra at 
sufficiently high signal-to-noise ratios for $\sigma_z$ to be measurable. For edge-on
systems, we have seen that the broadening of the LOSVD is dominated by 
the different projections of the rotational velocity that one sees along any 
given line of sight. In order to measure the relatively small contribution to 
the total broadening of the LOSVDs due to the random stellar motions in 
the R- and $\phi$-directions, we must be able to characterize accurately the total
width and shape of the velocity distributions. Once again, data obtained 
using high throughput spectrographs on large telescopes are beginning to 
overcome these difficulties. Observations of a dozen or so galaxy disks with 
a range of orientations have allowed us to characterize the variations in the 
components of velocity dispersion with radius — for a summary, see Bottema 
(1993). 

Observations of face-on galaxies show that $\sigma_z$ declines exponentially
with radius. This relation has a simple physical interpretation: from the
equations of stellar dynamics, it is straightforward to show that, for a thin
disk,

$$\sigma_z^2 = 2\pi G \Sigma z_0, \tag{11.15}$$

where $\Sigma$ is the surface density of the disk, and $z_0$ is a characteristic measure
of the scale-height of the disk perpendicular to the plane (see Problem 11.7).
If $z_0$ does not vary with radius (as observations of edge-on galaxies suggest;
§4.4), then $\sigma_z \propto \sqrt{\Sigma}$. We also know that the surface brightness of galaxies
follows the exponential law, $I = I_0 \exp(-R/R_d)$ (§4.4). Therefore, if mass
follows light so that $\Sigma \propto I$, we would expect $\sigma_z$ to decline exponentially with
radius with an e-folding length of $2R_d$. Comparisons between kinematic and
photometric measurements reveal that $\sigma_z$ does, indeed, decline with a scale-length
of approximately twice that of the surface brightness (Bottema 1993).
Note, however, that the photometric scale-lengths used in this analysis were 
generally based on optical measurements, and, as mentioned in §4.4, the 
infrared scale-lengths of disks tend to be systematically smaller. Since the 
infrared photometry is more likely to reflect the underlying mass distribution, 
the above simple analysis is not secure. 
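
The factor of two between the kinematic and photometric scale-lengths follows directly from equation (11.15) and can be verified numerically. The sketch below evaluates $\sigma_z(R)$ for an exponential disk with constant scale-height and measures its e-folding length. The central surface density (500 $M_\odot$ pc$^{-2}$), disk scale-length (3 kpc) and scale-height (300 pc) are illustrative assumptions, as are the function names.

```python
import math

# Gravitational constant in units of pc (km/s)^2 / Msun.
G = 4.30091e-3


def surface_density(R_pc, central_surface_density, R_d_pc):
    """Exponential disk: Sigma(R) = Sigma_0 exp(-R / R_d), in Msun / pc^2."""
    return central_surface_density * math.exp(-R_pc / R_d_pc)


def sigma_z(R_pc, central_surface_density, R_d_pc, z0_pc):
    """Vertical dispersion from sigma_z^2 = 2 pi G Sigma z_0, in km/s."""
    sigma = surface_density(R_pc, central_surface_density, R_d_pc)
    return math.sqrt(2.0 * math.pi * G * sigma * z0_pc)


def efolding_length(profile, R1, R2):
    """E-folding length of an exponentially declining profile between R1 and R2."""
    return (R2 - R1) / math.log(profile(R1) / profile(R2))


# Illustrative disk parameters (assumptions, not measurements).
SIGMA_0 = 500.0  # Msun / pc^2
R_D = 3000.0     # pc
Z_0 = 300.0      # pc

scale = efolding_length(lambda R: sigma_z(R, SIGMA_0, R_D, Z_0), 1000.0, 8000.0)

if __name__ == "__main__":
    print(f"central sigma_z: {sigma_z(0.0, SIGMA_0, R_D, Z_0):.1f} km/s")
    print(f"e-folding length of sigma_z: {scale:.0f} pc (disk R_d = {R_D:.0f} pc)")
```

The measured e-folding length equals $2R_d$ regardless of the chosen normalization, since only the square root in equation (11.15) matters for the radial dependence.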

Evaluation of the other two components of the velocity dispersion is
more complicated since both contribute to $\sigma_{\rm los}$ in observations of inclined
galaxies, and there is the additional contribution to the broadening of the
LOSVD from the projection of the mean streaming. Modeling of the observed
values of $\sigma_{\rm los}$ from edge-on galaxies has revealed that, like $\sigma_z$, $\sigma_R$ and
$\sigma_\phi$ decline approximately exponentially with radius with an e-folding length
twice that of the disk surface brightness (Bottema 1993). We might therefore
tentatively conclude that the ratio between the different components of
random stellar motions remains approximately constant, and hence that the 
shape of the velocity ellipsoid (see Box 10.2) in disk galaxies does not vary 
dramatically with radius. 

Note that the inference that we have drawn as to the constant shape of 
the velocity ellipsoid is rather indirect. We have measured the z-component 
of the random motions from observations of face-on galaxies, and the other 
two components from observations of other systems with edge-on orientations.
In fact, it is possible to estimate all three components in a single
disk-dominated galaxy that has intermediate inclination $i$. When we observe
such a galaxy at some point $R$ from its center along its major axis the
observed dispersion is given by

$$\sigma_{\rm maj}^2(R) = \sigma_\phi^2 \sin^2 i + \sigma_z^2 \cos^2 i. \tag{11.16}$$

The annulus of radius $R$ within the disk cuts the apparent minor axis at a
distance $R\cos i$ from the center. Here we will find a line-of-sight velocity
dispersion

$$\sigma_{\rm min}^2(R\cos i) = \sigma_R^2 \sin^2 i + \sigma_z^2 \cos^2 i. \tag{11.17}$$

Figure 11.19 The line-of-sight [...] ted to these data. [After Gerssen,
Kuijken & Merrifield (1997)]

Thus, by measuring the variations in the line-of-sight dispersions along the
principal axes, we can measure two linear combinations of $\sigma_R(R)$, $\sigma_\phi(R)$
and $\sigma_z(R)$. To close this system of equations, we can turn to the theory of
stellar dynamics, which shows how dispersion components are related to one
another. For example, if the epicycle approximation (BT §4.2) is valid, then
the radial and tangential dispersions are simply related to the circular speed,
$v_c$:

$$\frac{\sigma_\phi^2}{\sigma_R^2} = \frac{1}{2}\left(1 + \frac{d\ln v_c}{d\ln R}\right) \tag{11.18}$$

[equation (4-52) of BT, with the Oort constants replaced by the definitions
of §10.3.3]. The circular speed itself can be obtained from the rotational
velocities of the stars, corrected for asymmetric drift [see above and BT
equation (4-34)]. It is thus possible to solve for $\sigma_R(R)$, $\sigma_\phi(R)$, $\sigma_z(R)$ and
$v_c(R)$ just from the observed mean velocity along the major axis, $\bar v_{\rm maj}(R)$,
combined with the line-of-sight dispersions, $\sigma_{\rm min}(R)$ and $\sigma_{\rm maj}(R)$.
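
The algebra of this inversion can be sketched in a few lines. The code below assumes that the logarithmic slope of the circular speed, $d\ln v_c/d\ln R$, is already known (zero for a flat rotation curve) instead of solving for it through the asymmetric-drift correction, and that the major- and minor-axis dispersions refer to the same annulus of the disk. It is therefore a simplified illustration of the method, not the model-fitting procedure of the published analysis, and the function names are hypothetical.

```python
import math


def project_dispersions(sig_R, sig_phi, sig_z, incl_deg):
    """Forward model: major- and minor-axis line-of-sight dispersions."""
    i = math.radians(incl_deg)
    s2, c2 = math.sin(i) ** 2, math.cos(i) ** 2
    sig_maj = math.sqrt(sig_phi ** 2 * s2 + sig_z ** 2 * c2)
    sig_min = math.sqrt(sig_R ** 2 * s2 + sig_z ** 2 * c2)
    return sig_maj, sig_min


def solve_ellipsoid(sig_maj, sig_min, incl_deg, dlnvc_dlnR=0.0):
    """Recover (sig_R, sig_phi, sig_z) using the epicycle closure relation.

    The closure is sig_phi^2 / sig_R^2 = k with k = (1 + dlnvc_dlnR) / 2.
    """
    i = math.radians(incl_deg)
    s2, c2 = math.sin(i) ** 2, math.cos(i) ** 2
    k = 0.5 * (1.0 + dlnvc_dlnR)
    if s2 < 1e-12 or c2 < 1e-12 or abs(1.0 - k) < 1e-12:
        raise ValueError("system is degenerate for this inclination or slope")
    # Subtracting the two projections removes sig_z:
    # sig_min^2 - sig_maj^2 = sig_R^2 (1 - k) sin^2 i.
    sig_R2 = (sig_min ** 2 - sig_maj ** 2) / ((1.0 - k) * s2)
    sig_phi2 = k * sig_R2
    sig_z2 = (sig_min ** 2 - sig_R2 * s2) / c2
    if min(sig_R2, sig_phi2, sig_z2) < 0.0:
        raise ValueError("no physical solution for these inputs")
    return math.sqrt(sig_R2), math.sqrt(sig_phi2), math.sqrt(sig_z2)


if __name__ == "__main__":
    # Round trip with made-up values: flat rotation curve, 40 degree inclination.
    truth = (100.0, 100.0 * math.sqrt(0.5), 70.0)
    sig_maj, sig_min = project_dispersions(*truth, incl_deg=40.0)
    print(solve_ellipsoid(sig_maj, sig_min, incl_deg=40.0))
```

The inversion fails for face-on or edge-on disks and for solid-body rotation, where the two projected dispersions no longer carry independent information; this is why an intermediate inclination is required.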

The first application of this technique has been made by Gerssen, Kuijken
& Merrifield (1997). Figure 11.19 shows the results of fitting a simple
model that meets the above requirements to the observed major- and minor-axis
dispersions of NGC 488 [as well as $\bar v_{\rm maj}(R)$; see Figure 11.17]. A check
on this model fit is provided by the predictions made for the circular speed,
$v_c(R)$, which can be compared to the observed gaseous rotation velocities
(see Figure 11.17). The results of this analysis indicate that the components
of the stellar velocity ellipsoid in NGC 488 are in the approximate ratio
$\sigma_R : \sigma_\phi : \sigma_z \approx 1 : 0.8 : 0.7$. Thus, the ordering of these components is the same as
for the giant stars in the solar neighborhood of the Milky Way (Table 10.3), 
the only other place in which this ratio has been measured. More detailed 
comparisons between the velocity ellipsoids of disk galaxies must await obser- 
vations of larger samples of these systems, and more sophisticated modeling 
of higher-quality data. 

Problems


11.1 By taking Fourier transforms of equations (11.2), (11.8), and (11.9),
show that

$$\widetilde{\rm CCF} = \widetilde{\rm ACF}\,\tilde F,$$

and hence that the cross-correlation function can be obtained by convolving
the auto-correlation function with the LOSVD.

11.2 A simple modified form of the Gaussian LOSVD is given by 
$$F_{\rm mod}(v_{\rm los}) \propto (1 + a v_{\rm los}) \exp[-(v_{\rm los} - \bar v)^2/2\sigma^2]. \tag{11.19}$$

If $|a| \ll 1/(|\bar v| + \sigma)$, show that the mean and dispersion of this modified

distribution are given by 01.5 = 0 + ac? and ojos = o + ad? /o respectively. 


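11.3 Calculate the skewness, $\xi_3$, and kurtosis, $\xi_4$, for the velocity distributions

$$F_{\rm box}(v) = \begin{cases} 0.5 & -1 < v < 1, \\ 0 & \text{otherwise;} \end{cases} \tag{11.20}$$

$$F_{\rm tri}(v) = \begin{cases} 1 - |v| & -1 < v < 1, \\ 0 & \text{otherwise;} \end{cases} \tag{11.21}$$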
and eo ; 
Raye 4 0 Sees (11.22) 


QO otherwise. 


11.4 If a galaxy with a flat circular speed curve, $v(r) = v_c$, has a black
hole of mass $M_{\rm BH}$ added to its center, show that the observed kinematics
are likely to be modified out to a characteristic radius $R_{\max} \approx G M_{\rm BH}/v_c^2$.
For a galaxy with a circular speed $v_c = 200\,{\rm km\,s^{-1}}$ viewed from a distance of
10 Mpc, what minimum black hole mass should be kinematically detectable
(a) from the ground, and (b) using HST?


11.5 The kinematics along the major axis of a nearly edge-on disk galaxy
can be approximated by combining a population of disk stars all moving at
velocity $v_c$ with a population of bulge stars which have no net rotation and
a line-of-sight velocity dispersion of $v_c/\sqrt{2}$. As we look to increasing radii,
the fraction, $f$, of bulge stars decreases. Calculate how $\bar v_{\rm los}$, $\sigma_{\rm los}$, $\xi_3$ and $\xi_4$
vary with $f$.


11.6 For an exactly edge-on pure stellar disk in which the stars all follow
circular orbits at speed $v_c$, and in which the density of stars decreases with
radius in the manner $n(r) \propto r^{-2}$, show that the LOSVD is given by

$$F(v_{\rm los}) \propto \begin{cases} (v_c^2 - v_{\rm los}^2)^{-1/2} & 0 < v_{\rm los} < v_c, \\ 0 & \text{otherwise.} \end{cases} \tag{11.23}$$

Calculate $\bar v_{\rm los}$, $\sigma_{\rm los}$, $\xi_3$ and $\xi_4$ for such a disk.

11.7 Assume that a thin stellar disk can be treated locally as if it were an
infinite plane-parallel sheet whose properties only vary with distance $z$ from
the disk midplane. In this approximation, the density of stars, $\nu$, the vertical
velocity dispersion, $\sigma_z$, and the gravitational potential, $\Phi$, are related by the
Jeans equation,

$$\frac{1}{\nu}\frac{d(\nu\sigma_z^2)}{dz} = -\frac{d\Phi}{dz} \tag{11.24}$$

[BT equation (4-36)]. Define a scaleheight, $z_0$, such that

$$\frac{d\ln(\nu\sigma_z^2)}{dz} = -\frac{1}{z_0} \tag{11.25}$$

(so that if $\nu\sigma_z^2$ falls exponentially with height, $z_0$ is the usual e-folding length-scale).
By considering a point that lies far enough from the plane to be above
most of the mass in the disk, use equation (11.24) and Gauss’ theorem (BT
§2.1) to derive equation (11.15).

Appendices


Appendix A: Gravitational deflection of light


We derive some important results connected with the gravitational deflection 
of light. We assume that the gravitational field is weak and the deflection 
angles are small. We assume, moreover, that the region within which essen- 
tially all the deflection occurs is small compared to the scale of the Universe. 
These restrictions enable us to impose a coordinate system in which the line 
element of general relativity is, to a good approximation, given by (see §9.1 
of Weinberg 1972) 


$$ d\tau^2 = (c^2 + 2\Phi)\,dt^2 - \left(1 - \frac{2\Phi}{c^2}\right)(dx^2 + dy^2 + dz^2), \tag{A.1} $$


where $\Phi$ is the usual Newtonian gravitational potential. The paths of light
rays in this metric are most conveniently determined by Fermat’s principle,
which states that the paths taken by light rays between a source and an
observer extremize the elapse of coordinate time as photons pass between
source and observer.¹ Let $(dx, dy, dz)$ be the change in the spatial coordinates
of a photon in time $dt$; then, since $d\tau$ has to vanish along a photon’s


1 Physically Fermat’s principle applies because when the time elapse is not stationary, 
neighboring paths allow the observer to ‘see’ the source at different times when it is in 
different phases of oscillation. These different ‘views’ average to zero. 
world line, we have from (A.1) that

$$ dt = \left(\frac{1 - 2\Phi/c^2}{c^2 + 2\Phi}\right)^{1/2} ds \simeq \left(1 - \frac{2\Phi}{c^2}\right)\frac{ds}{c}, \tag{A.2} $$

where $ds = \sqrt{dx^2 + dy^2 + dz^2}$ is the coordinate distance between two points
on the ray and we have used the weak-field condition $|\Phi| \ll c^2$ to expand
the square root that follows from (A.1). Since $\Phi < 0$, equation (A.2) states that in
our coordinate system light propagates precisely as it would if there were no
gravitational field but space were filled by a medium of refractive index

$$ n = 1 - \frac{2\Phi}{c^2} \geq 1. \tag{A.3} $$


In view of the result just derived, we now determine the path for which 
the light-travel time 


$$ t = \frac{1}{c}\int n\,ds \tag{A.4} $$


is stationary with respect to small changes in the path. Since we are inter- 
ested in light paths that are nearly rectilinear, we may orient our coordinate 
system such that one coordinate, say z, increases monotonically along the 
path. When we employ $z$ rather than $s$ as the integration variable, (A.4)
becomes

$$ ct = \int dz\, n(\mathbf{x}) \left[\left(\frac{dx}{dz}\right)^2 + \left(\frac{dy}{dz}\right)^2 + 1\right]^{1/2}. \tag{A.5} $$
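When we vary the value of $x(z)$ along this path by a small amount $\delta x(z)$,
$ct$ changes by

$$ c\,\delta t = \int dz \left\{ \frac{\partial n}{\partial x}\,\delta x \left[\left(\frac{dx}{dz}\right)^2 + \left(\frac{dy}{dz}\right)^2 + 1\right]^{1/2} + n \left[\left(\frac{dx}{dz}\right)^2 + \left(\frac{dy}{dz}\right)^2 + 1\right]^{-1/2} \frac{dx}{dz}\frac{d\,\delta x}{dz} \right\}. \tag{A.6} $$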
We now integrate the second term in this integral by parts. Since every path
must start at the source and finish at the observer, $\delta x$ vanishes at both ends
of the range of integration. It follows that the integrated term vanishes.
Hence

$$ c\,\delta t = \int dz\,\delta x \left\{ \frac{\partial n}{\partial x} \left[\left(\frac{dx}{dz}\right)^2 + \left(\frac{dy}{dz}\right)^2 + 1\right]^{1/2} - \frac{d}{dz}\left( n \left[\left(\frac{dx}{dz}\right)^2 + \left(\frac{dy}{dz}\right)^2 + 1\right]^{-1/2} \frac{dx}{dz} \right) \right\}. \tag{A.7} $$


Figure A.1 The projection onto the $xz$-plane of a light ray that is
gravitationally deflected en route from source (at left) to observer
(at right).


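Along the path $x(z)$ that we seek, $\delta t$ vanishes for an arbitrary variation $\delta x$.
Since $\delta t$ now involves $\delta x$ only as a multiplicative factor in the integrand, it
can vanish for arbitrary $\delta x$ only if its coefficient vanishes for all $z$. Hence
along the path we seek we have

$$ \frac{d}{dz}\left( n \left[\left(\frac{dx}{dz}\right)^2 + \left(\frac{dy}{dz}\right)^2 + 1\right]^{-1/2} \frac{dx}{dz} \right) = \frac{\partial n}{\partial x} \left[\left(\frac{dx}{dz}\right)^2 + \left(\frac{dy}{dz}\right)^2 + 1\right]^{1/2}. \tag{A.8} $$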


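We now integrate both sides of this differential equation with respect to $z$
between the source and the observer. Since

$$ \left[\left(\frac{dx}{dz}\right)^2 + \left(\frac{dy}{dz}\right)^2 + 1\right]^{1/2} dz = ds, $$

we find

$$ \left[ n\,\frac{dx}{ds} \right]_{\rm source}^{\rm observer} = \int_{\rm S}^{\rm O} \frac{\partial n}{\partial x}\,ds. \tag{A.9} $$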


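Both the source and the observer are assumed to lie far from the deflecting
mass, in regions within which $n = 1$, so

$$ \left(\frac{dx}{ds}\right)_{\rm O} - \left(\frac{dx}{ds}\right)_{\rm S} = \int_{\rm S}^{\rm O} \frac{\partial n}{\partial x}\,ds. \tag{A.10} $$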


Figure A.1 shows that the left-hand side of this equation is the angle
through which the projection onto the $xz$-plane of the ray from source to
observer is bent. We define $\alpha_x$ to be this angle and note that equivalent
relations hold for the $yz$-plane. Hence the angle between the direction of the
ray at the source S and at the observer O is given by

$$ \boldsymbol{\alpha} = \int_{\rm S}^{\rm O} \nabla_\perp n\,ds, \tag{A.11} $$
where the integral is along the ray’s path and $\nabla_\perp$ denotes the derivative
perpendicular to the path. When we substitute from (A.3) for $n$, we have

$$ \boldsymbol{\alpha} = -\frac{2}{c^2}\int_{\rm S}^{\rm O} \nabla_\perp \Phi\,ds. \tag{A.12} $$


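The potential $\Phi$ is related to the mass-density $\rho$ by Poisson’s integral
(e.g., BT §2-1)

$$ \Phi(\mathbf{x}) = -G\int d^3\mathbf{x}'\,\frac{\rho(\mathbf{x}')}{|\mathbf{x} - \mathbf{x}'|}. \tag{A.13} $$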


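When we substitute this in (A.12) we obtain

$$ \boldsymbol{\alpha} = \frac{2G}{c^2}\int ds \int d^3\mathbf{x}'\,\rho(\mathbf{x}')\,\nabla_\perp \frac{1}{|\mathbf{x} - \mathbf{x}'|} = -\frac{2G}{c^2}\int ds \int d^3\mathbf{x}'\,\rho(\mathbf{x}')\,\frac{(\mathbf{x} - \mathbf{x}')_\perp}{|\mathbf{x} - \mathbf{x}'|^3}. \tag{A.14} $$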


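Now, since the deflection angle is small, in (A.14) we may approximate the
integral over $s$ along the true path by the integral over the best-fitting straight
line through the deflecting region. We orient our coordinate system so that
this straight line runs parallel to its $z$ axis and write

$$ \boldsymbol{\alpha}(\mathbf{x}_\perp) = -\frac{2G}{c^2}\int dz \int d^3\mathbf{x}'\,\rho(\mathbf{x}')\,\frac{(\mathbf{x} - \mathbf{x}')_\perp}{|\mathbf{x} - \mathbf{x}'|^3} = -\frac{2G}{c^2}\int d^3\mathbf{x}'\,\rho(\mathbf{x}')\,(\mathbf{x} - \mathbf{x}')_\perp \int \frac{db}{(a^2 + b^2)^{3/2}}, \tag{A.15} $$

where in the last line we have made the substitutions

$$ a^2 = (x - x')^2 + (y - y')^2 = |(\mathbf{x} - \mathbf{x}')_\perp|^2 \quad\text{and}\quad b = (z - z'). \tag{A.16} $$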


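By substituting $b = a\sinh w$ we may easily show that the inner integral in
(A.15), taken over all $b$, has value $2/a^2$. Consequently,

$$ \boldsymbol{\alpha}(\mathbf{x}_\perp) = -\frac{4G}{c^2}\int d^2\mathbf{x}'_\perp\,\frac{(\mathbf{x} - \mathbf{x}')_\perp}{|(\mathbf{x} - \mathbf{x}')_\perp|^2}\int dz'\,\rho(\mathbf{x}') = -\frac{4G}{c^2}\int d^2\mathbf{x}'_\perp\,\frac{(\mathbf{x} - \mathbf{x}')_\perp}{|(\mathbf{x} - \mathbf{x}')_\perp|^2}\,\Sigma(\mathbf{x}'_\perp) = -\frac{2}{c^2}\nabla_\perp \Phi_2(\mathbf{x}_\perp), \tag{A.17} $$

where

$$ \Sigma(\mathbf{x}_\perp) \equiv \int dz\,\rho(\mathbf{x}), \qquad \Phi_2(\mathbf{x}_\perp) \equiv 2G\int d^2\mathbf{x}'_\perp\,\Sigma(\mathbf{x}'_\perp)\ln|(\mathbf{x} - \mathbf{x}')_\perp|. \tag{A.18} $$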
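
As a numerical illustration of the last two steps, the sketch below checks the inner integral of (A.15) by the same substitution $b = a\sinh w$, and then evaluates the deflection for the simplest case of a point mass, for which $\Sigma$ is a delta function and (A.17) reduces to a deflection of magnitude $4GM/(c^2 a)$. The function names are illustrative only, the constants are those of Appendix D, and the choice of a ray grazing the solar limb is an assumption made for the example.

```python
import math

# Constants as listed in Appendix D (SI units).
G = 6.672e-11            # gravitational constant
C = 2.99792458e8         # speed of light
M_SUN = 1.989e30         # solar mass
R_SUN = 6.9599e8         # solar radius
ARCSEC_PER_RAD = 206265.0

def inner_integral(a, w_max=20.0, n=20000):
    """Midpoint-rule estimate of the integral of db / (a^2 + b^2)^(3/2) over all b.

    With b = a sinh(w) the integrand becomes dw / (a^2 cosh^2 w), which
    decays so fast that truncating at |w| = w_max is harmless.
    """
    dw = 2.0 * w_max / n
    total = 0.0
    for i in range(n):
        w = -w_max + (i + 0.5) * dw
        total += dw / (a * a * math.cosh(w) ** 2)
    return total

def point_mass_deflection(mass, impact_parameter):
    """Magnitude of the deflection from (A.17) for a point mass: 4GM / (c^2 a)."""
    return 4.0 * G * mass / (C * C * impact_parameter)

# Hypothetical example: a ray that grazes the limb of the Sun.
alpha_sun_arcsec = point_mass_deflection(M_SUN, R_SUN) * ARCSEC_PER_RAD

print(inner_integral(3.0), 2.0 / 3.0 ** 2)   # the two numbers should agree
print(alpha_sun_arcsec)                      # deflection in arcsec
```

For the solar-limb ray this gives a deflection of about 1.75 arcsec, the familiar general-relativistic value.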


Appendix B: Some important astronomical catalogs


Stellar survey catalogs

Catalog | Principal content | Number of objects | Reference
Bonner Durchmusterung (BD) | Position, magnitude (plus atlas) | 320 000 | Argelander (1859-1862)
Southern Durchmusterung (BD) | Position, magnitude (plus atlas) | 130 000 | Schönfeld (1886)
Cordoba Durchmusterung (CD) | Position, magnitude (plus atlas) | 580 000 | Thome (1892-1932)
Cape Photographic Durchmusterung (CPD) | Position, magnitude | 450 000 | Gill & Kapteyn (1895-1900)
Carte du Ciel | Position, magnitude (plus atlas) | ~10⁷ | Various publications, culminating in AC2000
EBL2 | Proper motions for BD, CD and CPD | 95 000 | Hamburg Sternwarte (1936)
General Catalog (GC) | Position, proper motion, for all stars with V < 7 | 33 000 | Boss (1937)
Smithsonian Astrophysical Observatory Catalog (SAOC) | Position, magnitude, proper motion | 259 000 | SAO staff (1966)
AGK3 | Position, proper motion | 183 000 | Heckmann & Dieckvoss (1975)
Southern Reference Stars (SRS) | Position, proper motion | 20 000 | U.S. Naval Observatory (1988)
Second Cape Photographic Catalog (CPC2) | Position (mean epoch 1968, SRS frame), proper motion, magnitude | 276 000 | Zacharias et al. (1992)
Lowell Proper Motion Survey (North and South) | Position, magnitude, proper motion | 12 000 | Giclas, Burnham & Thomas (1971-1978)
Large Proper Motion Survey (LHS) | Proper motion ≥ 0.5 arcsec yr⁻¹ | 4 500 | Luyten (1979)
New Luyten Catalog (NLTT) | Proper motion > 0.2 arcsec yr⁻¹ | 59 000 | Luyten (1980)
Lick Northern Proper Motion Catalog (NPM1) | Position, proper motion, magnitude, color, 8 < B < 18 | 149 000 | Klemola, Hanson & Jones (1987)
Positions and Proper Motions (North and South) (PPM) | Position, proper motion | 379 000 | Roeser & Bastian (1991), Bastian et al. (1991)
Astrographic Catalogue (AC2000) | Position (mean epoch 1905, ICRS frame), magnitude, B < 13 | 5 × 10⁶ | Urban et al. (1998) (http://aries.usno.navy.mil/ad/ac.html)

Stellar astrometric catalogs

Catalog | Principal content | Number of objects | Reference
Fourth Fundamental Catalog (FK4) | Position, proper motion | 1 500 | Fricke & Kopff (1963)
Fifth Fundamental Catalog (FK5) | Position, proper motion | 1 500 | Fricke, Schwan & Lederle (1988)
FK5-Extension | Position, proper motion, across a wider range of magnitude and color | 3 100 | Fricke, Schwan & Corbin (1991)
HST Guide Star Catalog (GSC) | Position, magnitude, 7 < V < 16 | 2 × 10⁷ | Lasker et al. (1990)
International Reference Stars (IRS) | Position, proper motion of AGK3 and SRS stars, mostly 7 < m_p < 9 | 36 000 | Corbin & Warren (1991)
Astrographic Catalog Reference Stars (ACRS) | Position, proper motion of AGK3 and CPC2 stars in IRS frame | 320 000 | Corbin, Urban & Warren (1991)
Carlsberg Meridian Catalogs (CMCn, n = 1...8) | Position, proper motion, magnitude, V < 15 | 70 000 | Fabricius (1993)
General Catalogue of Trigonometric Parallaxes | Position, proper motion, parallax (~2300 non-Hipparcos) | 8 100 | van Altena, Lee & Hoffleit (1995)
Hipparcos Catalogue | Position (ICRS), proper motion, parallax, magnitude, variability, Hp < 13 | 118 000 | Perryman et al. (1997)
Tycho Catalogue | Position (ICRS), proper motion, parallax, magnitude, color, variability | 1 × 10⁶ | Perryman et al. (1997)
ACT Reference Catalog | Proper motion by comparing AC2000 and Tycho Catalogues | 989 000 | Urban et al. (1998) (http://aries.usno.navy.mil/ad/act.html)


Stellar spectral types

Catalog | Principal content | Number of objects | Reference
Henry Draper Catalog (HD) | Position, spectral class, magnitude | 225 000 | Cannon & Pickering (1918-1924)
Henry Draper Extension (HDE) | Position, spectral class, magnitude | 134 000 | Cannon (1925-1936), Cannon & Mayall (1949)
Michigan Catalog of Two-Dimensional Spectral Types | Position, MK spectral type for HD stars | 130 000 | Houk (1975-)
Twelfth General Catalog of MK Spectral Classification | Position, MK spectral type, UBV photometry | 85 000 | Buscombe (1995)


Stellar photometry

Catalog | Principal content | Number of objects | Reference
UBVRIJKL Photometry of Bright Stars | UBVRIJKL photometry of stars in the BSC | 1 500 | Johnson et al. (1966)
UBVRIJKLMNH Photoelectric Photometric Catalogue | UBVRIJKLMNH photoelectric photometry | 4 500 | Morel & Magnenat (1978)
Photoelectric Johnson Photometric Catalogue | Photoelectric UBVRI photometry | 6 800 | Lanz (1986)
Catalogue of uvbyβ Data | Photoelectric uvbyβ photometry | 63 000 | Hauck & Mermilliod (1998)
UBVRI Photometric Standard Stars | Photoelectric UBVRI photometry of stars on the celestial equator | 526 | Landolt (1992)
CCD Photometric Standards | CCD photometry in vgr and UBV | 120 | Jorgensen (1994)
General Catalogue of Photometric Data (GCPD) | Photometry in 78 photometric systems (6 systems with more than 10⁴ entries) | 208 000 | Mermilliod, Mermilliod & Hauck (1997) (http://obswww.unige.ch/gcpd/)


Stellar spectral data

Catalog | Principal content | Number of objects | Reference
IUE Low-Dispersion Reference Atlas of Stellar Spectra | Flux-calibrated spectrum, range of MK types, 115 nm < λ < 320 nm, 0.6-0.9 nm resolution | 161 | Heck et al. (1984)
Stellar Spectrophotometric Atlas | Flux-calibrated spectrum, range of MK types, 313 nm < λ < 1070 nm, 2-4 nm resolution | 175 | Gunn & Stryker (1983)
Library of Stellar Spectra | Flux-calibrated spectrum, range of MK types, 350 nm < λ < 740 nm, 0.5 nm resolution | 161 | Jacoby, Hunter & Christian (1984)
Atlas of Southern MK Standards | Flux-calibrated spectrum, range of MK types, 580 nm < λ < 1020 nm, 0.5 nm resolution | 126 | Danks & Dennefeld (1994)
Library of Near-Infrared Spectra | Flux-calibrated spectrum, range of MK types, 1.4 µm < λ < 2.5 µm, 4 nm resolution | 56 | Lancon & Rocca-Volmerange (1992)
Spectrophotometric Standards | High quality flux-calibrated spectrum, 320 nm < λ < 810 nm | 25 | Massey et al. (1988)
Southern Spectrophotometric Standards | High quality flux-calibrated spectrum, 330 nm < λ < 1050 nm | 20 | Hamuy et al. (1994)



Stellar radial velocity

Catalog | Principal content | Number of objects | Reference
Third Bibliographic Catalogue of Radial Velocities | Radial velocity | 44 000 | Barbier-Brossat, Petit & Figon (1994)
Wilson-Evans-Batten Catalogue of Radial Velocities (WEB) | Radial velocity (mean value for binaries) | 21 000 | Duflot, Figon & Meyssonnier (1995)


Variable stars, novae, etc.

Catalog | Principal content | Number of objects | Reference
General Catalog of Variable Stars, 4th Ed. (GCVS4) | Position, magnitude, amplitude of variability | 28 000 | Kholopov et al. (1988)
Reference Catalog of Galactic Novae | Position, brightest magnitude, rate of decline | 283 | Duerbeck (1990)
Sternberg Supernova Catalog | Position, brightest magnitude, identification of host galaxy | 930 | Tsvetkov, Pavlyuk & Bartunov (1995)
Catalogue of Galactic Supernova Remnants | Position, type, radio flux | 180 | Green (1991)


Nearby, bright and high-velocity stars

Catalog | Principal content | Number of objects | Reference
Catalog of Nearby Stars (Gliese Catalog, or CNSn, n = 1...3) | Position, proper motion, radial velocity, mag., color, distance, for stars within 25 pc | 3 800 | Jahreiss & Gliese (1993)
Palomar/MSU Nearby Star Spectroscopic Survey | Spectral type, line strengths, radial velocity of CNS3 stars | 1 750 | Reid, Hawley & Gizis (1995), Hawley, Gizis & Reid (1996)
Yale Bright Star Catalog (BSCn, n = 1...5) | Position, proper motion, parallax, radial velocity, spec. type, mag., color, V < 6.5 | 9 100 | Hoffleit & Warren (1991)
Supplement to the BSC | Extension of BSC4 down to V = 7.1 | 2 600 | Hoffleit, Saladyga & Wlasuk (1983)
Catalog of High-Velocity Stars | Position, proper motion, parallax, radial velocity, magnitude, color | 630 | Roman (1955)
Radial Velocities of High Proper Motion Stars | Position, proper motion, radial velocity, magnitude, color | 890 | Fouts & Sandage (1986)



Binary stars

Catalog | Principal content | Number of objects | Reference
Washington Visual Double Star Catalog 1996.0 (WDS) | Position, separation, spectral types and magnitudes of components | 78 000 | Worley & Douglass (1996)
Fourth Catalog of Orbits of Visual Binary Stars | Position, orbital elements, spectral types and magnitudes of components | 850 | Worley & Heintz (1983)
Eighth Catalog of Orbits of Spectroscopic Binary Stars | Position, orbital elements, magnitude | 1 500 | Batten, Fletcher & MacCarthy (1989)


Star clusters

Catalog | Principal content | Number of objects | Reference
Catalog of Globular Clusters in the Milky Way | Position, magnitude, color, distance, velocity, metallicity, structure | 146 | Harris (1996) (http://www.physics.mcmaster.ca/Globular.html)
Catalog of Open Cluster Data | Position, magnitude, distance, size, constituents, classification | 8600 | Lynga (1987)
Catalog of UBV Photometry of Globular Cluster Stars | UBV photometry for stars in 113 clusters | 37 000 | Philip, Cullen & White (1976)
Database for Stars in Open Clusters (BDA) | Photometry, spectroscopy, astrometry, for stars in ~500 clusters | 100 000 | Mermilliod (1987-) (http://obswww.unige.ch/bda/)
Catalog of Globular Cluster Surface Brightness Profiles | Surface brightness profile, derived structural parameters | 125 | Trager, King & Djorgovski (1995)


Nebulae

Catalog | Principal content | Number of objects | Reference
Catalog of Dark Nebulae (LDN) | Position, area, opacity | 1 800 | Lynds (1962)
Catalog of Bright Nebulae (LBN) | Position, size, color, brightness | 1 100 | Lynds (1965)
Catalog of Reflection Nebulae | Position, size, brightness | 158 | van den Bergh (1966)
Strasbourg-ESO Catalogue of Galactic Planetary Nebulae | Position, size, magnitude, colors, properties of central star | 1 100 | Acker et al. (1992)

Galaxies and clusters of galaxies

Catalog | Principal content | Number of objects | Reference
New General Catalogue (NGC) | Position and description of nebulae, galaxies and star clusters | 7 800 | Dreyer (1888), Sinnott (1988)
Index Catalogue (IC; two additions to NGC) | Position and description of nebulae, galaxies and star clusters | 6 900 | Dreyer (1895, 1908), Sinnott (1988)
Morphological Catalog of Galaxies (MCG) | Position, size, form, especially of multiple and interacting systems | 32 000 | Vorontsov-Velyaminov et al. (1962-1974)
Uppsala General Catalog of Galaxies (UGC) | Position, magnitude, classification, radial velocity (Bz < 14.5) | 13 000 | Nilson (1973)
ESO/Uppsala Survey of the ESO(B) Atlas | Position, magnitude, classification of galaxies in the southern sky | 18 000 | Lauberts (1982)
Revised Shapley-Ames Catalog of Bright Galaxies (RSA) | Position, magnitude, form of bright galaxies (B < 13) | 1 200 | Sandage & Tammann (1981)
Catalogue of Principal Galaxies (PGC) | Position, cross-identifications to other catalogs, other properties | 73 000 | Paturel et al. (1989)
Third Reference Catalog of Bright Galaxies (RC3) | Position, magnitude, color, redshift, classification (B < 15.5) | 23 000 | de Vaucouleurs et al. (1991)
APM Bright Galaxies Catalogue | Position, magnitude, shape, type over 10% of the sky (B_J < 16.44) | 14 700 | Loveday (1996)
Nearby Galaxies Catalog (NBG) | Position, magnitude, classification of galaxies with redshifts < 0.01 | 2 400 | Tully (1988)
CfA Redshift Catalog | Position, magnitude, redshift | 57 500 | Huchra et al. (1995)
Las Campanas Redshift Survey | Position, magnitude, redshift | 26 400 | Shectman et al. (1996)
Catalog of Galaxies and Clusters of Galaxies (CGCG) | Position, magnitude, redshift | 29 400 | Zwicky et al. (1961-1968)
Catalog of Rich Clusters of Galaxies (Abell, ACO) | Position, magnitudes of some members, redshift, richness class, distance class | 5 500 | Abell (1958), Abell, Corwin & Olowin (1989)

Active galaxies

Catalog | Principal content | Number of objects | Reference
First Byurakan Survey of Markarian Galaxies (FBS, Mrk) | Position, magnitude, size, type, redshift of UV-bright galaxies | 1 500 | Markarian et al. (1989)
Catalog of Seyfert Galaxies | Position, photometry (optical, radio, IR, X-ray), redshift | 960 | Lipovetsky, Neizvestny & Neizvestnaya (1988)
Catalog of Quasi-Stellar Objects | Position, magnitude, redshift | 7 300 | Hewitt & Burbidge (1993)
Quasars and Active Galactic Nuclei | Position, magnitude, redshift, type of active galaxy | 11 700 | Veron-Cetty & Veron (1996)


Non-optical source lists

Catalog | Principal content | Number of objects | Reference
Master List of Radio Sources | Position, flux, frequency of measurement | 12 000 | Dixon (1970)
NRAO/VLA Sky Survey (NVSS) Catalog | Position, size, flux and other Stokes parameters at 1.4 GHz | ~2 × 10⁶ | Condon et al. (1997) (http://www.cv.nrao.edu/NVSS/)
Faint Images of the Radio Sky at 20 cm (FIRST) Catalog | Position, size, flux at 1.4 GHz | 270 000 | White et al. (1997) (http://sundog.stsci.edu/)
Catalog of Infrared Observations | Position, IR flux in several bands | 11 500 | Gezari et al. (1993)
Second EUVE Source Catalog | Position, EUV flux in several bands | 740 | Bowyer et al. (1996)
ROSAT All Sky Survey Bright Source Catalog (1RXS) | Position, probability extended, soft X-ray (0.1-2.4 keV) flux, hardness | 18 800 | Voges et al. (1996)
ROSAT Source Catalogs (1RXP, WGACAT) | Position, probability extended, soft X-ray (0.1-2.4 keV) flux, hardness | ~70 000 | Voges et al. (1994), White, Giommi & Angelini (1995)
HEAO 1 A-4 Catalog of High-Energy X-ray Sources | Hard X-ray (13-180 keV) flux | 50 | Levine et al. (1984)
Second EGRET Catalog of High-Energy Gamma-Ray Sources | Position, gamma-ray flux, spectral index | 129 | Thomson et al. (1995)


NOTES: The data for most of these catalogs are available on-line from the Astronomical Data centers. Other on-line references as listed. 
Appendix C: Richardson-Lucy deconvolution


Lucy (1974) described an elegant method for handling inverse problems of
the form $f(x) = \int d\xi\,K(x, \xi)\,g(\xi)$, where $f$ and $g$ are observed and theoretical
distributions, respectively, and $K$ is a smooth kernel. It is important that
$f$, $g$ and $K$ are all non-negative functions, for the first step is to normalize
them such that

$$ \int dx\,f(x) = \int d\xi\,g(\xi) = \int dx\,K(x, \xi) = 1; \tag{C.1} $$

this can always be done by appropriate rescalings of the functions (see
Problem 3.9), and we assume that (C.1) holds. Then each function can
be interpreted as a probability density: $f(x)\,dx$ is the probability that the
measured value of $x$ will fall in $(x + dx, x)$, $g(\xi)\,d\xi$ is the probability that a
randomly chosen system has $\xi$ in the range $(\xi + d\xi, \xi)$ and $K(x, \xi)\,dx\,d\xi \equiv
K(x|\xi)\,dx\,d\xi$ is the probability that we measure $x$ in $(x + dx, x)$ given that $\xi$
falls in $(\xi + d\xi, \xi)$.

By the usual rules of probability theory, the probability $P(x, \xi)\,dx\,d\xi$
that $x$ and $\xi$ both lie in the stated ranges can be written two ways:

$$ g(\xi)\,K(x|\xi)\,dx\,d\xi = P(x, \xi)\,dx\,d\xi = f(x)\,L(\xi|x)\,dx\,d\xi, \tag{C.2} $$

where $L(\xi|x)\,dx\,d\xi$ is the probability that $\xi$ lies in $(\xi + d\xi, \xi)$ given that $x$
lies in $(x + dx, x)$. Summing $P$ over all possible values of $x$ we recover $g$:

$$ g(\xi) = \int dx\,P(x, \xi) = \int dx\,f(x)\,L(\xi|x). \tag{C.3} $$

So if we knew $L$, we could immediately recover $g$. Dividing both sides of
equation (C.2) by $f(x)$, we obtain a statement of Bayes theorem:

$$ L(\xi|x) = \frac{g(\xi)}{f(x)}\,K(x|\xi). \tag{C.4} $$
Since we don’t know $g$, this equation for $L$ is not immediately useful. But
Lucy realized that even a crude estimate of $L$ can be usefully employed
in (C.3) because $L$ occurs inside an integral, and proposed the following
sequence of approximations to $L$. First we guess $g$. We call this guess $g_1$
and calculate the value $f_1(x) = \int d\xi\,g_1(\xi)\,K(x|\xi)$ it predicts. Then we use $f_1$
and $g_1$ in (C.4) to form an estimate $L_1 = (g_1/f_1)K$ of $L$, which we use in
(C.3) to obtain an improved estimate $g_2$ of $g$. From $g_2$ we calculate $f_2$ and
$L_2$, and thus form $g_3$ and so on, the general step being

$$ g_{r+1}(\xi) = g_r(\xi)\int dx\,\frac{f(x)}{f_r(x)}\,K(x|\xi). \tag{C.5} $$

We always choose the function $g_1(\xi)$ to be smooth, and we find that the iterates
$g_r$ become less smooth as $r$ increases, while the predicted distributions
$f_r(x)$ become better and better approximations to the measured distribution
$f(x)$. It is easy to show that $g_r$ and $f_r$ are always non-negative and correctly
normalized.
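
The iteration (C.5) is straightforward to try on a grid. The following is a minimal sketch rather than a production implementation: the integrals are replaced by sums over grid cells, and the Gaussian kernel, the two-spike test distribution, the grid size and all function names are assumptions chosen purely for illustration. The misfit that is tracked is the Kullback-Leibler divergence between $f$ and $f_r$, which is one convenient way (not taken from the text) to watch $f_r$ approach $f$.

```python
import math

def predict(K, g):
    """Discrete form of f_r(x) = int dxi K(x|xi) g_r(xi): f_r[i] = sum_j K[i][j] g[j]."""
    return [sum(k_ij * g_j for k_ij, g_j in zip(row, g)) for row in K]

def lucy_step(f, K, g):
    """One application of (C.5): g_{r+1}[j] = g_r[j] * sum_i (f[i] / f_r[i]) K[i][j]."""
    f_r = predict(K, g)
    ratio = [f_i / fr_i for f_i, fr_i in zip(f, f_r)]
    n_x = len(f)
    return [g[j] * sum(ratio[i] * K[i][j] for i in range(n_x)) for j in range(len(g))]

def kl_misfit(f, f_r):
    """Kullback-Leibler divergence of f from f_r (zero when they agree)."""
    return sum(f_i * math.log(f_i / fr_i) for f_i, fr_i in zip(f, f_r) if f_i > 0.0)

# Illustrative set-up: Gaussian kernel two cells wide, each column normalized
# so that sum_i K[i][j] = 1, the discrete analogue of (C.1).
N = 40
raw = [[math.exp(-0.5 * ((i - j) / 2.0) ** 2) for j in range(N)] for i in range(N)]
col_sums = [sum(raw[i][j] for i in range(N)) for j in range(N)]
K = [[raw[i][j] / col_sums[j] for j in range(N)] for i in range(N)]

# Hypothetical "true" distribution (two spikes) and the data it would produce.
g_true = [0.0] * N
g_true[12], g_true[25] = 0.6, 0.4
f_obs = predict(K, g_true)

g = [1.0 / N] * N          # smooth first guess g_1
misfits = []
for _ in range(200):
    misfits.append(kl_misfit(f_obs, predict(K, g)))
    g = lucy_step(f_obs, K, g)

print(misfits[0], misfits[-1])   # misfit before the first and the last step
print(sum(g), min(g))            # g stays normalized and non-negative
```

Because every factor in (C.5) is non-negative and the columns of `K` sum to one, each iterate remains non-negative and normalized, as stated above. In practice the iteration is stopped after a modest number of steps, before the iterates become too rough.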
Appendix D: Useful numbers¹


Physical constants

Gravitational constant        G = 6.672(4) × 10⁻¹¹ m³ kg⁻¹ s⁻²
Speed of light                c = 2.99792458 × 10⁸ m s⁻¹ (by definition)
Permeability of space         μ₀ = 4π × 10⁻⁷ H m⁻¹ (by definition)
Permittivity of space         ε₀ = (μ₀c²)⁻¹ = 8.85418 × 10⁻¹² F m⁻¹
Planck’s constant             h = 6.62618(4) × 10⁻³⁴ J s
                              ħ = 1.054589(6) × 10⁻³⁴ J s
Boltzmann’s constant          k = 1.38066(4) × 10⁻²³ J K⁻¹ = 8.61733 × 10⁻⁵ eV K⁻¹
Electron charge               e = 1.602189(5) × 10⁻¹⁹ coulomb
Electron volt                 eV = 1.602189(5) × 10⁻¹⁹ J
Proton mass                   m_p = 1.672649(9) × 10⁻²⁷ kg
Electron mass                 m_e = 9.10953(5) × 10⁻³¹ kg
Bohr magneton                 μ_B = eħ/2m_e = 9.27408 × 10⁻²⁴ J T⁻¹
Stefan-Boltzmann constant     σ = π²k⁴/(60ħ³c²) = 5.6703(7) × 10⁻⁸ W m⁻² K⁻⁴
Thomson cross-section         σ_T = e⁴/(6πε₀²m_e²c⁴) = 6.65245(6) × 10⁻²⁹ m²


Astronomical constants

Astronomical unit             1 AU = 1.49597892(1) × 10¹¹ m
Parsec                        1 pc = 648 000/π AU = 3.08567802(2) × 10¹⁶ m
1 sidereal year (1900.0)      1 yr = 3.1558149984 × 10⁷ s
Hubble constant               H₀ = 100h km s⁻¹ Mpc⁻¹ where 0.5 ≲ h ≲ 1
Hubble time                   H₀⁻¹ = 9.78h⁻¹ × 10⁹ yr
Solar mass                    M⊙ = 1.989(2) × 10³⁰ kg
Gaussian constant             GM⊙ = 1.32712497(1) × 10²⁰ m³ s⁻²
Solar radius                  R⊙ = 6.9599(7) × 10⁸ m
Solar luminosity (bolometric) L⊙ = 3.826(8) × 10²⁶ W
Escape speed from Sun         v_e = √(2GM⊙/R⊙) = 617.5 km s⁻¹
Solar absolute magnitude      M_V = +4.83
                              M_B = +5.48
Earth mass                    M⊕ = 5.976(4) × 10²⁴ kg


Useful relations

1 km s⁻¹ ≃ 1 pc per million years (actually 1.023)
1 radian = 206 265 arcsec


¹ Taken from Allen (1973), Lang (1980) and Particle Data Group (1984). Numbers
given in parentheses indicate one standard deviation uncertainty in the last digits of the
preceding number.
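
Several of the entries above are derived from others in the same table, so they can be cross-checked with a few lines of arithmetic. The sketch below does this for the parsec, the relation between km s⁻¹ and pc per million years, the Hubble time for h = 1, the solar escape speed and the number of arcseconds in a radian. The variable names are illustrative, and the input values are simply those tabulated here.

```python
import math

# Inputs taken from the table above (SI units).
AU = 1.49597892e11        # astronomical unit in m
YEAR = 3.1558149984e7     # sidereal year in s
GM_SUN = 1.32712497e20    # Gaussian constant G * M_sun in m^3 s^-2
R_SUN = 6.9599e8          # solar radius in m

# 1 pc is the distance at which 1 AU subtends 1 arcsec: 648 000 / pi AU.
parsec = 648000.0 / math.pi * AU

# Distance covered in 10^6 yr at 1 km/s, expressed in parsecs.
pc_per_myr = 1.0e3 * 1.0e6 * YEAR / parsec

# Hubble time for h = 1, i.e. H0 = 100 km/s/Mpc = 1e5 m/s per 1e6 pc.
hubble_time_yr = (1.0e6 * parsec / 1.0e5) / YEAR

# Escape speed from the solar surface, in km/s.
escape_speed_kms = math.sqrt(2.0 * GM_SUN / R_SUN) / 1.0e3

# Arcseconds per radian.
arcsec_per_radian = 180.0 * 3600.0 / math.pi

print(parsec, pc_per_myr, hubble_time_yr, escape_speed_kms, arcsec_per_radian)
```

Each result should reproduce the tabulated value to the number of digits quoted.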
BL Lac objects 249
hosts 250 
and unified model 253 
black body spectrum 470 
black dwarfs 268 
black holes 191, 271, 273, 723 
in centers of ellipticals 717-721 
at Galactic center 595-597 
in globular clusters 371, 374 
and unified model 255 
blazars 249 
and unified model 253 
blue stragglers 334-335, 359, 361, 367 
blue supergiants 20 
Bohr magneton 744 
bolometric correction 98, 262, 59 
bolometric magnitude 58-60 
apparent 59 
Boltzmann’s constant 744 
boxy galaxies 201 
bremsstrahlung 404, 460, 483 
radio frequency 480 
bright stars 
catalogs of 739 
names 58 
brightness temperature 61, 470 
brown dwarfs 268 
bulge fraction 220 
bulge-disk decomposition 214-222
bulges
boxy 232, 724
Dn-σ relation for 724
flattening of 724 
fundamental plane for 724 
peanut 232 
shapes of 222-223 
stellar kinematics of 723-724 


C 


C stars - see carbon stars
cannibalism, galactic 445 — see also mergers 
Canterbury Sky Atlas 69 
carbon stars 90, 92, 101, 266 
and Galactic rotation curve 557 
Carina Dwarf Galaxy 171 
Carina arm 561 
Carlsberg Automated Meridian Circle 37 
Carlsberg Meridian Catalogs 737 
Carnegie Atlas 69 
Carte du Ciel 736 
Catalog of Galaxies and Clusters of Galaxies 
741 
Catalog of Nearby Galaxies 741 
Catalog of Nearby Stars 130 
catalogs 67-74 
of γ-ray sources 742
of active galaxies 742 
of binary stars 740 
of clusters of galaxies 741 
digital 69 
of galaxies 741 
Gliese 130 
IC 67, 741
of IR sources 742 
of Markarian galaxies 742 
of nebulae 740 
NGC 67, 741 
of non-optical sources 742 
of novae 739 
of quasars 742 
of QSOs 742 
of radio sources 742 
Shapley-Ames 150 
of star clusters 740 
of stars 
astrometric 737 
photometric 738 
radial velocity 739 
of spectra 738 
of spectral type 737 
survey 736 
of supernovae 739 
of variable stars 739 
of X-ray sources 742 
CCD 22, 47, 50, 736 
photometry using 175 
cD galaxies 157, 166, 169, 186, 189, 194 
celestial sphere 28 
poles 28 
ecliptic 29 
Galactic coordinates 30 
great circle 28 
horizon 28 
local vertical 28 
position angle 30 
Cepheids 15, 95, 265, 400 
classical 289 
Oort constants and 642 
period—luminosity relation for 289 
period-luminosity—color relation for 291 
population II 293 
spiral arms and 665 
as standard candles 415 
CfA Redshift Catalog 741 
Chandrasekhar limiting mass 268 
charge coupled device - see CCD
chemical composition, stellar 21 
chemical elements, synthesis of §5.2 
chemical evolution, models of §5.3 
circular speed curves 
Galactic 537, 555-559, 570, 591 
galactic 507-510 
stellar 725 
cirrus 24, 574 
classification 
of stars §3.3 
Harvard sequence 90 
history of 90 
MK system 90-94 
spectral 88-89 
of galaxies §4.1 
DDO system 148 
de Vaucouleurs system 148 
Hubble sequence 149 
Yerkes system 148 
closed-box model 306-308, 688-698 
clusters
of galaxies 157, 165-167 
shapes of 405 
catalogs of 741 
Coma 417, 441, 690 
elliptical fraction in 157 
intracluster gas in 532 
isopleths of 189 
luminosity function of 165-167 
morphology-density relation in 159
morphology-radius relation in 158-159
substructure in 160 
Sunyaev—Zel’dovich effect in 403 
Virgo 163, 418 
X-ray emission from 404 
of star 8, 103, Chapter 6 — see also open 
clusters and globular clusters 
CM diagram - see color-magnitude diagram 
CNO-cycle 266 
CO bandhead 622 
CO emission lines 474-478 
in disk galaxies 493-501 
in elliptical galaxies 528 
Galactic 554-555, 586-590 
from molecular clouds 477 
optical depth of 476 
correlation with synchrotron radiation 
579 
CO isotopes, abundance ratios 476 
COBE - see Cosmic Background Explorer
color excess 133, 137, 140 
color index 52 
color-color diagram 108 
color-magnitude diagram 20, §3.5 - see also
asymptotic giant branch, horizontal branch,
main sequence, red giant branch,
subgiant branch, turnoff point
bounding curves in 274-275 
dependence on metallicity 276-279 
features in 263-267 
of Galactic bulge 621 
of globular clusters 20 
of moving groups 636 
observed 103-104 
of open clusters 20, 381-384 
placing stellar models in 262-263 
sequences in 103 
stellar evolution in §5.1 
theoretical 103 
color-magnitude relation, galactic 206 
colors 52-56 52 
relation to temperature 262 
Coma Cluster 417, 441, 690 
concentration class 331, 330 
concentration 
of globular clusters 366 
of open clusters 386 
confusion 333 
convection 274 
overshoot 261 
Schwarzschild criterion for 260 
in stellar cores 268 
convergent point 40 
coordinate systems §2.1
Copernicus Telescope 453 
core 177 
collapse 370 
radius 
of open clusters 386 
of globular cluster 366 
core collapse supernovae 303 
corotation radius 231 
COS-B 601 
Cosmic Background Explorer 24, 486, 570, 
609, 616, 620 
cosmic background radiation 279, 403 
cosmic distance ladder 395, 429 
cosmic deceleration parameter 398, 443-448 
cosmic density parameter 397, 443-448 
cosmic rays 479, 482 
and vertical extent of Galactic HI 565 
cosmic scatter 206 
cosmological 
bound models 397 
closed models 397 
constant 448 
critical models 397 
dust models 397 
Einstein—de Sitter model 397 
Friedmann models 397 
open models 397 
principle 2 
Robertson-Walker metric 397 
unbound models 397 
cosmology §7.1 
counter-rotating gas disks 
in disk galaxies 726 
in ellipticals 529 
in lenticular galaxies 515 
counter-rotating stellar disks 726-727 
Cousins magnitudes 51 
CPC2 736 
CPD 736 
critical density 397 
critical frequency 478 
cross-correlation function 698 
cross correlation method 698 
Curtis, Heber 10 
curve of growth 453 
cusps in brightness profiles 
of elliptical galaxies 191-193 
of the Galaxy 618-619 
of globular clusters 369-371, 374 
cyclotron radiation 478 
Cyg A 245, 247 


D 


D25 61

DA stars 93 
DAOPHOT 333 
dark clouds 7, 740 
dark matter 19-20
in disk galaxies 510 
in ellipticals 712-713 
halo shape 214 
Galactic 568-569 
in solar neighborhood 663 
data cube 
from 21cm line 488 
for stellar kinematics 706 
Davis—Greenstein effect 140, 520 
DDO system 55 
dE - see dwarf elliptical galaxies 
de Jager limit 274 
de Vaucouleurs radius 62 
de Vaucouleurs R^{1/4} law - see also R^{1/4} law
generalized 187 
deceleration parameter 398, 443-448 
declination 29 
deflagration wave 306 
degeneracy, electron 272 
degenerate stars 267-268 
ignition of 269, 272 
Dehnen models 181 
depletion factor 455 
deprojection of galaxy images 179-185 
dex 100 
diameters 
D25 61
isophotal 61 
Digital Catalog of Galaxies 69 
Digital Sky Survey 70, 73 
DIRBE 486 
disk galaxies 
21cm line profiles of 503 
abundance gradients in 516 
boxy isophotes in 223 
bulge fraction of 220 
bulge-disk decomposition of 214-221, 222, 722
color gradients in 223-224 
corotation radius of 231 
counter-rotating gas disks in 515, 726 
counter-rotating stellar disks in 726 
disk-to-bulge ratio of 220 
ellipticities of 513 
Freeman’s law for 221 
gas 
atomic hydrogen content of 494-496 
azimuthal distribution of 500-505 
bars and oval distortions in 500, 
512-513 
cut-off density 499 
kinematics of 505-513, §9.1, 549-559, 569-570, 580-593
lop-sided distribution of 502-505
molecular hydrogen content of 494-496
radial density profile of 498-500 
spiral structure in 500—502 
interstellar matter in Chapters 8 and 9 
magnetic fields in 520-522 
metallicities of 516-520 
correlation with luminosity 519 
gradients in 223-224 
pattern speeds of 231
photometry of §4.4 
radio-continuum and IR luminosities of 
496-497 
shapes of 212-214 
star formation rate in 522-525 
stellar kinematics of §11.3 
stellar random motions in 727-729 
stellar rotation curves of 725 
synchrotron emission from 497 
type I and type II systems 216 
velocity ellipsoid in 728 
velocity fields of 505-513 
warps in 505, 510-512 
disk scale length 191 
dependence on waveband 222, 224, 614 
disk—bulge decomposition 214, 722 
diskiness, a4/a 178-179, 256 
of elliptical galaxies 199-201, 204 
of bulges 223 
disks — see also Galactic disk, disk galaxies 
in elliptical galaxies 711, 716 
thick 218 
dispersion measure 482, 574 
distance estimators 
absolute 395, §7.2 
Baade—Wesselink method 399-403 
distances from time delays 405-410 
Sunyaev—Zel’dovich effect 403-405, 
442 
water-maser VLBI proper-motions 
410 
relative 395, §7.3 
from galaxy kinematics 422-426 
luminosity functions 415-418 
surface brightness fluctuations 426-429
variable star luminosities 415 
distance modulus 56 
distances — see also distance estimators and 
parallaxes 
angular diameter 398 
dynamical 79 
luminosity 398 
by moving-cluster method 40-42 
from velocities 38—46 
from Galactic kinematics 540-541 
secular 105 
statistical 105 
distribution function 693 
Dn-σ relation 209
for bulges 724 
as distance estimator 426 
Doppler broadening 455, 694 
Doppler shifts 39, 79, 694 
Draco Dwarf Galaxy 171 
dredge-up 266, 302, 351 
dSph — see dwarf spheroidal galaxies 
DSS - see Digital Sky Survey 
dust 24, §3.7
absorption by 138, 609
cosmology 397
dust/gas ratio 137
extinction due to 8, 483-485
grain sizes in 485, 486
lanes 13, 153
in barred galaxies 228, 234, 528
in elliptical galaxies 201, 204, 528
in lenticular galaxies 514
in magnetic fields 140
photometry and 211-212
polarization of starlight by 140
radiation from 483-487
scattering by 138
stochastic heating of 486
temperature of 486, 573
dwarf elliptical galaxies 150, 190, 209
luminosity function of 168
rotation of 710
dwarf spheroidal galaxies 150, 190
rotation of 710
dwarfs 90 - see also black dwarfs, brown dwarfs, main sequence, subdwarfs, white dwarfs
luminosity and colors of 107
McCormick K and M 632, 648


E 


early-type galaxies 150 
Earth’s atmosphere 46, 49, 51 
Earth mass 77, 744 
E(B — V) - see color excess 
EBL2 736 
eclipses, primary and secondary 84 
ecliptic 29
obliquity of 29
Eddington luminosity 255, 257 
edge-on galaxies 217 
effective frequency 134 
effective wavelength 49 
effective yield 520, 688, 690 
EGRET catalog 742 
Einstein
A and B coefficients 469-470
angle 63
cross 410
Observatory 24
radius 63
Einstein-de Sitter model 397
electron
charge of 744
mass of 744
electron degeneracy 272
electron volt 744
electron-positron annihilation line 483
electronic configuration 467
elliptical galaxies 15
atomic hydrogen content of 527
boxiness of 203
central cusps in 191-193
central massive black holes in 717-721
central velocity dispersions of 205
chemical evolution of 309-312
CO content of 528
colors of 318
color and line-strength gradients in 193, 318
color-magnitude relation for 206
cool gas in 527-530
core kinematics of 716-721
counter-rotating gas disks in 529
dark halos of 712-713
diskiness of 199, 204
disks in 711, 716
distances from kinematics of 425-426
Dn-σ relation in 209
dust lanes in 201, 204, 528
effective surface brightness of 204
ellipticities of 194, 204
Faber-Jackson relation for 206
fine structure of 201
fundamental plane of 208, 256
Hβ absorption in 193
interstellar matter in §8.3
jets from 204
kinematic mapping of 713-716
kinematically-distinct cores in 716-717
line-strength indices of 204
luminosity function of 167
major-axis kinematics of 707
metallicity of 193, 690
gradients in 312
Mg2 index of 194, 204
minor-axis rotation in 713
photometry of §4.3
radial profiles in 185
radio luminosity of 199
ripples in 202, 204
rotation rate and flattening of 201, 709-710
shapes of 194
shells in 202
stellar kinematics of §11.2
stellar orbits in 712-713
surface brightness-effective radius relation for 205
triaxiality of 201, 715
(v/σ)* for 204, 710
X-ray emission from 199, 525
emission lines 463
emission measure 480
envelopes, galactic 152
epicycle approximation 543, 728
epoch 36
equatorial system 27-30
celestial equator 28
declination 29
epoch 36
meridian 28
north celestial pole (NCP) 28
right ascension 29
south celestial pole (SCP) 28
zenith 28 
zenith distance 28 
equinoxes 29 
equipartition of energy 522 
equivalent width 88, 453 
escape speed 
from Sun 744 
from solar neighborhood 692 
ESO Atlas 69 
ESO/SERC Southern Sky Survey 69 
ESO-Uppsala Catalogue 741 
ESO 565-11 156 
estimated kinematic misalignment 715 
EUV - see extreme ultraviolet
expansion parallax 419 
exponential law 191 - see also disk scale length
extinction 7, 57, 133, 142 
anomalous UV 136 
curve 135, 483 
and the CO X-factor 602 
law 135, 137 
of sight-lines out of the Galaxy 140 
extreme ultraviolet, opacity of interstellar 
matter to 459 


F 


F clusters — see globular clusters, halo 
F stars 92 
Faber—Jackson relation 206, 425 
face-on galaxies 214 
Faraday rotation 248, 481 
[Fe/H] 101 — see also metallicity 
Fermat’s principle 732 
Fermi energy 272 
field galaxy luminosity function 162 
field stars 
formed in clusters 389, 393, 651 
halo 670-678 
filaments of synchrotron emission 575 
file transfer protocol 71 
fine structure in galaxies 201 
index for 203 
fine-structure of atomic levels 467 
FIRST catalog 742 
first overtone mode 288 
FITS — see Flexible Image Transport System 
FK4 system 737 
FK5 system 37, 642, 737 
flat-fielding 174 
Flexible Image Transport System 70 
forbidden region 275 
forbidden transitions 465, 466 
Fornax Dwarf Galaxy 171 
Fourier quotient method 697 
Franklin-Adams Survey 69 
Freeman’s law 221 
Friedmann cosmological models 397 
FR I and II classes of radio galaxies 246 
ftp — see file transfer protocol 
full width at half maximum 49 
fundamental plane 208, 256
for bulges 724 
as a distance estimator 426 
fundamental stars 37 
fusion, thermonuclear 259 
FWHM - see full width at half maximum 


G 


G clusters — see globular clusters, disk 
G stars 92 
G-dwarf problem 688, 690 
g-modes 288 
Galactic bar 18, 582, 616-621 
and forbidden velocities 583, 586 
as inferred from microlensing 619-621 
as inferred from individual stars 619-621 
and the parallelogram of CO emission 588
Galactic bulge §10.2 
abundance ratios in 622 
age of 621-622 
formation of 687 
kinematics of 622-624 
metallicity of 622 
OH/IR stars in 623-624 
proper motions in 624 
surface photometry of 616-619 
triaxial models for 616-619 
velocity dispersion of 622 
Galactic center §9.5 — see also Sgr A* 
distance to 294, 432, 450 
black hole at 596 
extinction to 609 
mini-spiral at 595 
nuclear molecular ring around 595 
Wolf-Rayet stars near 596 
circular-speed curve near 537, 570 
Galactic clusters — see open clusters 
Galactic CO §9.2, 554-570 
vertical distribution of 562-570 
warp in 568 
Galactic coordinates 30-31 
equator 30 
latitude 30 
longitude 30 
north Galactic pole (NGP) 30 
Galactic dark halo shape 568-569 
Galactic disk 
age of 645 
central 549, §9.4
dynamical model for 588-593 
at 21-cm 580-586 
lines of CO and CS from 586-588 
lines of OH and H2CO from 580 
corrugation in 563, 666 
free-electron density in 574, 577 
kinematics of §9.1, 549-559, 569-570, 
580-593 
initial mass function of 283-287, 390 
local mass density of 679, 692 
mass of 681 
mass-to-light ratio of 662, 679 
middle 549 
vertical structure of 563-565
outer 549 
vertical structure of 565-570 
warp in 565-568 
stellar §10.4 
synchrotron and γ-radiation from 575, 577-579
thick 654-656, 668 
formation of 686 
stellar velocities in 656 
thin, stellar velocities in 656 
vertical density profile of 653 
X-rays from 579-580 
Galactic globular cluster system 666-670 
luminosity function of 669 
metallicity of 667, 670 
rotation of 668 
Galactic halo §10.5 
chemical evolution of 688-689 
field stars from 670-678 
local surface brightness of 692 
luminosity of 672 
mass of 672 
stellar velocity dispersion of 673 
rotation of 673 
Galactic HI disk §9.2 
deviations from circularity in 553 
extent of 551 
kinematics of §9.1, 549-559, 569-570, 
580-593 
optical thickness at 21-cm of 552 
spin temperature of 552 
vertical distribution of 562-570 
Galactic IR emission 570-574 
Galactic latitude 30 
Galactic longitude 30 
Galactic magnetic field 481, 574-576 
pressure of 576 
Galactic plane 3, 30 
Galactic rotation §9.1 
history of 15-20 
Galactic spiral structure 17, 544-546, 561, 
664-665 
pitch angle of arms in 561 
Galactic stellar disk §10.4 
Galactic warp 
in gas 565-568 
in stars 568 
galaxies with active nuclei 191, 244-255 — see 
also active galactic nuclei 
BL Lacs 249 
blazars 249 
LINERs 245 
Markarian galaxies 245 
N galaxies 248 
OVVs 249 
QSOs 249 
quasars 248 
radio galaxies 246 
Seyfert galaxies 245 
Galaxy models §10.6 
dynamical 683-684 
history of 5-10, 15-20
kinematic 683 
mass 680-682 
from star-counts 682-683 
Galaxy 
chemical evolution of 688-690 
circular-speed curve of 555-559 
colors of 614 
formation and evolution of §10.7 
local parameters of 679 
local surface brightness of 614-615 
luminosity density in 609 
luminosity of 614 
at optical wavelengths 614-615 
surface photometry of §10.1 
total mass of 681 
Tully—Fisher relation and 614 
Galilei, Galileo 3 
gas in galactic disks 
atomic hydrogen content of 494-496 
azimuthal distribution of 500-505 
bars and oval distortions in 500, 512-513 
cut-off density 499 
kinematics of 505-513, §9.1, 549-559, 
569-570, 580-593 
lop-sided distribution of 502-505 
molecular hydrogen content of 494-496 
radial density profile of 498-500 
spiral structure in 500-502 
Gaunt factor 460 
Gauss-Hermite series 703 
Gaussian constant, GM⊙, 744
GC 736 
General Catalog of Trigonometrical Parallaxes 737
General Catalogue of Photometric Data 738 
generalized ellipses 229 
giants 90 
luminosity and colors of 107 
physical characteristics of 110 
Gliese Catalog 130, 691, 739 
globular cluster systems §4.5 
color distributions of 239 
metallicity of 240 
Galactic 666-670 
radial density profiles and shapes of 238 
globular clusters 8, 151, §6.1
ages of 336, 344-350, 443 
from Δ(B - V) method 347
from ΔV method 346
compared with age of Universe 348 
from isochrone fitting 345 
from turnoff point 344 
variations between 349 
stellar kinematics of 371-377 
asymptotic giant branch in 334, 344 
binaries in 359-361 
central black holes in 371, 374 
central cusps in 369-371, 374 
color-magnitude diagrams of 20, 334-344 
concentration 329, 366 
class 331, 330 
core radius of 366 
disk 667 
halo 667, 680
chemical evolution of 688 
horizontal branch in 334, 337, 342, 352, 
354 
initial mass function of 356 
luminosities of 329, 330 
luminosity function of 236-237, 416 
luminosity profiles of 363-371 
luminosity segregation in 367-369 
main sequence in 334, 335-337, 340 
mass-to-light ratios of 60, 374 
metallicities of 329, 331, 350-352, 340 
millisecond pulsars in 359, 363 
neutron stars in 362-363 
Palomar clusters 328 
red giant branch in 334, 341 
rotation of 372 
shapes of 239, 328, 372 
spectral types of 329, 331 
stellar density in 354 
stellar luminosity function in 354-359, 
367 
stellar photometry of 332 
stellar proper motions in 375 
stellar remnants in 361-363 
stellar rotation rates in 354 
subgiant branch in 334, 335-337, 340 
tidal radius of 365 
turnoff point in 335 
(v/σ)* in 372
white dwarfs in 361-362 
X-ray binaries in 363 
Gould’s belt 10, 636 
grains 
albedo of 138 
silicate 135 
gravitational constant 744
gravitational lensing 22, 62-67, 398, 732-735
time delays as distance estimators 407
amplification by 66 
Einstein angle 63 
Einstein radius 63 
lensing equation for 65 
optical depth for 64 
strong 64 
surface brightness and 64 
weak 64 
gravithermal catastrophe 263
Great Attractor 440, 447-448 
Great Debate 10 
Greek astronomy 47 
GSC — see HST Guide-Star Catalog 
guiding center 543 
gyro-frequency 478 


H 


h Persei 378, 379
Hα line - see Balmer lines
Hale, George Ellery 10
Hayashi line 274-275, 283
HB - see horizontal branch
HD - see Henry Draper Catalog
HDE - see Henry Draper Catalog
HEAO 1 A-4 catalog 742
HEASARC 73-74 
heavy-element stars 92 
helium abundance 279, 480 
variations in globular clusters 353 
helium diffusion 262, 349 
helium flash 269, 342 
helium lines 92 
radio recombination 480-481 
Helmholtz time 324 
Henry Draper Catalog 737 
Hera, wife of Zeus 2 
Hernquist model 181 
Herschel 
Caroline 3 
John 3 
William 3, 5 
Hertzsprung gap 108, 265, 383 
Hertzsprung, Ejnar 20 
Hertzsprung-Russell diagram 102 - see also color-magnitude diagram
heteronuclear molecules, rotational transitions 474
HI — see 21cm line and atomic hydrogen 
high-velocity clouds 530 
high-velocity stars 16 
catalogs of 739 
metallicity of 676 
higher-order velocity moments 701 
HII regions 463 
metallicity of 516-518 
spiral arms and 665 
Hipparcos mission 23, 27, 33, 46, 119, 627,
632, 642, 680, 737
Hipparcos waveband 52 
Holmberg radius 62 
horizon 28 
horizontal branch 264-265 — see also red 
clump, RR Lyrae stars 
mass of stars on 269 
dependence on metallicity 277 
in globular clusters 334, 337, 342, 352, 
354 
lifetime on 280 
luminosity of 277 
zero age 265, 342 
hour angle, circle 29 
HR diagram 102 - see Hertzsprung-Russell diagram
HST — see Hubble Space Telescope 
HST Guide-Star Catalog 737 
Hubble Atlas 69 
Hubble constant 396, 744 
asymptotic value of 441-443 
Hubble diagram 444 
Hubble law 
photometric 187 
modified 180 
cosmological 396 
Hubble sequence 149-157 
barred and normal galaxies on 152 
and dwarf elliptical galaxies 150 
and dwarf spheroidal galaxies 150 
early- and late-type galaxies on 150
elliptical galaxies on 150 
Irr I and Irr II galaxies 155
irregular galaxies 155 
lenticular galaxies on 152 
spiral galaxies on 153 
and the tuning-fork diagram 150, 152 
Hubble Space Telescope 23 
instruments on 52 
Hubble stage 155, 220 
Hubble time 396, 744 
Hubble type 
bulge fraction and 220 
gas content and 493 
Hubble, Edwin 14 
Hubble—Oemler law 187 
Huggins, William 4 
Hyades 42, 79, 379 
hydrodynamical simulations of Galaxy 592 
hyperfine levels 467 
hypertext 72 


I


IC - see also Index Catalogue
IC 10 171 
IC 741 
IC 1459 716 
IC 1613 171 
IC 2391 389 
IC 2531 218 
IC 5240 156 
ICRS — see International Celestial Reference 
System 
IMF - see initial mass function
Index Catalogue 4, 13, 67, 741 
inclination, galactic 182 
inflation 444 
Infrared Astronomical Satellite 24, 244, 486, 
568, 570, 624 
infrared cirrus 24, 574 
infrared luminosity 
and radio continuum in disk galaxies 
496-497 
infrared sources, catalogs 742 
initial luminosity function 284 
initial mass function 283-287 
and chemical evolution 688-689 
of Galactic disk 283-287, 390 
of globular clusters 356 
of open clusters 390 
Salpeter 286 
Scalo 287 
instability strip 264, 265, 289 
instantaneous recycling approximation 307 
intercombination transitions 466 
interferometry 19 
intensity 83 
phase 82-83 
speckle 83-84 
intergalactic gas §8.4 
International Celestial Reference System 38, 
642 
international photographic system 49 
International Ultraviolet Explorer 406, 453
interstellar clouds 281 
interstellar matter — see interstellar medium 
interstellar medium 7, 451 
γ-ray observations of 482
cooling rate for 310 
detection of §8.1 
in disk galaxies §8.2 
in elliptical galaxies §8.3 
hot component of 24, 311 
local density of 662 
opacity of in extreme ultraviolet 459 
optical emission lines from 463 
radio observations of 468-482 
small-scale structure of §9.6 
X-ray emission from 459-463 
intracluster medium 532 
intrinsic kinematic misalignment 715 
inverse problems 122 
inverse-Compton scattering 483 
IRAS - see Infrared Astronomical Satellite 
iron peak nuclides 298 
Irr I galaxies 155 
Irr II galaxies 155, 241 
irregular galaxies 155 
luminosity function of 167 
IRS 737 
island universes 3 
ISM - see interstellar medium 
isochrone 340 
fitting 345 
isophotes 
a4/a 199, 
boxy 178, 223 
deviations from ellipses in 199 
disky 178 
generalized ellipse 229 
peanut-shaped 223 
shapes of 178-179 
twists in 184, 198 
isopleths 189 
IUE — see International Ultraviolet Explorer 


J 


Jaffe model 181
Jansky, Karl 17
Jansky 53
Java 72
Jeans equations 683
jets 241
and unified model of AGN 251, 255
in elliptical galaxies 204
in radio galaxies 246
stellar 283
Johnson system 50, 738


K 


K stars 92
K giants in Galactic bulge 621-622
K-correction 57
Kant, Immanuel 3
Kapteyn, Jacobus 6
Kapteyn Universe 6, 9, 15
KDCs - see kinematically-distinct cores
Keck Telescope 22 
Kepler’s third law 76, 80 
kinematic major axis 507 
kinematic mapping 
disk galaxies 722-723 
elliptical galaxies 713-716 
kinematic minor axis 506 
kinematic misalignment 715 
kinematically-distinct cores 716 
kinematics 
gaseous 
of disk galaxies §8.2 
Galactic §9.2 
Galactic center §9.4 
stellar Chapter 11 
of the solar-neighborhood §10.3 
King model 365, 386 
multi-mass 366, 372 
King radius 366 
Kirchhoff’s law 483 
konus densities 182 
kurtosis 702 


L 


L-S coupling scheme 467 
(l, v) plots 536
axisymmetric expansion in 541 
naive 536-540 
distances from 540-541 
non-circular motion in 541 
oval distortions in 542 
random motions in 546-548 
spiral arms in 544-546, 561-562 
tangent point in 539, 555-556 
terminal velocity in 539, 555-556 
traces in 537 
LANL E-print Archive 73, 74 
Large Magellanic Cloud 155, 156, 171, 238, 
239, 530 
distance to 403, 434-435, 450 
globular clusters in 350, 434 
Las Campanas Redshift Survey 165, 741 
late-type galaxies 150 
leaky-box model 308-312, 678, 688 
lenses 157, 233 
lensing equation 65 
lenticular galaxies 152 
barred 153 
dust in 514 
gas content of 513-516 
luminosity function of 167 
metallicity of 690 
shapes of 212-213 
Leo A Irregular Galaxy 171 
Leo I, Leo II Dwarf Galaxies 171
LGS 3 Irregular Galaxy 171 
LHS 736 
light curve 84, 87 
light-year 32 
Liller 1 329 
limb darkening 449 
limiting magnitude 111
Lindblad resonance 588 
Lindblad, Bertil 15 
line blanketing 55, 340 
line of nodes 195 
line-of-sight velocity distribution 694, 700-705
line profiles, stellar kinematics from §11.1 
line-strength indices — see spectral indices 
LINERs 245 
hosts of 250 
LMC - see Large Magellanic Cloud 
local bubble 579 
local circular speed 551, 614, 679-680 
local disk 
mass density of 656-664 
mass-to-light ratio of 662, 679 
local escape speed 692 
Local Group 169-172, 680 
distances within 432-437 
local interstellar medium density 662 
local sidereal time 29 
local standard of rest 536, 627, 692 
Local Supercluster 440 
logarithmic spiral 226, 501 
long distance scale 429 
long-slit spectrograph 705 
longitude-velocity plots - see (l, v) plots
lop-sidedness 
of barred galaxies 234 
of gas in disk galaxies 502-505 
Lorentz factor 478 
Lorentzian profile 456 
LOSVD - see line-of-sight velocity distribution
low surface brightness galaxies 170, 221 
low-ionization nuclear emission-line regions — 
see LINERs 
low-luminosity galaxies 164, 186 
Lowell Proper Motion Survey 736 
LSB — see low-surface brightness galaxies 
LSR - see local standard of rest 
luminosity 56 
luminosity density of the Universe 164 
luminosity distance 398 
luminosity function 
of cluster galaxies 165-167 
as a distance estimator 415-418 
of dwarf elliptical galaxies 168 
of elliptical galaxies 167 
of field galaxies 162 
of galaxies 162-169 
of globular clusters 236-237, 416 
in the Galaxy 669 
of irregular galaxies 167 
of lenticular galaxies 167 
of MK spectral classes 127 
of open clusters 119, 389-392 
of planetary nebulae 417-418 
Schechter 163-164, 166 
of spiral galaxies 167 
stellar §3.6, 110 
as function of Galactic height 652 
in globular clusters 354-359, 367
in open clusters 389 
luminosity segregation 
in globular clusters 367-369 
in open clusters 387-388 
lunar occultations 84 
luni-solar precession 36 
Lutz-Kelker Bias 115-119 
Lyman lines 455, 464 


M 


M - see also Messier Catalogue
M3 329, 330, 334, 335, 349, 351, 360, 365, 371
M4 329, 357, 361
M5 357, 375, 376
M11 379, 388
M13 349, 357, 377
M15 329, 335, 345, 357, 358, 359, 369, 370, 372, 374
M19 328, 330
M30 367
M31 3, 11, 15, 171, 190, 223, 239, 498, 511, 521
distance to 435
novae in 420
planetary nebulae luminosity function of 418
M32 150, 171, 190, 191, 718, 720 
M33 171, 238, 411, 512 
M41 383 
M44 379 
M45 379, 380 — see also Pleiades 
M49 239 
M51 4, 224, 228, 500, 521 
M67 383 
M68 357 
M81 224, 491, 501 
M83 510 
M87 151, 238, 240, 246, 717, 718 
M100 224, 439 
M101 502 
M104 722 
M stars 92 
Magellanic Clouds 689 - see also Large Magellanic Cloud, Small Magellanic Cloud
Magellanic irregular galaxies 156 
Magellanic Stream 530-531 
magnetic fields 
Davis—Greenstein effect in 140, 520 
in disk galaxies 520-522 
effect on dust grains of 140 
Galactic 481, 565 
in pulsars 88 
magnitude §2.3 
absolute 56-57 
apparent 47-52 
bolometric 56, 58-60 
limiting 111 
magnitude-limited 111 
main sequence 108, 263, 264
fitting 336, 383 
in globular clusters 334, 335-337, 340 
lifetime on 280 
luminosity on 280 
metallicity dependence of 276 
in open clusters 381 
stars 
luminosity function of 129 
mass—luminosity relation for 109 
mass-radius relation for 109
physical properties of 109-110 
theoretical 260 
Malmquist bias 111-115, 125, 128, 661 
Markarian galaxies 245, 742 
Mrk 231 496 
mas — see milli-arcsecond 
masers 411 
mass loss, stellar 265, 273, 275 
mass-cut 304 
mass-luminosity relation, stellar 284 
mass segregation 
in open clusters 388 
in globular clusters 367-369 
mass-to-light ratio 60, 125 
local 125, 662, 679 
MCG 741 
mean velocities, stellar 697-700 
mergers 242-244, 320 
meridian 28 
meridian circle 37 
Messier Catalog 3, 328, 377 
Messier, Charles 3 
metal lines from interstellar matter 464-468
metal production 
by low mass stars 301-302 
by supernovae 303-306 
metallicity 97, 306 
degeneracy with age of 322-324 
of disk galaxies 516-520 
and correlation with luminosity 519 
of elliptical galaxies 193 
of Galactic bulge 622 
of globular clusters 340, 350-352, 353, 
667, 670 
of globular-cluster systems 240 
of high-velocity stars 676 
of nearby stars 643-650 
of open clusters 381 
from spectral indices 322-324 
metals 21, 93 
Mg2 194, 204, 322
Michie model 373 
multi-mass 373 
Michigan Catalog of Spectral Types 737 
microlensing — see also gravitational lensing 
as probe of Galactic bar 619-621 
Milky Way 1 - see also Galaxy and Galactic 
milli-arcsecond 33 
millisecond pulsars 363 
mini-spiral 595 
minor-axis rotation in elliptical galaxies 713 
Mira variables 95, 292 
as probes of Galactic bar 620
missing mass problem 19 - see also dark matter
mixing length theory 261 
MK luminosity classes, calibration of 104 
MK spectral classes 109 
luminosity and color of 104-108 
luminosity functions of 127 
MK system 90-94 
luminosity classes in 90, 92 
molecular bands 92, 457 
molecular clouds 
Galactic 575 
mass of 600 
molecular fraction in 458 
scattering of stars by 650 
small-scale structure in 598-603 
and spiral structure 561 
velocity dispersion of 599 
molecular ring 555 
molecules 
H2 137, 456-458 - see also X factor
heteronuclear 474-478 - see also CO emission
spectral lines from 456 
moments of the velocity distribution 702-703 
Moon, mass of 77 
moving groups 386, 636 
moving-cluster method 40-42
MS — see main sequence 
multi-mass King model 366, 372 


N 


N galaxies 248 
Narrabri interferometer 83 
NASA Atlas 69 
NASA Extragalactic Database 72 
natural width 456 
NCP - see north celestial pole 
nearby stars 
α elements in 643
[α/Fe] for 647
abundances in 643 
correlation with age 644-650 
ages of 643-650 
[Fe/H] for 644 
kinematics of §10.3
metallicity of 643-650 
s-process elements in 643 
nebulae 3, 380 — see also planetary nebulae, 
spiral nebulae 
catalogs of 740 
NED - see NASA Extragalactic Database 
neutrinos, production in supernovae of 303 
neutron poisons 300 
neutron stars 87, 268, 271 
in globular clusters 362-363 
New General Catalogue 4, 13, 741 
NFW law 510
NGC - see also New General Catalogue
NGC 67 
NGC 104 329 
NGC 147 150, 171, 237 
NGC 185 171 
NGC 188 379, 385 
NGC 205 150, 171 
NGC 214 217 
NGC 283 191 
NGC 288 329, 338, 349, 352, 354, 360 
NGC 362 329, 338, 349, 352, 354 
NGC 462 191 
NGC 488 725, 729 
NGC 596 191 
NGC 628 154 
NGC 720 188, 525 
NGC 741 
NGC 753 508 
NGC 801 508 
NGC 855 496 
NGC 869 379 
NGC 884 379 
NGC 891 12, 217, 497 
NGC 1055 233 
NGC 1068 252, 253 
NGC 1199 188 
NGC 1209 188 
NGC 1275 240 
NGC 1291 155 
NGC 1300 155, 231, 234 
NGC 1381 232 
NGC 1395 188 
NGC 1426 188 
NGC 1543 156 
NGC 1700 185 
NGC 1851 329, 338 
NGC 1866 434 
NGC 2023 487 
NGC 2024 476 
NGC 2068 487 
NGC 2264 379, 384 
NGC 2266 383 
NGC 2366 496 
NGC 2420 359, 391 
NGC 2477 359, 391 
NGC 2516 383, 389 
NGC 2632 379 
NGC 2685 515 
NGC 2808 330, 331, 369 
NGC 2811 154 
NGC 2841 154, 215, 498, 524 
NGC 3109 171 
NGC 3198 509 
NGC 3201 372, 374 
NGC 3377 151 
NGC 3379 151, 236 
NGC 3593 727 
NGC 3626 726 
NGC 3628 533 
NGC 3680 392 
NGC 3898 215 
NGC 3921 320 
NGC 3923 202 
NGC 3992 19 
NGC 4194 496
NGC 4254 524
NGC 4258 412
NGC 4261 151
NGC 4303 516, 518
NGC 4371 153, 229
NGC 4406 713, 716
NGC 4472 239, 461
NGC 4477 229
NGC 4486 151, 238
NGC 4548 235
NGC 4550 726
NGC 4565 220
NGC 4570 218
NGC 4571 439
NGC 4622 226
NGC 4755 382, 383
NGC 4881 189, 417
NGC 4976 153
NGC 5018 711
NGC 5033 490
NGC 5053 329
NGC 5128 146
NGC 5139 329
NGC 5152 228
NGC 5194 215
NGC 5266 528
NGC 5272 329
NGC 5383 512
NGC 5746 223, 233
NGC 5831 199
NGC 5907 504
NGC 5982 716
NGC 6121 329
NGC 6356 668
NGC 6397 359
NGC 6558 369
NGC 6624 369
NGC 6674 499
NGC 6705 379
NGC 6752 362
NGC 6782 156
NGC 6791 379
NGC 6822 171
NGC 6861 153
NGC 6864 369
NGC 6946 498
NGC 7023 486
NGC 7078 329
NGC 7252 243, 320
NGC 7261 379
NGC 7331 498
NGC 7332 722
NGC 7741 155, 231
NGC 7793 154
NGP - see north galactic pole
NICMOS 52
night sky brightness 173-176
NLTT 736
non-optical sky surveys 70
north celestial pole 28
north galactic pole 30
Northern Proper Motion Catalog 38, 736
mean 36
novae 11, 87, 96
luminosities of 419
as standard candles 420
NPM - see Northern Proper Motion Catalog
nuclear molecular ring 595
nuclear physics 259-260, 296-301
nuclides 296
α 297
iron peak 298
primary 297
r-process 300
radioactive decay of 296
s-process 298
secondary 298
stability band or valley for 296-297
nutation 35-36, 36
NVSS 742


O


O stars 92, 134
OB stars 134
as tracers of Galactic warp 568
OB associations 378
heating dust 574
OH/IR stars 95, 266
as Galactic bar probes 620
as Galactic bulge probes 623
on-line resources 71-74
Oort constants 637-642, 679, 681, 691
A 639
B 640
determination of 641
values for 642
Oort, Jan 16
Oosterhoff classes 337, 338
open clusters 8, §6.2
ages of 379, 384-386
color-magnitude diagram for 20, 381-384
concentrations of 386
core radii of 386
demise of 384-386
diameters of 379
initial mass function of 390
luminosities of 377, 379
luminosity functions in 119, 389-392
luminosity profiles of 386-387
luminosity segregation in 387-388
main sequences of 381
mass segregation in 388
metallicities of 379, 381
old 379, 384, 385-386, 651
metallicities of 651
proper motions in 388
rotation curve from 557
sizes of 377
and spiral arms 665
stellar kinematics of 388-389
third parameter in 384
tidal radii of 386
Trumpler classifications of 378, 379, 386
optical depth 469
optical doubles 78, 360
optical sky surveys 69
optically violently variable quasars 249
orbits 
apocenters of 543 
cusped 588 
in elliptical galaxies 712-713 
epicycle 543 
guiding centers of 543 
inner Lindblad resonance of 588 
pericenter of 543 
spiral structure and 544 
x1, x2 588
Orion B cloud 575 
Orion—Cygnus arm 561, 665 
oval distortions in disk galaxy gas 500, 512-513
OVVs - see optically violently variable quasars


P 


P-Cygni profile 101 
p-modes 288 
p-p chain 259 
PAH — see polycyclic aromatic hydrocarbon 
pair-production instability 271 
Pal - see also Palomar clusters
Pal 7 330, 331
Pal 11 668
Pal 12 349
Palomar clusters 152, 328 
Palomar sky survey 69 
parallactic ellipse 32 
parallax 23, 31-34, 32, 105
absolute 33
photometric 32
relative 33
trigonometric 32
by moving-cluster method 41
secular 42
parsec 6, 32, 744
Parsons, William, Third Earl of Rosse 4
Paschen spectral lines 464
pattern speeds 231, 543
in spiral structure 501
pc — see parsec 
peculiar velocities 439, 447 
Pegasus Irregular Galaxy 171 
pericenter 543 
period-luminosity relation 289 
period—luminosity—color relation 291 
permeability of space 744 
permittivity of space 744 
Perseus arm 562, 600, 665 
Perseus cluster 240 
Perseus—Pisces supercluster 448 
PGC 741 
Phoenix Irregular Galaxy 171 
photometry 47
of disk galaxies §4.4
of elliptical galaxies §4.3
photovisual system 49 
physical constants 744 
pitch angle
of electron orbit 478 
of spiral arms 226 
Galactic 561 
PKS 1413+135 250
PL relation — see period—luminosity relation 
Planck function 58 
Planck’s constant 744 
planetary nebulae 4, 267, 302 
Galactic rotation curve from 557 
luminosity function of 417-418 
plasma frequency 482 
PLC relation — see period—luminosity—color 
relation 
Pleiades 42, 379, 380, 390 
Plummer law 655 
point-spread function 176 
Poisson’s equation 657, 691 
Poisson’s integral 735 
polar-ring galaxies 515 
polarization 
by dust 140 
and the Davis—Greenstein effect 140 
polycyclic aromatic hydrocarbons 487 
population I, II 20 
population III 21 
position angle 30 
position-velocity diagrams 705-706, 722, 726 
positional astronomy §2.1 
absolute 37 
relative 37 
POSS-I, POSS-II 70 
PPM 736 
Praesepe — see M44 
pre-enrichment 688-690 
pre-main-sequence evolution 281-283 
precession 35-36 
principal component analysis 208 
projected kinematic misalignment 715 
proper motions of stars 6, 34-35 
in the bulge 624 
near the Galactic center 596 
in globular clusters 375 
Oort constants from 640 
in open clusters 388 
and the solar motion 625 
proton mass 744 
protostars 282 
pulsars 87-88 
magnetic field of 88 
millisecond 363 
as probes of Galactic ISM 574-577 
pulsating stars 287-296 - see also δ Scuti
stars, Cepheids, Mira variables, RR
Lyrae stars, W Virginis stars
κ-mechanism in 288
first overtone mode of 288
g-modes in 288
and the instability strip 289 
p-modes in 288 
period-luminosity relation for 289 
pulsation constant 289, 325 
PV diagram — see position—velocity diagram 


Q


QSO - see also quasi-stellar objects
QSO 0957+561 409
QSO 2237+031 410
quasars 594, 605
catalogs of 742
hosts of 250
X-ray background contribution from 580
quasi-stellar objects 249
catalogs of 742
hosts of 250-251
and the unified model 253
quasi-stellar radio sources - see quasars


R


R^(1/4) law 186, 219, 256
effective radius of 186
effective surface brightness of 204-205
r-process nuclides 300 
radial distributions of HI and CO 559-560 
radio continuum 
correlation with infrared luminosity in 
disk galaxies 496-497 
from active galaxies 530 
radio frequency bremsstrahlung 480 
radio galaxies 246 
broad-line 247 
FR I and FR II 246 
hot spots in 246 
jets in 246 
lobes in 246 
narrow-line 247 
radio detection of interstellar matter 468-482 
radio recombination lines 480-481 
radio sources 
catalogs of 742 
as standard rulers 445 
radioactive decay 296 
radius 
anisotropy 373 
core 366, 386 
de Vaucouleurs 62 
effective 186 
Einstein 63 
in Galaxy from kinematics 540-541 
Holmberg 62 
isophotal 61-62 
King 366 
stellar 82 
tidal 365, 386 
Rayleigh scattering 7 
RC — see red clump 
RC3 741 
recombination lines 480-481 
red clump 103, 264, 265, 278, 337 
stars as probes of Galactic bar 620 
red giant branch 106, 109, 264-265 
dependence on metallicity of 276 
in globular clusters 334, 341 
stars 109-110 
red giants 20 
reddening 7, 132, 133 
line 134 
reddening-free indices 138
redshift 39 
reflection nebulae 132, 138, 740 
relaxation time 387 
remnants, stellar 272-274 — see also white 
dwarfs, neutron stars, black holes 
RGB - see red giant branch 
Richardson—Lucy algorithm 180, 182, 618 
right ascension 29 
rings 156-157 
in barred galaxies 233-234 
Galactic molecular 555 
inner 157 
nuclear 157 
outer 157 
ripples 202, 204 
Robertson—Walker metric 397 
ROSAT Satellite 249, 579 
catalogs from 742 
Rosse, Third Earl of 4 
rotation curves — see circular-speed curves 
rotation measure 481, 522 
rotational transitions in molecules 474 
RR Lyrae stars 95, 265, 293-294, 337, 400, 
433, 680 
Bailey types of 294 
Galactic center distance from 294 
luminosity dependence on metallicity of 
295 
metal-poor 671, 673 
period-luminosity relation for 295 
as standard candles 415 
RSA 741 
Russell, Henry Norris 20 


S 


S stars 92, 101 
s-process nuclides 298 
in nearby stars 643 
S0 galaxies — see lenticular galaxies 
Sagittarius arm 561 
Sagittarius—Carina arm 561, 665, 666 
Sagittarius Dwarf Galaxy 170, 171 
Sagittarius Dwarf Irregular Galaxy 171 
Salpeter initial mass function 286 
in globular clusters 356-359 
in open clusters 391 
SAO Catalog 736 
satellite galaxies 
accretion of Galactic 686 
SB0 — see lenticular galaxies, barred 
Scalo initial mass function 287 
Schechter function 163-164, 166 
Schmidt law 524 
Schwarzschild criterion 260 
Schwarzschild distribution 632, 683 
Schwarzschild radius 597, 720 
Scorpio—Centaurus group 42 
SCP - see South Celestial Pole 
Sculptor Dwarf Galaxy 171 
Sculptor system 151 
sdG stars 93 
SDSS — see Sloan Digital Sky Survey 
sech² law 217, 612 
second parameter problem 352 — see also 
third parameter problem 
secular parallaxes 42-45 
seeing 35, 88, 176-178, 332 
selected areas 6 
semi-forbidden transitions 466 
sequences 103 — see also asymptotic giant 
branch, horizontal branch, main sequence, 
red clump, red giant branch, 
subgiant branch and white dwarf 
cooling sequence 
SERC Equatorial Extension Survey 69 
Sextans Dwarf Galaxy 171 
Sextans A, B Irregular Galaxies 171 
Seyfert galaxies 245 
hosts of 250 
type 1 245, 249, 253 
type 2 245, 253 
X-ray background contribution from 580 
SGB - see subgiant branch 
SGP — see south Galactic pole 
Sgr A* 30, 595, 623, 642, 679 
luminosity of 596 
mass of 596 
Sgr A East, West 595 
Sgr B2 432, 587 
Shapley, Harlow 8 
Shapley—Ames Catalog 150 — see also RSA 
shear 641 
shells 202 
short distance scale 429 
sidereal year 744 
SIMBAD 72, 73 
Sirius 52 
skewness 702 
Sloan Digital Sky Survey 69, 71 
Small Magellanic Cloud 156, 171, 234, 239, 
530 
SMC - see Small Magellanic Cloud 
solar 
absolute magnitude 53, 56, 744 
abundances 100 
luminosity 53, 56, 744 
mass 77, 744 
radius 744 
solar motion 43, 624-628 
antapex of 625 
apex of 625 
solar-neighborhood stars — see nearby stars 
Sombrero Galaxy — see M104 
source function 469 
south celestial pole 28 
south Galactic pole 30 
Space Shuttle 609 
spallation 482 
specific frequency of globular clusters 237, 
238 
specific intensity 468 
speckle 83 
spectra 
classification of 88, 89 - see also classification, 
spectra 
early type 91 
late type 91 
physical interpretation of §3.4 
spectral indices 99 
age determination from 322-324 
Ca II 92 
in elliptical galaxies 204 
Hβ 193 
metallicity determination from 322-324 
Mg2 194, 322 
Z-sensitivity of 322 
spectral lines 
Doppler broadening of 455, 694 
equivalent width of 88 
forbidden 465, 466 
intercombination 466 
Lorentzian profile 456 
molecular 456 
natural width of 456 
P-Cygni profile 101 
saturated 454 
semi-forbidden 466 
spectral type 89 
spectral velocity 694 
spectrograph 
fiber-fed 706 
long slit 705 
spectrophotometry 88 
spectroscopic notation 467 
spectroscopic term 467 
speed of light 744 
spider diagram 506 
spin temperature 472 
of Galactic HI 552-553 
Spindle Galaxy 515 
spiral arms 153, 157 — see also spiral structure 
arm classes of 157 
in disk galaxies 224-228 
flocculent 224 
Galactic tracers of 664 
grand-design 224 
in ISM 560 
(l, v) plots and 544-546 
pitch angle of 226, 501 
scattering of stars by 650 
spiral galaxies 15 
21cm line profiles of 423, 503 
distances from kinematics of 422-425 
early-type 154 
late-type 154 
luminosity function of 167 
normal 153 
shapes of 213-214 
spiral nebulae 5, 10-15 
spiral structure 5 — see also spiral arms 
in gas of disk galaxies 500-502 
orbits in 544 
pattern speed of 501 
vertex deviation and 637 
SR — see standard of rest 
SRS 736 
stability band or valley 296-297 
standard candles 169, 395, 415-422, 445-446 
standard of rest 637 
standard rulers 395, 445-446 
star clusters 4, Chapter 6 — see also globular 
clusters, open clusters 
catalogs of 740 
convergent points of 40 
star counts 607, 651-654 
Galactic models from 5-10, 682-683 
star formation 281-283 
in disk galaxies 522-525 
star gauging 5 
star streams 634-636 
Hyades stream 635 
Pleiades stream 635 
Sirius stream 635 
starburst galaxies 241-244 
stars 
classification of 87 
high-velocity 676 
initial masses of 267-274 
lifetimes of 279-281 
luminosities of 279-281 
mass-luminosity relation for 284 
masses of §3.1 
metal-poor 670-673 
pulsating — see pulsating stars 
radii of 82 
remnant masses of 272 
variable — see pulsating stars 
weak-lined 91 
statistical parallaxes 45-46 
of globular cluster stars 376 
Stefan’s law 86 
Stefan-Boltzmann constant 744 
stellar birthrate parameter 321, 523 
stellar kinematics Chapter 11 
of bulges 723-724 
of cores of ellipticals 716-721 
of disk galaxies §11.3 
of elliptical galaxies §11.2 
techniques for measuring §11.1 
stellar populations 20-21 
colors of 318-319 
evolution of §5.4 
luminosity of 315-318 
mass-to-light ratio of 318 
metallicity distribution of 308 
metallicity/age degeneracy in 322-324 
stellar radii 82, 109, 399 
stellar remnants — see also white dwarfs, neutron 
stars, black holes 
stellar rotation rates in globular clusters 354 
stellar structure equations 259 
stellar velocity distributions §11.1 
higher order moments 702-703 
mean velocities 697-700 
velocity dispersions 697-700 
stochastic heating of dust 486 
Strömgren sphere 463 
Strömgren uvbyβ photometric system 55, 140, 
142, 643, 738 
subdwarfs 93, 276, 336, 673 
subgiant branch 103, 264, 265 
in globular clusters 334, 335-337, 340 
Sunyaev—Zel’dovich effect 403-405 
supergiants 90 
luminosities and colors of 108 
Supernova 1987A 304, 435 
Baade—Wesselink distances to 403 
ring around 406-407 
supernovae 96, 270, 302-306, 598 
core-collapse 303, 644 
Baade—Wesselink distances to 402-403 
Galactic 577 
metal production in 303 
neutrino production by 303 
type I 272, 302 
type Ia 302, 644 
luminosities of 420 
as standard candles 421, 445 
type Ib 302 
type Ic 303 
type II 302 
surface brightness 61, §4.2 
of bulges 222-223 
of disk galaxies 215-216 
effective 204-205 
of elliptical galaxies 185-193, 204 
fluctuations in 426-429 
and gravitational lensing 64 
surface gravity 82, 94, 109 
surface photometry §4.2 
surveys 67-70, 549-550, 554, Appendix B 
of CO in the Milky Way 553 
effective volume of 144 
ESO/SERC Southern 69 
Franklin-Adams 69 
of HI in the Milky Way 550 
kinematically selected 673-676 
Leiden—Dwingeloo-IAR 550 
non-optical 70, 742 
objective prism 249 
optical 69 
Palomar 69 
photometrically complete 119 
proper-motion selected 120 
redshift 163, 741 
SERC Equatorial Extension 69 
Sloan Digital 69, 71 
volume-limited 113 
synchrotron radiation 58, 478-480, 576 
from AGN 253 
correlation with CO 579 
from disk galaxies 497 
filaments 594 
from the Galactic disk 575, 577-579 
synchrotron self-absorption 479 
systemic velocity 507 
SZ effect — see Sunyaev—Zel’dovich effect 


T 


T associations 378 
T Tauri stars 95, 283, 378, 380 
tangent point 539 
telnet 72 
temperature 
color, relation to 262 
effective 94, 97, 98, 109 
spin 472 
of Galactic HI 552-553 
template mismatch 699 
terminal velocity 539, 680-681 
curve 555, 562, 563 
thermonuclear fusion 259 
thick disks 218 
third parameter problem 352-354 
in open clusters 384 
Thomson scattering 257, 325 
cross-section for 479, 744 
Thuan-Gunn photometric system 52-53 
tidal radius 
of globular clusters 365 
of open clusters 386 
time delay 
gravitational lens systems 407-410 
Supernova 1987A 407-407 
traces 537 
triaxiality in ellipticals 
photometric evidence for 183-185, 198-199, 201 
kinematic evidence for 713-716 
tropical year 29 
Trumpler classification 378, 379, 386 
truncated Gauss-Hermite series 703 
Tucana Dwarf Galaxy 171 
Tully-Fisher relation 422-425 

for the Galaxy 614 
tuning-fork diagram 150, 152 
turnoff point 335, 344 
Tycho Catalogue 737 


U 


U component of solar motion 627 
UBVRI photometric system 53 
UBV photometric system 50 
UGC 741 
ultra-violet excess 277 
ultraviolet-blue-visual system 50 
Unified Resource Locators 72 
Universe, age of 396, 443, 448, 449 
unresolved Gaussian decomposition 704 
unsharp masking 179 
URL — see Unified Resource Locators 
Ursa Major group 42 
Ursa Minor Dwarf Galaxy 171 


V 


V component of solar motion 627 
(v/σ)* 204, 709, 710 
van de Hulst, H.C. 17 
van Maanen, Adriaan 14 
variable stars — see δ Scuti stars, Cepheids, 
Mira variables, pulsating stars, RR 
Lyrae stars, W Virginis stars 
VE — see vernal equinox 




Vega 52, 58 
velocities 
forbidden 538, 583, 586, 592 
heliocentric 39 
radial 38, 39, 43 
from absorption-line spectra §11.1 
in Galactic disk 537 
and Oort constants 639 
and solar motion 625 
random 
in bulges 723-724 
in disk galaxies 727-729 
in elliptical galaxies 707-713 
of stars near the Sun 629-637 
redshift 39 
space 34, 38 
spectral 694 
transverse 34 
velocity dispersions 697-700 
of bulges 723-724 
of disk galaxies 727-729 
of elliptical galaxies 707-713 
of Galactic bulge 622 
of Galactic disks 656 
of Galactic halo 673 
of stars near the Sun 629-637 
velocity ellipsoid 631 
of disk galaxies 728 
local 660 
velocity fields in disk galaxies 505-513, 725-727 
velocity-dispersion tensor 712 — see also velocity 
ellipsoid 
vernal equinox 29 
mean 36 
vertex deviation 630 
causes of 636-637 
Very Large Array 19, 488 
very long baseline interferometry 411 
vignetting 174 
Virgo Cluster 163, 418 
distance to 438-439, 440 
gas content of galaxies in 495 
Virgocentric infall 440, 441 
VLA — see Very Large Array 
VLBI — see very long baseline interferometry 
vorticity 641, 691 


W 


W component of solar motion 627 
W Virginis stars 96, 293 
wA stars 93 
warps 
of disk galaxies 505, 510-512 
Galactic 565-568 
water masers 411-413, 432 




wavebands 48-57 
Cousins 51 
DDO 55 
effective wavelength of 49 
FWHM of 49 
Hipparcos 52-53 
Johnson 50 
SDSS 53 
Strömgren 55 
Thuan-Gunn 52-53 
Tycho 52-53 
UBVRI 53 
UBV 50, 51 
WC stars 94 
Westerbork Synthesis Radio Telescope 19, 
488, 499 
WFPC2 52 
wG stars 93 
Whirlpool Galaxy — see M51 
white dwarfs 93, 103, 268 
cooling sequence of 264, 281, 267 
in globular clusters 361-362 
luminosities of 108 
wind, stellar 101, 261, 265 
WLM Irregular Galaxy 171 
WN stars 94 
Wolf-Rayet stars 94, 95, 101 
at Galactic center 596 
World Wide Web 72 
worms 598 
Wright, Thomas 3 
WWW - see World Wide Web 


X 


X, hydrogen abundance 101 
X factor for CO 478 
from γ-rays 601 
from A_V 602 
from virial masses 601 




X-ray astronomy 23 
catalogs 742 
X-ray binaries 96 
in globular clusters 363, 367 
X-ray emission 458-463 
bremsstrahlung 460 
from clusters of galaxies 404 
from elliptical galaxies 525 
Galactic diffuse 579-580 
from interstellar matter 459-463 
line radiation 461 
spectra 461 


Y 


Y 101 - see also helium abundance 
Yale Bright Star Catalog 739 
year, tropical 29 
yield 307 
effective 520, 688, 690 
young star distribution 664-666 


Z 


Z 101 - see also metallicity 
Z-sensitivity of spectral indices 322 
ZAHB - see zero-age horizontal branch 
ZAMS - see zero-age main sequence 
zenith 28 
zenith distance 28 
zero-age horizontal branch 265, 342 
zero-age main sequence 106 
Zodiacal light 173 
zone of avoidance 13, 140 